Skip to content

Pull requests: vllm-project/vllm-gaudi

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Update README.md
#301 opened Oct 2, 2025 by cabelo Loading…
Update codeowners and testowners
#299 opened Oct 1, 2025 by afierka-intel Loading…
Port: Add assert for empty buckets
#298 opened Oct 1, 2025 by iboiko-habana Loading…
Fix Llama 405B docker loading issue
#296 opened Oct 1, 2025 by nngokhale Loading…
Adding docs for defragmenter and sampler warmup
#278 opened Sep 26, 2025 by ksmusz Loading…
Add unified attention Granite-8b test
#277 opened Sep 26, 2025 by kzawora-intel Loading…
[Docs] CI failures chapter
#276 opened Sep 26, 2025 by adobrzyn Loading…
Add Unified Attention docs
#275 opened Sep 26, 2025 by madamczyk-intel Loading…
Convert padding itertools.islice to list
#264 opened Sep 25, 2025 by malsbat Loading…
Update long context README
#256 opened Sep 25, 2025 by iboiko-habana Loading…
enable p2d4
#253 opened Sep 24, 2025 by hsubramony Draft
Support DP for unified attention
#242 opened Sep 24, 2025 by wuxun-zhang Loading…
Fix calculating used blocks in unified attn
#232 opened Sep 23, 2025 by madamczyk-intel Loading…
KV cache sharing
#223 opened Sep 22, 2025 by jakub-sochacki Draft
[DO NOT MERGE] Update README.md
#220 opened Sep 22, 2025 by kzawora-intel Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.