Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Core] Bookkeeping optimization: Vectorize updates v1
#25801 opened Sep 27, 2025 by Jialin Loading…
3 of 5 tasks
[Bugfix] Add missing image_size for phi4_multimodal ready ONLY add when PR is ready to merge/full CI is needed
#25796 opened Sep 27, 2025 by Renovamen Loading…
[Misc] Update openai client example file for multimodal documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#25795 opened Sep 27, 2025 by ywang96 Loading…
5 tasks
Add filtering for chat template kwargs frontend ready ONLY add when PR is ready to merge/full CI is needed
#25794 opened Sep 27, 2025 by russellb Loading… v0.11.0 Cherry Picks
[DRAFT][DO NOT REVIEW] Compile once per block qwen Related to Qwen models
#25792 opened Sep 26, 2025 by Lucaskabela Draft
5 tasks
perf: optimize rejection sampling triton kernel v1
#25791 opened Sep 26, 2025 by happierpig Loading…
5 tasks
[gpt-oss] disable tool server initialization if no tool in request frontend gpt-oss Related to GPT-OSS models
#25790 opened Sep 26, 2025 by qandrew Loading…
[Bugfix] Fix hang with DP+EP on B200 v1
#25789 opened Sep 26, 2025 by alexm-redhat Loading…
[Benchmark] Cleanup deprecated nightly benchmark and adjust the docstring for performance benchmark ci/build documentation Improvements or additions to documentation performance Performance-related issues
#25786 opened Sep 26, 2025 by KuntaiDu Loading…
5 tasks
Validate API tokens in constant time frontend ready ONLY add when PR is ready to merge/full CI is needed
#25781 opened Sep 26, 2025 by russellb Loading… v0.11.0 Cherry Picks
Hybrid deepep
#25778 opened Sep 26, 2025 by bnellnm Draft
5 tasks
[BugFix] Potential B200 + full-CG hang fix
#25776 opened Sep 26, 2025 by LucasWilkinson Draft
5 tasks
fuse rope ci/build tpu Related to Google TPUs v1
#25774 opened Sep 26, 2025 by PatrykSaffer Draft
Fix GPTQ model loading in Transformers backend
#25770 opened Sep 26, 2025 by hmellor Loading…
Add batch invariant kernel override for FlashInfer backend v1
#25769 opened Sep 26, 2025 by bwasti Loading…
3 of 5 tasks
DP Coordination refactor v1
#25768 opened Sep 26, 2025 by SageMoore Loading…
[CI] Push multiarch manifests as nightly builds ci/build
#25764 opened Sep 26, 2025 by csahithi Loading…
5 tasks
[ROCm][Perf] New design on ROCm AITER MHA backend Implementation rocm Related to AMD ROCm v1
#25763 opened Sep 26, 2025 by ganyi1996ppo Loading…
5 tasks
[Core] Fix torch.dynamo compatibility for Qwen models on vllm-gaudi qwen Related to Qwen models v1
#25761 opened Sep 26, 2025 by pawel-olejniczak Draft
5 tasks done
ProTip! Filter pull requests by the default branch with base:main.