-
-
Notifications
You must be signed in to change notification settings - Fork 10.4k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Core] Bookkeeping optimization: Vectorize updates
v1
#25801
opened Sep 27, 2025 by
Jialin
Loading…
3 of 5 tasks
[Feat][EPLB][Perf] Enable Round-robin expert placement strategy while eplb is enabled.
#25798
opened Sep 27, 2025 by
cboss6
Loading…
[Bugfix] Add missing ONLY add when PR is ready to merge/full CI is needed
image_size
for phi4_multimodal
ready
#25796
opened Sep 27, 2025 by
Renovamen
Loading…
[Misc] Update openai client example file for multimodal
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#25795
opened Sep 27, 2025 by
ywang96
Loading…
5 tasks
Add filtering for chat template kwargs
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
[DRAFT][DO NOT REVIEW] Compile once per block
qwen
Related to Qwen models
#25792
opened Sep 26, 2025 by
Lucaskabela
•
Draft
5 tasks
perf: optimize rejection sampling triton kernel
v1
#25791
opened Sep 26, 2025 by
happierpig
Loading…
5 tasks
[gpt-oss] disable tool server initialization if no tool in request
frontend
gpt-oss
Related to GPT-OSS models
#25790
opened Sep 26, 2025 by
qandrew
Loading…
[Benchmark] Cleanup deprecated nightly benchmark and adjust the docstring for performance benchmark
ci/build
documentation
Improvements or additions to documentation
performance
Performance-related issues
#25786
opened Sep 26, 2025 by
KuntaiDu
Loading…
5 tasks
[Misc] Make EP kernels install script support uv
#25785
opened Sep 26, 2025 by
LucasWilkinson
Loading…
[Misc] Integrate Suffix Decoding from Arctic Inference
needs-rebase
v1
#25784
opened Sep 26, 2025 by
aurickq
Loading…
1 of 3 tasks
Validate API tokens in constant time
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
[BugFix] Potential B200 + full-CG hang fix
#25776
opened Sep 26, 2025 by
LucasWilkinson
•
Draft
5 tasks
[Bugfix] Improve GPU validation logging in Ray fallback scenarios
#25775
opened Sep 26, 2025 by
sairampillai
Loading…
3 of 5 tasks
Add batch invariant kernel override for FlashInfer backend
v1
#25769
opened Sep 26, 2025 by
bwasti
Loading…
3 of 5 tasks
[Bug]: Set LD_LIBRARY_PATH to include the 'standard' CUDA location
ci/build
#25766
opened Sep 26, 2025 by
smarterclayton
Loading…
5 tasks
[CI] Push multiarch manifests as nightly builds
ci/build
#25764
opened Sep 26, 2025 by
csahithi
Loading…
5 tasks
[ROCm][Perf] New design on ROCm AITER MHA backend Implementation
rocm
Related to AMD ROCm
v1
#25763
opened Sep 26, 2025 by
ganyi1996ppo
Loading…
5 tasks
[Core] Fix torch.dynamo compatibility for Qwen models on vllm-gaudi
qwen
Related to Qwen models
v1
#25761
opened Sep 26, 2025 by
pawel-olejniczak
•
Draft
5 tasks done
Refactor/understanding prepare inputs padded
speculative-decoding
v1
#25758
opened Sep 26, 2025 by
tomasruizt
•
Draft
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.