Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

cleanup at::Tag::needs_fixed_stride_order
#28974 opened Nov 19, 2025 by BoyuanFeng Loading…
[Feature] add session based streaming support to v1 tpu Related to Google TPUs v1
#28973 opened Nov 19, 2025 by joshuadeng Draft
2 of 5 tasks
[LoRA] Support FusedMoE LoRA Triton kernel for mxfp4 model gpt-oss Related to GPT-OSS models
#28971 opened Nov 18, 2025 by xyang16 Loading…
5 tasks
Opt beam search
#28969 opened Nov 18, 2025 by mgoin Draft
5 tasks
[DeepSeek] Fix DeepSeek V3.2 Rope Embedding deepseek Related to DeepSeek models
#28968 opened Nov 18, 2025 by zyongye Loading…
5 tasks
[Bug] Fix Batch Invariant MLA test ready ONLY add when PR is ready to merge/full CI is needed v1
#28967 opened Nov 18, 2025 by yewentao256 Loading…
Re-enable FlashInfer for Llama4 on Blackwell in e2e fusion tests llama Related to Llama models ready ONLY add when PR is ready to merge/full CI is needed
#28966 opened Nov 18, 2025 by Copilot AI Loading…
3 of 5 tasks
[Model][QwenVL] Replace torch.repeat_interleave with faster np.repeat qwen Related to Qwen models
#28964 opened Nov 18, 2025 by lgeiger Loading…
[Model][QwenVL] Simplify cos/sin rotary embedding indexing qwen Related to Qwen models
#28962 opened Nov 18, 2025 by lgeiger Loading…
[config] Expose get_total_num_hidden_layers() in ModelConfig ready ONLY add when PR is ready to merge/full CI is needed
#28961 opened Nov 18, 2025 by ptovam Loading…
[Bugfix] Fix typo in Qwen3 Next model executor qwen Related to Qwen models
#28960 opened Nov 18, 2025 by Nepherpitou Loading…
2 of 5 tasks
[Bugfix] Use lazy string reference for DeepseekV3Config in config registry deepseek Related to DeepSeek models
#28958 opened Nov 18, 2025 by yongming-qin Loading…
[amd] enable kimi k2 fp8 kv on amd mi300 rocm Related to AMD ROCm v1
#28955 opened Nov 18, 2025 by bradleyhd Draft
5 tasks
Speed up macOS smoke test ci/build ready ONLY add when PR is ready to merge/full CI is needed
#28954 opened Nov 18, 2025 by mgoin Loading…
5 tasks
[Core] Free KV cache GPU memory on engine shutdown v1
#28953 opened Nov 18, 2025 by markmc Loading…
Relax Transformers modeling backend MoE experts check documentation Improvements or additions to documentation
#28952 opened Nov 18, 2025 by hmellor Loading…
[Log] Optimize startup log ready ONLY add when PR is ready to merge/full CI is needed v1
#28948 opened Nov 18, 2025 by yewentao256 Loading…
[Doc]: fix typos in various files documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#28945 opened Nov 18, 2025 by didier-durand Loading…
2 tasks done
[CI/Build] Remove duplicate python installation ci/build ready ONLY add when PR is ready to merge/full CI is needed
#28944 opened Nov 18, 2025 by flpanbin Loading…
5 tasks
ProTip! Follow long discussions with comments:>50.