ROCm / vllm Public

forked from vllm-project/vllm

Notifications You must be signed in to change notification settings
Fork 48
Star 103

Code
Issues 5
Pull requests 33
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: ROCm/vllm

Labels 14 Milestones 0

New pull request New

33 Open 665 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Support fp8 with static scales

#725 opened Oct 3, 2025 by lburzawa

Loading…

5 tasks

Quick port of fp4 fusedmoe

#724 opened Sep 30, 2025 by jpvillam-amd

Loading…

[Triton] Shaoclee/355 wip mha rope kv cache

#723 opened Sep 29, 2025 by k50112113

Loading…

Add dispatch for different mha backend

#722 opened Sep 29, 2025 by zhuyuhua-v • Draft

5 tasks

Fix attn bug in qwen3-8b benchmark test

#721 opened Sep 28, 2025 by PerryZhang01

Loading…

5 tasks

update aiter fused_moe interface

#720 opened Sep 28, 2025 by zhiding512

Loading…

[FEAT] Add support for AITER bpreshuffle block scale gemm

#717 opened Sep 27, 2025 by tjtanaavllm

Loading…

5 tasks

[Perf] refactor attention backend for perf boost

#713 opened Sep 26, 2025 by ganyi1996ppo

Loading…

5 tasks

add hipblas in Docker build

#708 opened Sep 25, 2025 by dllehr-amd

Loading…

5 tasks

[355_wip] Let dynamo capture rms/silu_mul+f4gemm pattern

#705 opened Sep 24, 2025 by xytpai

Loading…

[ROCm] Add allreduce dispatcher for ROCm device

#704 opened Sep 24, 2025 by zejunchen-zejun

Loading…

Qwen-next script

#702 opened Sep 24, 2025 by ZhiweiYan-96

Loading…

5 tasks

[ROCm] Add allreduce dispatcher for ROCm device

#695 opened Sep 18, 2025 by zejunchen-zejun

Loading…

[ROCm] warpSize is being made non constexpr in ROCm 7.0 (#20330)

#694 opened Sep 18, 2025 by xudonlyu

Loading…

[355_wip] Let inductor capture silu+mul+quant pattern and replace them with aiter operator

#669 opened Sep 11, 2025 by xytpai

Loading…

support ck-tile fused bias gemm for rocm unquantized gemm

#668 opened Sep 11, 2025 by eliotwang

Loading…

support rocblas for rocm_unquantized_gemm

#665 opened Sep 10, 2025 by eliotwang

Loading…

add fp8 gemm path choice for rocm_aiter_gemm_w8a8_blockscale

#659 opened Sep 8, 2025 by zhuyuhua-v

Loading…

Add cache config for gpt oss

#656 opened Sep 5, 2025 by cagrikymk • Draft

[NOT FOR LANDING] 355_wip_0909_rc2 -> 0909_rc2

#654 opened Sep 4, 2025 by maleksan85 • Draft

fix flashmla metadata build calls()

#636 opened Aug 19, 2025 by ZJLi2013

Loading…

Updated README.md for August 12 RC2 throughput results only

#631 opened Aug 13, 2025 by Mcirino1

Loading…

[Model] Add GPT-OSS model code and config

#625 opened Aug 7, 2025 by ashishtanwer

Loading…

add Fused_rms_quant for deepseek_v2 model

#611 opened Jul 29, 2025 by ZJLi2013

Loading…

add fused fp8 bmm

#604 opened Jul 25, 2025 by k50112113

Loading…

Previous 1 2 Next

Previous Next

ProTip! Updated in the last three days: updated:>2025-10-02.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!