
Conversation

Collaborator

@gshtras gshtras commented Jul 22, 2025

Moving the CUDA-specific file to the ifdef CUDA section.
Fixes the regression from #21083.

Signed-off-by: Gregory Shtrasberg <[email protected]>
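
For context, a minimal sketch of the pattern this change applies: the operator registration moves inside the block that ROCm builds skip. Only the ops.impl call and the USE_ROCM macro come from this PR's discussion; the schema string and the surrounding layout of csrc/torch_bindings.cpp are elided or illustrative here.

// Illustrative fragment, not the exact vLLM diff: a CUDA-only op
// registration guarded so ROCm builds (which define USE_ROCM) skip it.
#ifndef USE_ROCM
  // ops.def("per_token_group_fp8_quant(... scale_ue8m0) -> ()");  // schema elided
  ops.impl("per_token_group_fp8_quant", torch::kCUDA,
           &per_token_group_quant_fp8);
#endif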
@github-actions

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, they only run the fastcheck CI, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

@gshtras gshtras added the rocm Related to AMD ROCm label Jul 22, 2025
@mergify mergify bot added the ci/build label Jul 22, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request fixes a build regression on ROCm by correctly scoping a CUDA-specific quantization kernel and its corresponding PyTorch operator registration so that they are only included in CUDA builds. I've added one high-severity comment about an unconditional function declaration in a header file that should also be made conditional to prevent a potential future build issue.

Comment on lines 617 to +626
"scale_ue8m0) -> ()");
ops.impl("per_token_group_fp8_quant", torch::kCUDA,
&per_token_group_quant_fp8);

Contributor

high

This change correctly makes the registration of per_token_group_fp8_quant conditional on CUDA builds. To prevent potential linker errors on ROCm builds, it's highly recommended to also wrap the declaration of per_token_group_quant_fp8 in csrc/ops.h with #ifndef USE_ROCM.
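
For illustration, a sketch of what that follow-up in csrc/ops.h could look like; the parameter list below is a placeholder, since the actual signature is not shown in this thread.

// Hypothetical sketch for csrc/ops.h: guard the declaration the same way so
// ROCm translation units never reference the CUDA-only symbol.
// The parameters here are placeholders, not the real signature.
#ifndef USE_ROCM
void per_token_group_quant_fp8(torch::Tensor const& input,
                               torch::Tensor& output_q,
                               torch::Tensor& output_s /* , ... */);
#endif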

Member

Nice bot!

Member

@yewentao256 yewentao256 left a comment

Thanks for the fix!


Member

@mgoin mgoin left a comment

Thank you for fixing this, and apologies for the disruption.

@mgoin mgoin enabled auto-merge (squash) July 22, 2025 17:15
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 22, 2025
@vllm-bot vllm-bot merged commit 3ec7170 into vllm-project:main Jul 23, 2025
96 of 98 checks passed
zixi-qi pushed a commit to zixi-qi/vllm that referenced this pull request Jul 23, 2025
@gshtras gshtras deleted the rocm_build_fix branch August 5, 2025 16:52
x22x22 pushed a commit to x22x22/vllm that referenced this pull request Aug 5, 2025
Pradyun92 pushed a commit to Pradyun92/vllm that referenced this pull request Aug 6, 2025
npanpaliya pushed a commit to odh-on-pz/vllm-upstream that referenced this pull request Aug 6, 2025
jinzhen-lin pushed a commit to jinzhen-lin/vllm that referenced this pull request Aug 9, 2025
paulpak58 pushed a commit to paulpak58/vllm that referenced this pull request Aug 13, 2025
diegocastanibm pushed a commit to diegocastanibm/vllm that referenced this pull request Aug 15, 2025
epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025

Labels

ci/build, ready (ONLY add when PR is ready to merge/full CI is needed), rocm (Related to AMD ROCm)


4 participants