Fix Transformers backend tensor parallel for multimodal models #22673

hmellor · 2025-08-11T20:22:42Z

This PR fixes 2 things:

Top level multimodal configs don't have base_model_tp_plan. So we make sure we get it from each sub config in TransformersBase
If a model with a TP plan is a sub model, the to plan patterns won't match because there is a missing prefix. We now add this prefix before matching.

Signed-off-by: Harry Mellor <[email protected]>

github-actions · 2025-08-11T20:22:49Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

gemini-code-assist

Code Review

This pull request introduces two targeted and important fixes for enabling tensor parallelism in multimodal models using the Transformers backend. The first change correctly retrieves the tensor parallelism plan from the text model's configuration, which is the appropriate location for multimodal architectures. The second change smartly restricts the application of tensor parallelization to only the language model component, which correctly prevents unintended and problematic modifications to the vision tower. Both changes are well-implemented and appear correct. I have no further comments.

vllm/model_executor/models/transformers.py

Signed-off-by: Harry Mellor <[email protected]>

…project#22673) Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: Diego-Castan <[email protected]>

…project#22673) Signed-off-by: Harry Mellor <[email protected]>

…project#22673) Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: Xiao Yu <[email protected]>

…project#22673) Signed-off-by: Harry Mellor <[email protected]>

Fix Transformers backend tensor parallel for multimodal models

e00de63

Signed-off-by: Harry Mellor <[email protected]>

hmellor requested a review from Isotr0py August 11, 2025 20:23

gemini-code-assist bot reviewed Aug 11, 2025

View reviewed changes

hmellor mentioned this pull request Aug 11, 2025

[New Model] Support Command-A-Vision #22660

Merged

4 tasks

Isotr0py reviewed Aug 12, 2025

View reviewed changes

vllm/model_executor/models/transformers.py Outdated Show resolved Hide resolved

Merge branch 'main' into fix-transformers-backend-tp

1492048

hmellor added this to the v0.10.1 milestone Aug 12, 2025

Dynamically update tp_plan depending on the current module

1a76ab6

Signed-off-by: Harry Mellor <[email protected]>

Isotr0py approved these changes Aug 12, 2025

View reviewed changes

hmellor enabled auto-merge (squash) August 12, 2025 16:18

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 12, 2025

vllm-bot merged commit d0a6301 into vllm-project:main Aug 13, 2025
40 of 48 checks passed

hmellor deleted the fix-transformers-backend-tp branch August 13, 2025 06:28

yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request Aug 19, 2025

Fix Transformers backend tensor parallel for multimodal models (vllm-…

2df289c

…project#22673) Signed-off-by: Harry Mellor <[email protected]>

divakar-amd pushed a commit to divakar-amd/vllm_upstream that referenced this pull request Aug 20, 2025

Fix Transformers backend tensor parallel for multimodal models (vllm-…

b4bcf2b

…project#22673) Signed-off-by: Harry Mellor <[email protected]>

epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025

Fix Transformers backend tensor parallel for multimodal models (vllm-…

e7808e4

…project#22673) Signed-off-by: Harry Mellor <[email protected]>

xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025

Fix Transformers backend tensor parallel for multimodal models (vllm-…

3af7900

…project#22673) Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: Xiao Yu <[email protected]>

zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025

Fix Transformers backend tensor parallel for multimodal models (vllm-…

10df26b

…project#22673) Signed-off-by: Harry Mellor <[email protected]>

hmellor added this to Transformers backend Sep 24, 2025

hmellor moved this to Done in Transformers backend Sep 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix Transformers backend tensor parallel for multimodal models #22673

Fix Transformers backend tensor parallel for multimodal models #22673

Uh oh!

hmellor commented Aug 11, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Aug 11, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Fix Transformers backend tensor parallel for multimodal models #22673

Fix Transformers backend tensor parallel for multimodal models #22673

Uh oh!

Conversation

hmellor commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Aug 11, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hmellor commented Aug 11, 2025 •

edited

Loading