[DO NOT MERGE] 2.9, Inductor partition, standalone compile, monkeypatch fix(es), spawn #26882

ProExpertProg · 2025-10-15T05:11:26Z

This is just #26738 but with spawn forced so we can try that in CI as well.

Signed-off-by: Huy Do <[email protected]>

Signed-off-by: ProExpertProg <[email protected]>

Signed-off-by: Luka Govedič <[email protected]>

commit a4ee300 Author: angelayi <[email protected]> Date: Tue Oct 14 19:19:25 2025 -0700 test moving import Signed-off-by: angelayi <[email protected]> commit 0ba846b Author: angelayi <[email protected]> Date: Mon Oct 13 13:36:43 2025 -0700 [BugFix] Patch inductor partitioning logic Signed-off-by: angelayi <[email protected]> Signed-off-by: ProExpertProg <[email protected]>

commit 6b0c3c3 Author: Boyuan Feng <[email protected]> Date: Tue Oct 14 21:30:29 2025 -0700 nit Signed-off-by: Boyuan Feng <[email protected]> commit 1016467 Author: Boyuan Feng <[email protected]> Date: Tue Oct 14 21:21:47 2025 -0700 fix multi-graph test Signed-off-by: Boyuan Feng <[email protected]> Signed-off-by: ProExpertProg <[email protected]>

Signed-off-by: ProExpertProg <[email protected]>

mergify · 2025-10-15T05:12:07Z

⚠️ The sha of the head commit of this PR conflicts with #26738. Mergify cannot evaluate rules on this PR. ⚠️

gemini-code-assist

Code Review

This pull request introduces a significant number of changes to support PyTorch 2.9, including updates to dependencies, CI/CD configurations, and Dockerfiles. It also enables Inductor graph partitioning by default for PyTorch 2.9+ and switches the default multiprocessing method for workers to spawn. Notably, it includes monkeypatches for PyTorch 2.9 to work around upstream issues, which are well-documented. The overall changes seem to be moving in the right direction for PyTorch 2.9 support. However, I've found some temporary testing code that should be removed before this PR is merged.

gemini-code-assist · 2025-10-15T05:12:48Z

tests/standalone_tests/python_only_compile.sh

+# TESTING, TO BE REMOVED
+VLLM_TEST_USE_PRECOMPILED_NIGHTLY_WHEEL=1 VLLM_USE_PRECOMPILED=1 pip3 install -vvv -e . \
+    --extra-index-url https://download.pytorch.org/whl/test/cu128


This change appears to be for temporary testing, as indicated by the comment # TESTING, TO BE REMOVED. Hardcoding the --extra-index-url and leaving such comments can lead to future issues if not cleaned up. Please remove this temporary code before this pull request is merged.

Suggested change

# TESTING, TO BE REMOVED

VLLM_TEST_USE_PRECOMPILED_NIGHTLY_WHEEL=1 VLLM_USE_PRECOMPILED=1 pip3 install -vvv -e . \

--extra-index-url https://download.pytorch.org/whl/test/cu128

VLLM_TEST_USE_PRECOMPILED_NIGHTLY_WHEEL=1 VLLM_USE_PRECOMPILED=1 pip3 install -vvv -e .

GhostCCCatHenry · 2025-10-15T05:13:18Z

这是来自QQ邮箱的假期自动回复邮件。您好，我最近正在休假中，无法亲自回复您的邮件。我将在假期结束后，尽快给您回复。

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

chatgpt-codex-connector · 2025-10-15T05:15:18Z

tests/compile/test_full_graph.py

        pytest.skip("inductor graph partition is only available in PyTorch 2.9+")

    model = "nvidia/Llama-4-Scout-17B-16E-Instruct-FP8"
+    if current_platform.get_device_capability()[0] < 10:
+        pytest.skip(f"{model} can only be loaded by B200 or above")


Guard device capability before indexing

The new B200 gate in test_inductor_graph_partition_attn_fusion assumes current_platform.get_device_capability() always returns a tuple, but on CPU builds get_device_capability() returns None. The added check current_platform.get_device_capability()[0] < 10 will raise a TypeError before the test can skip, causing the entire test run to crash on environments without CUDA rather than being skipped. Consider storing the capability in a variable and skipping when it is None or has a major version below 10.

Useful? React with 👍 / 👎.

ProExpertProg · 2025-10-15T13:43:59Z

Seems like spawn is resolved, closing for now

huydhn added 30 commits September 16, 2025 13:55

Update PyTorch to 2.9.0

e738cba

Signed-off-by: Huy Do <[email protected]>

Add a comment

497cffd

Signed-off-by: Huy Do <[email protected]>

Not setting --extra-index-url in test.in

89ba43a

Signed-off-by: Huy Do <[email protected]>

Use https://download.pytorch.org/whl/test

157aae3

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

ea1ef7a

Signed-off-by: Huy Do <[email protected]>

Put torchao back to the same state

0a81fb2

Signed-off-by: Huy Do <[email protected]>

Install the latest torchao nightly for quantization test

0325966

Signed-off-by: Huy Do <[email protected]>

Debug distributed failures

39b9cbf

Signed-off-by: Huy Do <[email protected]>

Wrong torchao package

0272040

Signed-off-by: Huy Do <[email protected]>

Attempt the fix in NVIDIA/nccl#1838

c2e0eaf

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

c16db74

Signed-off-by: Huy Do <[email protected]>

Set inductor_graph_partition to True by default

d3436a8

Signed-off-by: Huy Do <[email protected]>

Rerun with RC3

0e581a3

Signed-off-by: Huy Do <[email protected]>

Rerun with RC4

84c6cc3

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

3637adb

Signed-off-by: Huy Do <[email protected]>

Build CPU docker image

23c6427

Signed-off-by: Huy Do <[email protected]>

Leave CPU for later

ba8a85f

Signed-off-by: Huy Do <[email protected]>

CPU build should work now

ec7b5c4

Signed-off-by: Huy Do <[email protected]>

Rebuild flashinfer-python for 2.9.0

869d13e

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

a670c2e

Signed-off-by: Huy Do <[email protected]>

Fix precommit

145e225

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

9cd7683

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

47ae5d8

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

e7064b4

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

106bd40

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

76e438d

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

ebaa419

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

b4ed78c

Signed-off-by: Huy Do <[email protected]>

Skip some test unless it's B200

5d50c59

Signed-off-by: Huy Do <[email protected]>

Merge branch 'main' into pytorch-2.9.0

210aa68

Signed-off-by: Huy Do <[email protected]>

huydhn and others added 11 commits October 14, 2025 18:14

Merge branch 'main' into pytorch-2.9.0

39cd1b2

Signed-off-by: Huy Do <[email protected]>

Merge remote-tracking branch 'upstream/main' into pytorch-2.9.0

2b337fc

Fix test.txt

737ea15

Signed-off-by: ProExpertProg <[email protected]>

Enable use_inductor_graph_partition by default in >=2.9

7108bd6

Signed-off-by: ProExpertProg <[email protected]>

Turn standalone compile back on

4b39e6f

Signed-off-by: Luka Govedič <[email protected]>

[Graph Partition] pass tests for decorator (vllm-project#26831)

125c888

Signed-off-by: ProExpertProg <[email protected]>

TEMP: disable nested torch compilation

e811cb5

Signed-off-by: ProExpertProg <[email protected]>

TEMP force spawn for tests

f1dcb6d

Signed-off-by: ProExpertProg <[email protected]>

TEMP: use spawn to circumvent CUDA init issue

4e2976b

Signed-off-by: ProExpertProg <[email protected]>

ProExpertProg requested review from LucasWilkinson, WoosukKwon, bigPYJ1151, hmellor, houseroad, mgoin, robertgshaw2-redhat, simon-mo, tlrmchlsmth, yewentao256 and youkaichao as code owners October 15, 2025 05:11

ProExpertProg mentioned this pull request Oct 15, 2025

[DO NOT MERGE] 2.9, Inductor partition, standalone compile, monkeypatch fix(es) #26738

Open

gemini-code-assist bot reviewed Oct 15, 2025

View reviewed changes

ProExpertProg added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 15, 2025

chatgpt-codex-connector bot reviewed Oct 15, 2025

View reviewed changes

ProExpertProg closed this Oct 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

[DO NOT MERGE] 2.9, Inductor partition, standalone compile, monkeypatch fix(es), spawn #26882

[DO NOT MERGE] 2.9, Inductor partition, standalone compile, monkeypatch fix(es), spawn #26882

Uh oh!

ProExpertProg commented Oct 15, 2025 •

edited

Loading

Uh oh!

mergify bot commented Oct 15, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 15, 2025

Uh oh!

GhostCCCatHenry commented Oct 15, 2025 via email

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Oct 15, 2025

Uh oh!

ProExpertProg commented Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Uh oh!

[DO NOT MERGE] 2.9, Inductor partition, standalone compile, monkeypatch fix(es), spawn #26882

[DO NOT MERGE] 2.9, Inductor partition, standalone compile, monkeypatch fix(es), spawn #26882

Uh oh!

Conversation

ProExpertProg commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mergify bot commented Oct 15, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

GhostCCCatHenry commented Oct 15, 2025 via email

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

ProExpertProg commented Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ProExpertProg commented Oct 15, 2025 •

edited

Loading