Conversation

@Superjomn commented May 23, 2025

Description

  1. Eliminate `pytorch_backend_config` and flatten all `PyTorchConfig` knobs into `TorchLlmArgs`.
  2. Update all usages accordingly.
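
In `extra-llm-api-config.yml` terms, the change looks roughly like this (the field name `enable_overlap_scheduler` is only illustrative; any former `PyTorchConfig` knob applies the same way):

```yaml
# Before (deprecated): knobs nested under pytorch_backend_config
pytorch_backend_config:
  enable_overlap_scheduler: true

# After: the same knobs flattened to the top level (TorchLlmArgs)
enable_overlap_scheduler: true
```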

@Superjomn force-pushed the flatten-pytorch-config branch 3 times, most recently from 678236c to cc4863c, on May 23, 2025 06:31
@Superjomn requested a review from a team as a code owner May 23, 2025 06:31
@Superjomn requested reviews from yilin-void and Funatiq May 23, 2025 06:31
@Superjomn force-pushed the flatten-pytorch-config branch 3 times, most recently from c7573da to 1ec59ec, on May 23, 2025 08:37
@Superjomn requested a review from QiJune May 23, 2025 08:40
@Superjomn force-pushed the flatten-pytorch-config branch 2 times, most recently from 19f20a3 to c3d0f10, on May 23, 2025 09:51
@Superjomn (Collaborator, Author):

/bot run --add-multi-gpu-test --disable-fail-fast

@tensorrt-cicd (Collaborator):

PR_Github #6285 [ run ] triggered by Bot

@tensorrt-cicd (Collaborator):

PR_Github #6285 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #4593 completed with status: 'FAILURE'

@Superjomn force-pushed the flatten-pytorch-config branch from c3d0f10 to 1115607, on May 25, 2025 08:18
@Superjomn requested a review from a team as a code owner May 25, 2025 08:18
@Superjomn (Collaborator, Author):

/bot run --add-multi-gpu-test --disable-fail-fast

@tensorrt-cicd (Collaborator):

PR_Github #6379 [ run ] triggered by Bot

@tensorrt-cicd (Collaborator):

PR_Github #6379 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #4661 completed with status: 'FAILURE'

@Superjomn (Collaborator, Author):

/bot run --add-multi-gpu-test --disable-fail-fast

@tensorrt-cicd (Collaborator):

PR_Github #6418 [ run ] triggered by Bot

@Superjomn force-pushed the flatten-pytorch-config branch 2 times, most recently from 7755876 to a9f24cb, on May 26, 2025 05:29
@Superjomn (Collaborator, Author):

/bot run --add-multi-gpu-test --disable-fail-fast

@tensorrt-cicd (Collaborator):

PR_Github #6425 [ run ] triggered by Bot

@tensorrt-cicd (Collaborator):

PR_Github #6418 [ run ] completed with state ABORTED

@Superjomn force-pushed the flatten-pytorch-config branch from a9f24cb to 6393176, on May 27, 2025 01:57
@tensorrt-cicd (Collaborator):

PR_Github #6582 [ run ] triggered by Bot

@tensorrt-cicd (Collaborator):

PR_Github #6582 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #4814 completed with status: 'FAILURE'

@Superjomn force-pushed the flatten-pytorch-config branch from 2624ad4 to 869e6c8, on May 27, 2025 11:21
@Superjomn requested reviews from a team as code owners May 27, 2025 11:21
@Superjomn (Collaborator, Author):

/bot run --disable-fail-fast

@Superjomn enabled auto-merge (squash) May 27, 2025 11:22
@tensorrt-cicd (Collaborator):

PR_Github #6630 [ run ] triggered by Bot

@tensorrt-cicd (Collaborator):

PR_Github #6630 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #4845 completed with status: 'SUCCESS'

Superjomn added 6 commits May 28, 2025 10:19 (including "update usages"; all signed off by Superjomn <[email protected]>)
@Superjomn force-pushed the flatten-pytorch-config branch from 869e6c8 to bee40fa, on May 28, 2025 10:22
@Superjomn (Collaborator, Author):

/bot reuse-pipeline

@tensorrt-cicd (Collaborator):

PR_Github #6763 [ reuse-pipeline ] triggered by Bot

@tensorrt-cicd (Collaborator):

PR_Github #6763 [ reuse-pipeline ] completed with state SUCCESS
Reusing PR_Github #6630 for commit bee40fa

@Superjomn merged commit 5506f60 into NVIDIA:main May 28, 2025
3 checks passed
@Superjomn deleted the flatten-pytorch-config branch May 28, 2025 13:42
Review thread on this snippet:

    YOUR_DATA_PATH=<your dataset file following the format>

    cat >./extra-llm-api-config.yml<<EOF
    pytorch_backend_config:
Collaborator:

I think we should emit a "deprecated" warning when the old config type is detected. Currently old configs are still accepted, but all the fields under pytorch_backend_config are ignored, so none of those fields take effect. This causes confusion for users (like myself).
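
The check the reviewer is asking for could be sketched like this. This is a minimal illustration, not TensorRT-LLM's actual implementation; `normalize_llm_args` is a hypothetical helper name, and the real `TorchLlmArgs` validation lives elsewhere in the library:

```python
import warnings


def normalize_llm_args(config: dict) -> dict:
    """Flatten a legacy `pytorch_backend_config` section into top-level
    TorchLlmArgs-style keys, warning that the old layout is deprecated.

    Hypothetical helper for illustration only.
    """
    config = dict(config)  # avoid mutating the caller's dict
    legacy = config.pop("pytorch_backend_config", None)
    if legacy is not None:
        warnings.warn(
            "`pytorch_backend_config` is deprecated; move its fields to the "
            "top level of the config (TorchLlmArgs).",
            DeprecationWarning,
            stacklevel=2,
        )
        # Lift legacy fields to the top level; explicit top-level keys win.
        for key, value in legacy.items():
            config.setdefault(key, value)
    return config
```

With this in the config-loading path, an old-style YAML still works (its fields are honored rather than silently ignored), and the user sees a deprecation warning pointing at the new layout.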

Collaborator:

I think this is a very fair request. @Superjomn

10 participants