fix: Set `trust_remote_code=True` when verifying config.json load #4068

venkywonka · 2025-05-05T18:42:01Z

PR title

Set trust_remote_code=True when verifying config.json load.
When this isn't set, the behavior defaults to the model-config-specific from_hugging_face implementation's default trust_remote_code value.
For models like LlamaConfig, their internal usage default to trust_remote_code=True unless specified - but others like DeciLMConfig deafults to trust_remote_code=False. When its False, it raises an error when loading from huggingface.

Test Coverage

There is a unittest that hits that code-path (tests/unittest/llmapi/test_llm_utils.py#L15-L36), but it uses TinyLlama hence passes.

GitHub Bot Help

/bot [-h] ['run', 'kill', 'skip', 'reuse-pipeline'] ...

Provide a user friendly way for developers to interact with a Jenkins server.

Run /bot [-h|--help] to print this help message.

See details below for each supported subcommand.

run [--disable-fail-fast --skip-test --stage-list "A10-1, xxx" --gpu-type "A30, H100_PCIe" --add-multi-gpu-test --only-multi-gpu-test --disable-multi-gpu-test --post-merge --extra-stage "H100_PCIe-[Post-Merge]-1, xxx"]

Launch build/test pipelines. All previously running jobs will be killed.

--disable-fail-fast (OPTIONAL) : Disable fail fast on build/tests/infra failures.

--skip-test (OPTIONAL) : Skip all test stages, but still run build stages, package stages and sanity check stages. Note: Does NOT update GitHub check status.

--stage-list "A10-1, xxx" (OPTIONAL) : Only run the specified test stages. Examples: "A10-1, xxx". Note: Does NOT update GitHub check status.

--gpu-type "A30, H100_PCIe" (OPTIONAL) : Only run the test stages on the specified GPU types. Examples: "A30, H100_PCIe". Note: Does NOT update GitHub check status.

--only-multi-gpu-test (OPTIONAL) : Only run the multi-GPU tests. Note: Does NOT update GitHub check status.

--disable-multi-gpu-test (OPTIONAL) : Disable the multi-GPU tests. Note: Does NOT update GitHub check status.

--add-multi-gpu-test (OPTIONAL) : Force run the multi-GPU tests. Will also run L0 pre-merge pipeline.

--post-merge (OPTIONAL) : Run the L0 post-merge pipeline instead of the ordinary L0 pre-merge pipeline.

--extra-stage "H100_PCIe-[Post-Merge]-1, xxx" (OPTIONAL) : Run the ordinary L0 pre-merge pipeline and specified test stages. Examples: --extra-stage "H100_PCIe-[Post-Merge]-1, xxx".

kill

kill

Kill all running builds associated with pull request.

skip

skip --comment COMMENT

Skip testing for latest commit on pull request. --comment "Reason for skipping build/test" is required. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break.

reuse-pipeline

reuse-pipeline

Reuse a previous pipeline to validate current commit. This action will also kill all currently running builds associated with the pull request. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break.

Copilot

Pull Request Overview

This PR fixes model configuration loading by setting trust_remote_code=True in the call to AutoConfig.from_hugging_face, ensuring consistent behavior when loading config.json.

Adjusted AutoConfig.from_hugging_face call in the HF branch
Aims to prevent errors for models like DeciLMConfig when trust_remote_code is False

tensorrt_llm/llmapi/llm_args.py

Superjomn

LGTM.

Signed-off-by: Venky <[email protected]>

venkywonka · 2025-05-07T17:22:52Z

closing this, as this is model-specific, and included in #4128

venkywonka requested review from Copilot and kaiyux May 5, 2025 18:42

Copilot AI reviewed May 5, 2025

View reviewed changes

venkywonka requested a review from schetlur-nv May 5, 2025 18:42

venkywonka marked this pull request as ready for review May 5, 2025 18:45

FrankD412 assigned venkywonka May 5, 2025

FrankD412 self-requested a review May 5, 2025 18:46

FrankD412 approved these changes May 5, 2025

View reviewed changes

Superjomn reviewed May 5, 2025

View reviewed changes

tensorrt_llm/llmapi/llm_args.py Outdated Show resolved Hide resolved

Superjomn approved these changes May 5, 2025

View reviewed changes

Superjomn enabled auto-merge (squash) May 5, 2025 23:25

set trust_remote_code=True as default when loading AutoConfig

d537643

Signed-off-by: Venky <[email protected]>

auto-merge was automatically disabled May 6, 2025 13:40
Head branch was pushed to by a user without write access

venkywonka force-pushed the user/venky/fix-bench-hf-config-load branch from 46d0fab to d537643 Compare May 6, 2025 13:40

venkywonka closed this May 7, 2025

venkywonka deleted the user/venky/fix-bench-hf-config-load branch May 8, 2025 04:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: Set `trust_remote_code=True` when verifying config.json load #4068

fix: Set `trust_remote_code=True` when verifying config.json load #4068

Uh oh!

venkywonka commented May 5, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Superjomn left a comment

Uh oh!

venkywonka commented May 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: Set trust_remote_code=True when verifying config.json load #4068

fix: Set trust_remote_code=True when verifying config.json load #4068

Uh oh!

Conversation

venkywonka commented May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR title

Test Coverage

GitHub Bot Help

kill

skip

reuse-pipeline

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Superjomn left a comment

Choose a reason for hiding this comment

Uh oh!

venkywonka commented May 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: Set `trust_remote_code=True` when verifying config.json load #4068

fix: Set `trust_remote_code=True` when verifying config.json load #4068

venkywonka commented May 5, 2025 •

edited

Loading