@Fan-Yunfan
Contributor

@Fan-Yunfan Fan-Yunfan commented Aug 7, 2025

Problem

In the cpp/include/tensorrt_llm/common directory, there is an unclear boundary of responsibilities between the assert.h file and the tllmException.h file.

From a framework design perspective, assert.h should focus on providing assertion mechanisms to verify assumptions during program runtime, while tllmException.h should specialize in exception handling. However, in the current implementation, assert.h includes exception-related macros such as TLLM_THROW, TLLM_WRAP, and related functions like throwRuntimeError.

Although this poses no syntactic issues, from a design and standardization standpoint, it creates ambiguity in the responsibilities of the files, which may confuse framework users. For example, a user might encounter the TLLM_THROW macro in their code but fail to find the corresponding exception-related header file included. Eventually, they might discover the macro in assert.h, leading to confusion: Why are exception-related macros placed in an assertion-related header file instead of the exception-related tllmException.h?

While this design keeps commonly used assertion and exception-throwing macros in one place and reduces the number of header inclusions from two to one, it is not good practice: it mixes two responsibilities in a single header and causes the confusion described above. By the same logic, placing LOG-related macros in assert.h would reduce header inclusions even further, but that would clearly be undesirable.

Current Implementation

cpp/include/tensorrt_llm/common/assert.h

#pragma once

#include "tensorrt_llm/common/stringUtils.h"
#include "tensorrt_llm/common/tllmException.h"

#include <string>

namespace tensorrt_llm::common
{
[[noreturn]] inline void throwRuntimeError(char const* const file, int const line, char const* info)
{
    throw TllmException(file, line, fmtstr("[TensorRT-LLM][ERROR] Assertion failed: %s", info).c_str());
}

[[noreturn]] inline void throwRuntimeError(char const* const file, int const line, std::string const& info = "")
{
    throw TllmException(file, line, fmtstr("[TensorRT-LLM][ERROR] Assertion failed: %s", info.c_str()).c_str());
}

} // namespace tensorrt_llm::common

......

#define TLLM_THROW(...)                                                                                                \
    do                                                                                                                 \
    {                                                                                                                  \
        throw NEW_TLLM_EXCEPTION(__VA_ARGS__);                                                                         \
    } while (0)

#define TLLM_WRAP(ex)                                                                                                  \
    NEW_TLLM_EXCEPTION("%s: %s", tensorrt_llm::common::TllmException::demangle(typeid(ex).name()).c_str(), ex.what())
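As a rough illustration of how a wrap-style macro such as TLLM_WRAP might be used at a call site, the sketch below catches a standard exception and rethrows it as a project exception whose message carries the original type name and what() text. DemoException, DEMO_WRAP and describeFailure are hypothetical stand-ins, not TensorRT-LLM code. The sketch also uses the raw typeid name instead of demangling it, since the body of TllmException::demangle is not shown in this PR.

```cpp
#include <stdexcept>
#include <string>
#include <typeinfo>
#include <vector>

namespace demo
{
// Hypothetical stand-in for TllmException: appends file and line to the message.
class DemoException : public std::runtime_error
{
public:
    DemoException(char const* file, int line, std::string const& msg)
        : std::runtime_error(msg + " (" + file + ":" + std::to_string(line) + ")")
    {
    }
};
} // namespace demo

// Builds a DemoException from a caught exception. The prefix is the raw
// (implementation-defined, not demangled) typeid name of the dynamic type.
#define DEMO_WRAP(ex)                                                                                                  \
    demo::DemoException(__FILE__, __LINE__, std::string(typeid(ex).name()) + ": " + (ex).what())

// Triggers std::out_of_range, wraps it, and returns the wrapped message.
inline std::string describeFailure()
{
    std::vector<int> const values{1, 2, 3};
    std::string message;
    try
    {
        try
        {
            message = std::to_string(values.at(10)); // throws std::out_of_range
        }
        catch (std::exception const& e)
        {
            throw DEMO_WRAP(e); // rethrow as the project exception type
        }
    }
    catch (demo::DemoException const& e)
    {
        message = e.what();
    }
    return message;
}
```

Calling describeFailure() from a small main and printing the result shows the shape of the wrapped message; the exact type-name and what() text depend on the compiler and standard library.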

Solution

Migrate the exception-related macro definitions and functions from the assert.h header file to the tllmException.h header file.

The content of the files after migration is as follows:

cpp/include/tensorrt_llm/common/assert.h

#pragma once

#include "tensorrt_llm/common/tllmException.h"

class DebugConfig
{
public:
    static bool isCheckDebugEnabled();
};

#if defined(_WIN32)
#define TLLM_LIKELY(x) (__assume((x) == 1), (x))
#define TLLM_UNLIKELY(x) (__assume((x) == 0), (x))
#else
#define TLLM_LIKELY(x) __builtin_expect((x), 1)
#define TLLM_UNLIKELY(x) __builtin_expect((x), 0)
#endif

#define TLLM_CHECK(val)                                                                                                \
    do                                                                                                                 \
    {                                                                                                                  \
        TLLM_LIKELY(static_cast<bool>(val)) ? ((void) 0)                                                               \
                                            : tensorrt_llm::common::throwRuntimeError(__FILE__, __LINE__, #val);       \
    } while (0)

#define TLLM_CHECK_WITH_INFO(val, info, ...)                                                                           \
    do                                                                                                                 \
    {                                                                                                                  \
        TLLM_LIKELY(static_cast<bool>(val))                                                                            \
        ? ((void) 0)                                                                                                   \
        : tensorrt_llm::common::throwRuntimeError(                                                                     \
            __FILE__, __LINE__, tensorrt_llm::common::fmtstr(info, ##__VA_ARGS__).c_str());                            \
    } while (0)

#define TLLM_CHECK_DEBUG(val)                                                                                          \
    do                                                                                                                 \
    {                                                                                                                  \
        if (TLLM_UNLIKELY(DebugConfig::isCheckDebugEnabled()))                                                         \
        {                                                                                                              \
            TLLM_LIKELY(static_cast<bool>(val)) ? ((void) 0)                                                           \
                                                : tensorrt_llm::common::throwRuntimeError(__FILE__, __LINE__, #val);   \
        }                                                                                                              \
    } while (0)

#define TLLM_CHECK_DEBUG_WITH_INFO(val, info, ...)                                                                     \
    do                                                                                                                 \
    {                                                                                                                  \
        if (TLLM_UNLIKELY(DebugConfig::isCheckDebugEnabled()))                                                         \
        {                                                                                                              \
            TLLM_LIKELY(static_cast<bool>(val))                                                                        \
            ? ((void) 0)                                                                                               \
            : tensorrt_llm::common::throwRuntimeError(                                                                 \
                __FILE__, __LINE__, tensorrt_llm::common::fmtstr(info, ##__VA_ARGS__).c_str());                        \
        }                                                                                                              \
    } while (0)
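The debug variants above only evaluate their condition when DebugConfig::isCheckDebugEnabled() returns true. A minimal sketch of that gating is shown below, assuming a plain process-wide boolean switch; how the real DebugConfig decides its answer is not shown in this PR, and DemoDebugConfig, DEMO_CHECK_DEBUG and debugCheckThrows are hypothetical names.

```cpp
#include <stdexcept>
#include <string>

namespace demo
{
// Hypothetical stand-in for DebugConfig: a process-wide on/off switch.
class DemoDebugConfig
{
public:
    static bool& enabled()
    {
        static bool sEnabled = false;
        return sEnabled;
    }
};

[[noreturn]] inline void throwCheckFailure(char const* expr)
{
    throw std::runtime_error(std::string("Assertion failed: ") + expr);
}
} // namespace demo

// The condition is only evaluated when the debug switch is on.
#define DEMO_CHECK_DEBUG(val)                                                                                          \
    do                                                                                                                 \
    {                                                                                                                  \
        if (demo::DemoDebugConfig::enabled())                                                                          \
        {                                                                                                              \
            static_cast<bool>(val) ? ((void) 0) : demo::throwCheckFailure(#val);                                       \
        }                                                                                                              \
    } while (0)

// Returns true if the debug check threw for the given value and switch state.
inline bool debugCheckThrows(int value, bool debugEnabled)
{
    demo::DemoDebugConfig::enabled() = debugEnabled;
    try
    {
        DEMO_CHECK_DEBUG(value >= 0);
    }
    catch (std::runtime_error const&)
    {
        return true;
    }
    return false;
}
```

With the switch off, a failing condition passes silently; with it on, the same condition throws. That is the trade-off a debug-only check makes.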

cpp/include/tensorrt_llm/common/tllmException.h

#pragma once

#include "tensorrt_llm/common/stringUtils.h"

#include <array>
#include <cstddef>
#include <stdexcept>
#include <string>

#define TLLM_THROW(...)                                                                                                \
    do                                                                                                                 \
    {                                                                                                                  \
        throw NEW_TLLM_EXCEPTION(__VA_ARGS__);                                                                         \
    } while (0)

#define TLLM_WRAP(ex)                                                                                                  \
    NEW_TLLM_EXCEPTION("%s: %s", tensorrt_llm::common::TllmException::demangle(typeid(ex).name()).c_str(), ex.what())

#define NEW_TLLM_EXCEPTION(...)                                                                                        \
    tensorrt_llm::common::TllmException(__FILE__, __LINE__, tensorrt_llm::common::fmtstr(__VA_ARGS__).c_str())

namespace tensorrt_llm::common
{

class TllmException : public std::runtime_error
{
......
};

[[noreturn]] inline void throwRuntimeError(char const* const file, int const line, char const* info)
{
    throw TllmException(file, line, fmtstr("[TensorRT-LLM][ERROR] Assertion failed: %s", info).c_str());
}

[[noreturn]] inline void throwRuntimeError(char const* const file, int const line, std::string const& info = "")
{
    throw TllmException(file, line, fmtstr("[TensorRT-LLM][ERROR] Assertion failed: %s", info.c_str()).c_str());
}

} // namespace tensorrt_llm::common

Note

  1. The original assert.h included stringUtils.h (for fmtstr); that include now lives in tllmException.h. Because assert.h includes tllmException.h, it still receives fmtstr transitively, so including tllmException.h alone is sufficient.

  2. This change does not require large-scale code modifications or additional header adjustments, because the exception-related functions and macros remain accessible through assert.h. Only their source has changed: instead of being defined directly in assert.h, they are pulled in from tllmException.h.
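The compatibility claim in the notes above can be pictured with a single-file sketch in which each commented section plays the role of one header. All names (DemoException, DEMO_THROW, DEMO_CHECK, classify) are hypothetical stand-ins, and the message format is simplified; the point is only that a caller relying on the assert layer still reaches both the check macro and the throw macro.

```cpp
#include <stdexcept>
#include <string>

// ---- Section 1: plays the role of tllmException.h (exception utilities) ----
namespace demo
{
class DemoException : public std::runtime_error
{
public:
    DemoException(char const* file, int line, std::string const& msg)
        : std::runtime_error(msg + " (" + file + ":" + std::to_string(line) + ")")
    {
    }
};

[[noreturn]] inline void throwRuntimeError(char const* file, int line, char const* info)
{
    throw DemoException(file, line, std::string("Assertion failed: ") + info);
}
} // namespace demo

#define DEMO_THROW(msg)                                                                                                \
    do                                                                                                                 \
    {                                                                                                                  \
        throw demo::DemoException(__FILE__, __LINE__, (msg));                                                          \
    } while (0)

// ---- Section 2: plays the role of assert.h; in a real tree it would begin
// ---- with an #include of section 1 and define only the check macros.
#define DEMO_CHECK(val)                                                                                                \
    do                                                                                                                 \
    {                                                                                                                  \
        static_cast<bool>(val) ? ((void) 0) : demo::throwRuntimeError(__FILE__, __LINE__, #val);                       \
    } while (0)

// ---- Section 3: a caller that conceptually includes only section 2 ----
inline std::string classify(int value)
{
    try
    {
        DEMO_CHECK(value >= 0); // assertion macro from the assert layer
        if (value == 0)
        {
            DEMO_THROW("zero is not supported"); // exception macro, reached transitively
        }
    }
    catch (demo::DemoException const& e)
    {
        return e.what();
    }
    return "ok";
}
```

In this sketch classify(5) returns "ok", while classify(0) and classify(-1) return messages that begin with "zero is not supported" and "Assertion failed: value >= 0" respectively, each followed by the file and line.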

Summary by CodeRabbit

  • Bug Fixes

    • Improved error handling by standardizing exception throwing and error message formatting for assertion failures and runtime errors.
  • New Features

    • Enhanced exception messages now include detailed context and type information when errors occur.

@coderabbitai
Contributor

coderabbitai bot commented Aug 7, 2025

Walkthrough

Exception-handling utilities and macros have been moved from assert.h to tllmException.h. The throwRuntimeError functions and the TLLM_THROW and TLLM_WRAP macros were removed from assert.h and added to tllmException.h; per the PR description, their definitions are unchanged. stringUtils.h is now included in tllmException.h.

Changes

| Cohort / File(s) | Change Summary |
| --- | --- |
| Removal of exception utilities from assert.h: cpp/include/tensorrt_llm/common/assert.h | Removed the throwRuntimeError inline functions, the TLLM_THROW and TLLM_WRAP macros, and the stringUtils.h include. The header no longer defines these; its TLLM_CHECK macros still call throwRuntimeError, which is now supplied by the included tllmException.h. |
| Addition of exception utilities to tllmException.h: cpp/include/tensorrt_llm/common/tllmException.h | Added the throwRuntimeError inline functions and the TLLM_THROW and TLLM_WRAP macros, and included stringUtils.h. Exception throwing and wrapping are now defined here; TLLM_WRAP prefixes the demangled type name of the wrapped exception to its message. |

Sequence Diagram(s)

sequenceDiagram
    participant User Code
    participant tllmException.h
    participant TllmException

    User Code->>tllmException.h: Call TLLM_THROW(...) or throwRuntimeError(...)
    tllmException.h->>TllmException: Construct TllmException with formatted message
    tllmException.h-->>User Code: Exception thrown

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~15 minutes

@Fan-Yunfan Fan-Yunfan changed the title from "[None][Fix] Fix the issue of responsibility boundary between the assert and tllmException files" to "[None][fix] Fix the issue of responsibility boundary between the assert and tllmException files" Aug 7, 2025
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Nitpick comments (1)
cpp/include/tensorrt_llm/common/tllmException.h (1)

2-2: Update copyright year to include 2025.

The copyright header should include the current year (2025) according to TensorRT-LLM coding guidelines.

Apply this diff to update the copyright year:

- * Copyright (c) 2022-2024, NVIDIA CORPORATION.  All rights reserved.
+ * Copyright (c) 2022-2025, NVIDIA CORPORATION.  All rights reserved.
📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 980929e and bd3c5c6.

📒 Files selected for processing (2)
  • cpp/include/tensorrt_llm/common/assert.h (0 hunks)
  • cpp/include/tensorrt_llm/common/tllmException.h (2 hunks)
💤 Files with no reviewable changes (1)
  • cpp/include/tensorrt_llm/common/assert.h
🧰 Additional context used
📓 Path-based instructions (3)
**/*.{cpp,h,hpp,cc,cxx}

📄 CodeRabbit Inference Engine (CODING_GUIDELINES.md)

**/*.{cpp,h,hpp,cc,cxx}: Closing braces of namespaces should have a comment saying the namespace it closes (e.g., } // namespace foo).
Prefer const or constexpr variables over #defines whenever possible.
A variable that is not modified after its initialization should be declared as const.
Except 0 (used for checking signness/existence/emptiness), nullptr, true, false, all other literals should only be used for variable initialization.
Use the Allman indentation style for braces in C++ code.
Put the semicolon for an empty for or while loop in a new line.
The statement forming the body of a switch, while, do..while, or for statement shall be a compound statement (use brace-delimited statements).
If and else should always be followed by brace-delimited statements, even if empty or a single statement.
C++ filenames should use camel case with the first letter lowercase (e.g., thisIsAFilename.cpp), and all files involved in a compilation target must have case-insensitive unique filenames.
All types (including class names) should use camel case with uppercase first letter (e.g., FooBarClass).
Local variables, methods, and namespaces should use camel case with first letter lowercase (e.g., localFooBar).
Non-magic-number global variables that are non-static and not defined in anonymous namespace should use camel case prefixed by 'g' (e.g., gDontUseGlobalFoos).
Non-magic-number global variables that are static or defined in an anonymous namespace should use camel case prefixed by 's' (e.g., sMutableStaticGlobal).
Locally visible static variables should use camel case with lowercase prefix 's' as the first letter (e.g., static std::once_flag sFlag;).
Class member variables should use camel case prefixed with 'm' (e.g., mNbFooValues). Public member variables do not require the 'm' prefix but it is encouraged for clarity.
Enumerations, global constants, static constants at class-scope, and function-scope magic-number/literal constants should be uppercase snake case with prefix...

Files:

  • cpp/include/tensorrt_llm/common/tllmException.h
**/*.{h,hpp}

📄 CodeRabbit Inference Engine (CODING_GUIDELINES.md)

Use a preprocessor guard in header files. The guard name must have prefix TRTLLM_ followed by the filename, all in caps, and no trailing underscore.

Files:

  • cpp/include/tensorrt_llm/common/tllmException.h
**/*.{cpp,h,hpp,cc,cxx,cu,py}

📄 CodeRabbit Inference Engine (CODING_GUIDELINES.md)

All TensorRT-LLM Open Source Software code should contain an NVIDIA copyright header that includes the current year. This includes .cpp, .h, .cu, .py, and any other source files which are compiled or interpreted.

Files:

  • cpp/include/tensorrt_llm/common/tllmException.h
🧠 Learnings (7)
📚 Learning: 2025-08-06T21:22:55.018Z
Learnt from: CR
PR: NVIDIA/TensorRT-LLM#0
File: CODING_GUIDELINES.md:0-0
Timestamp: 2025-08-06T21:22:55.018Z
Learning: Applies to **/*.{cpp,h,hpp,cc,cxx} : Exceptions must not be thrown across library boundaries.

Applied to files:

  • cpp/include/tensorrt_llm/common/tllmException.h
📚 Learning: 2025-08-06T21:22:55.018Z
Learnt from: CR
PR: NVIDIA/TensorRT-LLM#0
File: CODING_GUIDELINES.md:0-0
Timestamp: 2025-08-06T21:22:55.018Z
Learning: Applies to **/*.{h,hpp} : Use a preprocessor guard in header files. The guard name must have prefix TRTLLM_ followed by the filename, all in caps, and no trailing underscore.

Applied to files:

  • cpp/include/tensorrt_llm/common/tllmException.h
📚 Learning: 2025-08-06T21:22:55.018Z
Learnt from: CR
PR: NVIDIA/TensorRT-LLM#0
File: CODING_GUIDELINES.md:0-0
Timestamp: 2025-08-06T21:22:55.018Z
Learning: Applies to **/*.{cpp,h,hpp,cc,cxx,cu,py} : All TensorRT-LLM Open Source Software code should contain an NVIDIA copyright header that includes the current year. This includes .cpp, .h, .cu, .py, and any other source files which are compiled or interpreted.

Applied to files:

  • cpp/include/tensorrt_llm/common/tllmException.h
📚 Learning: 2025-08-06T13:58:07.506Z
Learnt from: galagam
PR: NVIDIA/TensorRT-LLM#6487
File: tests/unittest/_torch/auto_deploy/unit/singlegpu/test_ad_trtllm_bench.py:1-12
Timestamp: 2025-08-06T13:58:07.506Z
Learning: In TensorRT-LLM, test files (files under tests/ directories) do not require NVIDIA copyright headers, unlike production source code files. Test files typically start directly with imports, docstrings, or code.

Applied to files:

  • cpp/include/tensorrt_llm/common/tllmException.h
📚 Learning: 2025-08-01T15:14:45.673Z
Learnt from: yibinl-nvidia
PR: NVIDIA/TensorRT-LLM#6506
File: examples/models/core/mixtral/requirements.txt:3-3
Timestamp: 2025-08-01T15:14:45.673Z
Learning: In TensorRT-LLM, examples directory can have different dependency versions than the root requirements.txt file. Version conflicts between root and examples dependencies are acceptable because examples are designed to be standalone and self-contained.

Applied to files:

  • cpp/include/tensorrt_llm/common/tllmException.h
📚 Learning: 2025-07-28T17:06:08.621Z
Learnt from: moraxu
PR: NVIDIA/TensorRT-LLM#6303
File: tests/integration/test_lists/qa/examples_test_list.txt:494-494
Timestamp: 2025-07-28T17:06:08.621Z
Learning: In TensorRT-LLM testing, it's common to have both CLI flow tests (test_cli_flow.py) and PyTorch API tests (test_llm_api_pytorch.py) for the same model. These serve different purposes: CLI flow tests validate the traditional command-line workflow, while PyTorch API tests validate the newer LLM API backend. Both are legitimate and should coexist.

Applied to files:

  • cpp/include/tensorrt_llm/common/tllmException.h
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
  • GitHub Check: Pre-commit Check
🔇 Additional comments (4)
cpp/include/tensorrt_llm/common/tllmException.h (4)

19-19: LGTM!

The inclusion of stringUtils.h is necessary to support the fmtstr function used by the new exception handling utilities.


26-30: LGTM!

The TLLM_THROW macro is well-implemented using the standard do-while(0) pattern and properly delegates to the existing NEW_TLLM_EXCEPTION macro.


32-33: LGTM!

The TLLM_WRAP macro effectively combines exception type information with the original message using the existing demangle utility.


59-67: LGTM!

The throwRuntimeError function overloads are well-implemented with proper [[noreturn]] attributes, consistent error message formatting, and appropriate parameter types for flexibility.
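The review above mentions the do-while(0) pattern used by TLLM_THROW. As a hedged illustration of why macros are commonly wrapped this way, the sketch below uses a hypothetical two-statement macro, DEMO_LOG_AND_COUNT, inside a deliberately unbraced if/else. The project guidelines call for braces; they are omitted here only to expose the behavior. The wrapper makes the expansion a single statement that still expects a trailing semicolon.

```cpp
#include <string>
#include <vector>

// Two statements wrapped in do { ... } while (0): the expansion is exactly one
// statement, so it composes with an unbraced if/else.
#define DEMO_LOG_AND_COUNT(log, counter, msg)                                                                          \
    do                                                                                                                 \
    {                                                                                                                  \
        (log).push_back(msg);                                                                                          \
        ++(counter);                                                                                                   \
    } while (0)

// If the macro expanded to a bare { ... } block, the `;` after the macro call
// would end the if statement and orphan the else, which is a compile error.
inline int countNegatives(std::vector<int> const& values, std::vector<std::string>& log)
{
    int negatives = 0;
    for (int const v : values)
    {
        // Unbraced on purpose, to show that the macro behaves like one statement.
        if (v < 0)
            DEMO_LOG_AND_COUNT(log, negatives, "negative");
        else
            log.push_back("non-negative");
    }
    return negatives;
}
```

For the input {-1, 2, -3} the function returns 2 and appends three log entries, one per element.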

@svc-trtllm-gh-bot svc-trtllm-gh-bot added the "Community want to contribute" label (PRs initiated from Community) Aug 8, 2025
@Fan-Yunfan
Contributor Author

/bot run

@Fan-Yunfan
Contributor Author

Fan-Yunfan commented Aug 8, 2025

Dear @tongyuantongyu, I noticed that you have made some related modifications to the assert.h file (a139eae). Could you take a look at this PR when you have some free time?

By the way, I have another PR (https://github.com/NVIDIA/TensorRT-LLM/pull/6758) that needs your review.

@tongyuantongyu
Member

/bot run

@tensorrt-cicd
Collaborator

PR_Github #14731 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #14731 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #11118 completed with status: 'FAILURE'

@tongyuantongyu
Member

/bot run

@tensorrt-cicd
Collaborator

PR_Github #14891 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #14891 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #11235 completed with status: 'FAILURE'

@tongyuantongyu
Member

/bot run

@tensorrt-cicd
Collaborator

PR_Github #14961 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #14961 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #11294 completed with status: 'FAILURE'

@tongyuantongyu
Member

/bot run

@tensorrt-cicd
Collaborator

PR_Github #15047 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #15047 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #11365 completed with status: 'FAILURE'

@tongyuantongyu
Member

/bot run

@tensorrt-cicd
Collaborator

PR_Github #15210 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #15210 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #11488 completed with status: 'FAILURE'

@tongyuantongyu
Member

/bot run

@tensorrt-cicd
Collaborator

PR_Github #15273 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #15273 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #11531 completed with status: 'SUCCESS'
Pipeline passed with automatic retried tests. Check the rerun report for details.

@tongyuantongyu tongyuantongyu merged commit 11d08c3 into NVIDIA:main Aug 15, 2025
4 checks passed
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Aug 17, 2025
…ception files (NVIDIA#6723)

Signed-off-by: fanyunfan <[email protected]>
Co-authored-by: Yuan Tong <[email protected]>
Signed-off-by: Wangshanshan <[email protected]>
Labels

Community want to contribute PRs initiated from Community

4 participants