test(perf): Add some Llama-3_3-Nemotron-Super-49B-v1 integration-perf-tests (TRT flow, trtllm-bench)
#4128
/bot run --disable-fail-fast
238c866 to af359d2
PR_Github #4410 [ run ] triggered by Bot
/bot run --disable-fail-fast
PR_Github #4421 [ run ] triggered by Bot
PR_Github #4410 [ run ] completed with state
PR_Github #4421 [ run ] completed with state
8330f99 to 4a95b91
/bot run --disable-fail-fast
PR_Github #4472 [ run ] triggered by Bot
PR_Github #4472 [ run ] completed with state
4a95b91 to 28b41d8
/bot run --disable-fail-fast
PR_Github #4584 [ run ] triggered by Bot
PR_Github #4584 [ run ] completed with state
/bot run --disable-fail-fast
PR_Github #4737 [ run ] triggered by Bot
PR_Github #4737 [ run ] completed with state
/bot run --disable-fail-fast
PR_Github #5109 [ run ] triggered by Bot
PR_Github #5109 [ run ] completed with state
36c87dc to 9b7eb3b
/bot run --disable-fail-fast
PR_Github #5216 [ run ] triggered by Bot
9b7eb3b to 721533d
PR_Github #5216 [ run ] completed with state
721533d to 85aeba4
e61f001 to abb4c42
/bot run --disable-fail-fast
PR_Github #5534 [ run ] triggered by Bot
PR_Github #5534 [ run ] completed with state
/bot run --disable-fail-fast
PR_Github #5544 [ run ] triggered by Bot
PR_Github #5544 [ run ] completed with state
Signed-off-by: Venky Ganesh <[email protected]>
Signed-off-by: Venky Ganesh <[email protected]>
Signed-off-by: Venky Ganesh <[email protected]>
Signed-off-by: Venky <[email protected]>
abb4c42 to 9db8469
/bot run --disable-fail-fast
PR_Github #5723 [ run ] triggered by Bot
PR_Github #5723 [ run ] completed with state
Description
- Adds `Llama-3_3-Nemotron-Super-49B-v1` integration-perf-tests (cpp backend, trtllm-bench).
- Adds the `--trust_remote_code` flag to the `trtllm-bench` `build` subcommand. The `transformers` library requires it in order to load DeciLM-based models (Llama-Nemotron-Super being one of them) through its Auto classes.
- Updates `config.py` and `model.py` so that the `DeciLMForCausalLM` classes use `trust_remote_code=True` by default (it was `False` by default previously), so that things work smoothly without extra parametrization when run from the top-level trtllm-bench.

Performance Summary – llama_v3.3_nemotron_super_49b
Run Invariants
llama_v3.3_nemotron_super_49b trtllm-bench Execution Status Matrix
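
For context on the `trust_remote_code` change in the description, here is a minimal sketch of how the setting reaches a `transformers` Auto class. It is not the code from this PR: `auto_class_kwargs`, `load_hf_config`, and the `MODEL_ID` value are hypothetical names and assumptions made for illustration. Only `AutoConfig.from_pretrained(..., trust_remote_code=...)` is an existing `transformers` call.

```python
from typing import Any, Dict

# Assumed Hugging Face model id; the PR description does not spell it out.
MODEL_ID = "nvidia/Llama-3_3-Nemotron-Super-49B-v1"


def auto_class_kwargs(trust_remote_code: bool = True) -> Dict[str, Any]:
    """Hypothetical helper: keyword arguments forwarded to an Auto class.

    The default mirrors the change described above (True instead of the
    previous False), so callers do not have to pass anything extra.
    """
    return {"trust_remote_code": trust_remote_code}


def load_hf_config(model_id: str = MODEL_ID, **overrides: Any):
    """Hypothetical helper: load a model config through AutoConfig."""
    # Imported lazily so auto_class_kwargs() works without transformers installed.
    from transformers import AutoConfig

    kwargs = auto_class_kwargs()
    kwargs.update(overrides)  # an explicit trust_remote_code=False still wins
    # DeciLM-based checkpoints ship custom modeling code, so the Auto class
    # needs trust_remote_code=True to load them.
    return AutoConfig.from_pretrained(model_id, **kwargs)
```

Calling `load_hf_config()` needs network access and the `transformers` package. Passing `trust_remote_code=False` as an override restores the old default, which the description says is not enough for DeciLM-based models.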