-
Notifications
You must be signed in to change notification settings - Fork 1.9k
[https://nvbugs/5289907][fix] Restore per-channel pre-quant #4545
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[https://nvbugs/5289907][fix] Restore per-channel pre-quant #4545
Conversation
|
/bot run |
|
PR_Github #6041 [ run ] triggered by Bot |
|
PR_Github #6041 [ run ] completed with state |
12aac5f to
8dee53e
Compare
|
/bot run |
|
PR_Github #6082 [ run ] triggered by Bot |
|
/bot kill |
|
PR_Github #6085 [ kill ] triggered by Bot |
|
PR_Github #6082 [ run ] completed with state |
|
PR_Github #6085 [ kill ] completed with state |
|
/bot run --disable-fail-fast |
|
PR_Github #6114 [ run ] triggered by Bot |
|
/bot run --disable-fail-fast |
|
PR_Github #6116 [ run ] triggered by Bot |
|
PR_Github #6114 [ run ] completed with state |
|
LGTM |
|
PR_Github #6116 [ run ] completed with state |
f37b3ca to
dec4eb5
Compare
|
/bot run |
|
PR_Github #6145 [ run ] triggered by Bot |
|
PR_Github #6145 [ run ] completed with state |
dec4eb5 to
039f4be
Compare
|
/bot run |
|
PR_Github #6171 [ run ] triggered by Bot |
|
PR_Github #6171 [ run ] completed with state |
Signed-off-by: Barry Kang <[email protected]>
Signed-off-by: Barry Kang <[email protected]>
Signed-off-by: Barry Kang <[email protected]>
039f4be to
a970958
Compare
|
/bot run |
|
PR_Github #6194 [ run ] triggered by Bot |
|
PR_Github #6194 [ run ] completed with state |
) * Restore per-channel pre-quant Signed-off-by: Barry Kang <[email protected]> * Update TRT test script Signed-off-by: Barry Kang <[email protected]> * Fix pre-commit Signed-off-by: Barry Kang <[email protected]> --------- Signed-off-by: Barry Kang <[email protected]>
) * Restore per-channel pre-quant Signed-off-by: Barry Kang <[email protected]> * Update TRT test script Signed-off-by: Barry Kang <[email protected]> * Fix pre-commit Signed-off-by: Barry Kang <[email protected]> --------- Signed-off-by: Barry Kang <[email protected]>
This PR restores the per-expert pre-quant scale kernel into the original per-channel one in MoE modules to fix the broken ModelOpt Mixtral-AWQ support.