[TRTLLM-6368] Update deepep dispatch API #6037
TODO:
Updates to DeepEP can be viewed at: https://github.com/yuantailing/DeepEP/compare/tensorrt_llm_3...yifeizhang-c:DeepEP:dev-yifeiz-update-dispatch?expand=1
f7e371a to c06ff41
Signed-off-by: Yifei Zhang <[email protected]>
c06ff41 to fb70f7f
Walkthrough
The changes update the DeepEP submodule version and modify the
Changes
Sequence Diagram(s)
sequenceDiagram
    participant FusedMoeWideEP
    participant VariableLengthBuffer
    participant DeepEPBuffer
    FusedMoeWideEP->>VariableLengthBuffer: dispatch(x, topk_idx, topk_weights, num_experts, global_expert_id_offset)
    VariableLengthBuffer->>DeepEPBuffer: dispatch(..., global_expert_id_offset)
    DeepEPBuffer-->>VariableLengthBuffer: results
    VariableLengthBuffer-->>FusedMoeWideEP: results
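The call chain in the diagram can be sketched with plain Python stand-ins. This is only an illustration of how the extra `global_expert_id_offset` argument is forwarded through the wrapper layer: the `Stub*` classes, the sample inputs, and the echoed return values are assumptions made for the example, not the actual TensorRT-LLM or DeepEP classes, whose return values the diagram does not show.

```python
from dataclasses import dataclass, field


@dataclass
class StubDeepEPBuffer:
    """Hypothetical stand-in for the DeepEP buffer; records calls, runs no kernels."""

    calls: list = field(default_factory=list)

    def dispatch(self, x, topk_idx, topk_weights, num_experts, global_expert_id_offset):
        # Record the arguments that reached the innermost layer.
        self.calls.append(
            {"num_experts": num_experts, "global_expert_id_offset": global_expert_id_offset}
        )
        # The real results are not shown in the diagram; echo the inputs instead.
        return x, topk_idx, topk_weights


class StubVariableLengthBuffer:
    """Hypothetical wrapper that forwards dispatch() to the inner buffer unchanged."""

    def __init__(self, inner):
        self.inner = inner

    def dispatch(self, x, topk_idx, topk_weights, num_experts, global_expert_id_offset):
        return self.inner.dispatch(
            x, topk_idx, topk_weights, num_experts, global_expert_id_offset
        )


inner = StubDeepEPBuffer()
wrapper = StubVariableLengthBuffer(inner)

# Made-up inputs: one token, top-1 routing, 8 experts, offset 4.
results = wrapper.dispatch(
    x=[[0.1, 0.2]],
    topk_idx=[[3]],
    topk_weights=[[1.0]],
    num_experts=8,
    global_expert_id_offset=4,
)
```

After the call, `inner.calls` holds the offset exactly as the caller supplied it, which is the only behavior the sketch is meant to show.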
@yuantailing for extra review of this PR (corresponding to task 9.1).
/bot run
PR_Github #12171 [ run ] triggered by Bot
PR_Github #12171 [ run ] completed with state
Signed-off-by: Yifei Zhang <[email protected]>
Signed-off-by: Shreyas Misra <[email protected]>
[TRTLLM-6368] Update DeepEP dispatch API
Description
This PR updates DeepEP's dispatch API to hide topk_idx's value conversion and dtype casting inside the kernels, so that it fits the current torch.ops.trtllm.fused_moe implementation better.
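To make the description above concrete, here is a minimal sketch of the kind of caller-side work that could move into the kernels. It is an assumption, not the actual DeepEP or TensorRT-LLM code: it supposes that the value conversion shifts rank-local expert ids by `global_expert_id_offset`, that `-1` marks an unused top-k slot, and that the dtype cast is from 64-bit to 32-bit integers. The helper name `to_global_expert_ids` is hypothetical, and stdlib `array` objects stand in for tensors.

```python
from array import array


def to_global_expert_ids(local_ids, global_expert_id_offset):
    """Shift rank-local expert ids into the global id space (hypothetical helper).

    Assumption: -1 marks an unused top-k slot and is passed through unchanged.
    """
    shifted = [i if i < 0 else i + global_expert_id_offset for i in local_ids]
    # Typecode 'i' (C int) stands in for an int32 tensor.
    return array("i", shifted)


# Typecode 'q' (64-bit signed) stands in for an int64 tensor of received ids.
recv_local = array("q", [0, 2, -1, 1])
recv_global = to_global_expert_ids(recv_local, global_expert_id_offset=8)
print(list(recv_global))  # [8, 10, -1, 9]
```

With the updated API, a caller would presumably pass the offset to dispatch and skip a step like this entirely.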