Skip to content

Conversation

lowsfer
Copy link
Member

@lowsfer lowsfer commented Jun 25, 2025

Improve sm120 XQA-MLA perf with better latency hiding, test_wait and simplified intra-CGA data transfer

Improve sm120 XQA-MLA perf with better latency hiding, test_wait and
simplified intra-CGA data transfer

Signed-off-by: Yao Yao <[email protected]>
@lowsfer
Copy link
Member Author

lowsfer commented Jun 25, 2025

/bot run

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9849 [ run ] triggered by Bot

@NVIDIA NVIDIA deleted a comment from tensorrt-cicd Jun 25, 2025
@NVIDIA NVIDIA deleted a comment from tensorrt-cicd Jun 25, 2025
@lowsfer lowsfer requested a review from ming-wei June 25, 2025 09:41
@tensorrt-cicd
Copy link
Collaborator

PR_Github #9849 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #7265 completed with status: 'FAILURE'

@lowsfer
Copy link
Member Author

lowsfer commented Jun 26, 2025

/bot run

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9997 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #9997 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #7373 completed with status: 'SUCCESS'

@lowsfer lowsfer merged commit 0788c5d into NVIDIA:main Jun 26, 2025
3 checks passed
@lowsfer lowsfer deleted the xqa-mla branch June 26, 2025 10:09
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 9, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 10, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
dominicshanshan pushed a commit to dominicshanshan/TensorRT-LLM that referenced this pull request Jul 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants