Skip to content

Conversation

jananisriram
Copy link
Contributor

Summary: Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Differential Revision: D82516347

@facebook-github-bot
Copy link
Contributor

@jananisriram has exported this pull request. If you are a Meta employee, you can view the originating diff in D82516347.

jananisriram added a commit that referenced this pull request Sep 16, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Differential Revision: D82516347
jananisriram added a commit that referenced this pull request Sep 16, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Differential Revision: D82516347
@facebook-github-bot
Copy link
Contributor

@jananisriram has exported this pull request. If you are a Meta employee, you can view the originating diff in D82516347.

jananisriram added a commit that referenced this pull request Sep 16, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Differential Revision: D82516347
@facebook-github-bot
Copy link
Contributor

@jananisriram has exported this pull request. If you are a Meta employee, you can view the originating diff in D82516347.

jananisriram added a commit that referenced this pull request Sep 16, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Differential Revision: D82516347
facebook-github-bot pushed a commit that referenced this pull request Sep 17, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Differential Revision: D82516347
jananisriram added a commit that referenced this pull request Sep 17, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Differential Revision: D82516347
jananisriram added a commit that referenced this pull request Sep 17, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Differential Revision: D82516347
@facebook-github-bot
Copy link
Contributor

@jananisriram has exported this pull request. If you are a Meta employee, you can view the originating diff in D82516347.

jananisriram added a commit that referenced this pull request Sep 18, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Reviewed By: njriasan

Differential Revision: D82516347
facebook-github-bot pushed a commit that referenced this pull request Sep 18, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Reviewed By: njriasan

Differential Revision: D82516347
@facebook-github-bot
Copy link
Contributor

@jananisriram has exported this pull request. If you are a Meta employee, you can view the originating diff in D82516347.

facebook-github-bot pushed a commit that referenced this pull request Sep 18, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Reviewed By: njriasan

Differential Revision: D82516347
facebook-github-bot pushed a commit that referenced this pull request Sep 18, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Reviewed By: njriasan

Differential Revision: D82516347
@facebook-github-bot
Copy link
Contributor

@jananisriram has exported this pull request. If you are a Meta employee, you can view the originating diff in D82516347.

facebook-github-bot pushed a commit that referenced this pull request Sep 18, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Reviewed By: njriasan

Differential Revision: D82516347
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Reviewed By: NikhilAPatel, njriasan

Differential Revision: D82516347
@facebook-github-bot
Copy link
Contributor

@jananisriram has exported this pull request. If you are a Meta employee, you can view the originating diff in D82516347.

facebook-github-bot pushed a commit that referenced this pull request Sep 18, 2025
Summary:

Support per-row scaling for the FP8 Blackwell persistent + TMA kernel with warp specialization.

Reviewed By: NikhilAPatel, njriasan

Differential Revision: D82516347
@facebook-github-bot facebook-github-bot merged commit 5d4679f into main Sep 18, 2025
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants