Boost performance of MXFP4 quantization with inline PTX #4694

jiawenliu64 · 2025-08-13T16:57:49Z

Summary:

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM
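
For readers unfamiliar with the format, here is a minimal reference sketch of what MXFP4 block quantization computes. It is plain Python, not the PTX kernel from this PR, and it assumes the usual MX layout: 32 elements per block, 4-bit E2M1 elements, and one shared power-of-two scale. The function names `quantize_block` and `dequantize_block` are hypothetical and are not FBGEMM APIs. The sketch ignores the E8M0 scale range and resolves rounding ties toward the smaller magnitude, so a real kernel may round differently.

```python
import math

# Magnitudes representable by the 4-bit E2M1 element format (sign kept separately).
E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
E2M1_EMAX = 2      # exponent of the largest E2M1 value: 6.0 = 1.5 * 2**2
BLOCK_SIZE = 32    # assumed number of elements sharing one scale


def quantize_block(block):
    """Return (shared_exp, codes) for one block of floats.

    shared_exp is the unbiased exponent of the power-of-two scale;
    codes is a list of (sign_bit, index into E2M1_VALUES).
    """
    amax = max(abs(x) for x in block)
    if amax == 0.0:
        return 0, [(0, 0)] * len(block)
    # frexp gives amax = m * 2**e with 0.5 <= m < 1, so floor(log2(amax)) = e - 1.
    _, e = math.frexp(amax)
    shared_exp = (e - 1) - E2M1_EMAX
    scale = 2.0 ** shared_exp
    codes = []
    for x in block:
        # Scale down, then saturate at the largest representable magnitude.
        mag = min(abs(x) / scale, E2M1_VALUES[-1])
        # Nearest representable value; ties go to the smaller magnitude here.
        idx = min(range(len(E2M1_VALUES)), key=lambda i: abs(E2M1_VALUES[i] - mag))
        codes.append((1 if x < 0 else 0, idx))
    return shared_exp, codes


def dequantize_block(shared_exp, codes):
    """Reconstruct floats from the shared exponent and the element codes."""
    scale = 2.0 ** shared_exp
    return [(-1.0 if s else 1.0) * E2M1_VALUES[i] * scale for s, i in codes]


if __name__ == "__main__":
    block = [0.0, 1.0, -3.0, 6.0] + [0.5] * (BLOCK_SIZE - 4)
    shared_exp, codes = quantize_block(block)
    print(shared_exp, dequantize_block(shared_exp, codes)[:4])
```

A block whose values are already representable at the chosen scale round-trips exactly; other values are rounded to the nearest E2M1 magnitude, and anything above 6 times the scale saturates.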

Differential Revision: D80182398

netlify · 2025-08-13T16:57:54Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

| Name | Link |
|---|---|
| 🔨 Latest commit | `e05f3b8` |
| 🔍 Latest deploy log | https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68a3abbcc437e30008f4ea1b |
| 😎 Deploy Preview | https://deploy-preview-4694--pytorch-fbgemm-docs.netlify.app |

To edit notification comments on pull requests, go to your Netlify project configuration.

facebook-github-bot · 2025-08-13T16:57:57Z

This pull request was exported from Phabricator. Differential Revision: D80182398

Summary:
X-link: facebookresearch/FBGEMM#1720

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398

facebook-github-bot · 2025-08-18T22:32:25Z

This pull request was exported from Phabricator. Differential Revision: D80182398

Summary:
Pull Request resolved: pytorch#4694

X-link: facebookresearch/FBGEMM#1720

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398

facebook-github-bot · 2025-08-18T22:39:50Z

This pull request was exported from Phabricator. Differential Revision: D80182398

facebook-github-bot · 2025-08-19T19:29:05Z

This pull request has been merged in 92d6117.

meta-cla bot added the cla signed label Aug 13, 2025

facebook-github-bot added the fb-exported label Aug 13, 2025

jiawenliu64 force-pushed the export-D80182398 branch from fbe1cb1 to ef0dc3a on August 18, 2025 22:28

jiawenliu64 force-pushed the export-D80182398 branch from ef0dc3a to 7c2f9b2 on August 18, 2025 22:29

jiawenliu64 force-pushed the export-D80182398 branch from 7c2f9b2 to 082aa9a on August 18, 2025 22:32

jiawenliu64 force-pushed the export-D80182398 branch from 082aa9a to e05f3b8 on August 18, 2025 22:39

facebook-github-bot closed this in 92d6117 Aug 19, 2025

facebook-github-bot added the Merged label Aug 19, 2025
