Skip to content

Conversation

jiawenliu64
Copy link
Member

Summary:

  • Add inline PTX to boost MXFP4 quantization kernel performance
  • Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398

Copy link

netlify bot commented Aug 13, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit e05f3b8
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68a3abbcc437e30008f4ea1b
😎 Deploy Preview https://deploy-preview-4694--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@meta-cla meta-cla bot added the cla signed label Aug 13, 2025
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D80182398

jiawenliu64 added a commit to jiawenliu64/FBGEMM that referenced this pull request Aug 15, 2025
Summary:

X-link: facebookresearch/FBGEMM#1720

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398
jiawenliu64 added a commit to jiawenliu64/FBGEMM that referenced this pull request Aug 15, 2025
Summary:

X-link: facebookresearch/FBGEMM#1720

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398
jiawenliu64 added a commit to jiawenliu64/FBGEMM that referenced this pull request Aug 18, 2025
Summary:

X-link: facebookresearch/FBGEMM#1720

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398
jiawenliu64 added a commit to jiawenliu64/FBGEMM that referenced this pull request Aug 18, 2025
Summary:

X-link: facebookresearch/FBGEMM#1720

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D80182398

jiawenliu64 added a commit to jiawenliu64/FBGEMM that referenced this pull request Aug 18, 2025
Summary:
Pull Request resolved: pytorch#4694

X-link: facebookresearch/FBGEMM#1720

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398
Summary:
Pull Request resolved: pytorch#4694

X-link: facebookresearch/FBGEMM#1720

- Add inline PTX to boost MXFP4 quantization kernel performance
- Fix MXFP4 scaling factor in grouped GEMM

Differential Revision: D80182398
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D80182398

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 92d6117.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants