[CPU] Add ops for float8 linear #3052

Xia-Weiwen · 2025-09-24T06:08:13Z

Summary
This is part of a previous PR #2505 since the original is too big.
This PR adds two ops for float8 linear on CPU, one for weight packing and the other for computation.

float8_linear_prepack_cpu
float8_linear_cpu

They will be used for float8 tensor subclass in the future.

Test plan

pytest -sv test/test_ops.py -k test_float8_linear_cpu

pytorch-bot · 2025-09-24T06:08:17Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3052

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit fd3d6b5 with merge base 8e2ca35 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Xia-Weiwen · 2025-09-24T06:11:31Z

CC @mingfeima This PR is a copy of the op implementation of the previous PR #2505 because the original PR is too big. Thanks.

mingfeima

LGTM

torchao/csrc/cpu/aten_kernels/float8_linear.cpp

jcaip · 2025-09-29T20:28:04Z

Sorry @Xia-Weiwen @jerryzh168 but this is breaking some internal tests, going to revert: https://www.internalfb.com/tasks/?t=239795912

It looks to be complaining about a warning with the switch statement:

fbcode/pytorch/ao/torchao/csrc/cpu/aten_kernels/float8_linear.cpp:588:3: error: switch condition has boolean value [-Werror,-Wswitch-bool]
  588 |   AT_DISPATCH_LINEAR_KERNEL(output_dtype, cpublas_can_pack, act_quant_mode, wei_quant_mode, [&](){
      |   ^                                       ~~~~~~~~~~~~~~~~
fbcode/pytorch/ao/torchao/csrc/cpu/aten_kernels/utils.h:100:5: note: expanded from macro 'AT_DISPATCH_LINEAR_KERNEL'
  100 |     AT_DISPATCH_BOOL(                                                                   \
      |     ^
  101 |         CAN_PACK, "cpublas_can_pack", can_pack,                                         \
      |         ~~~~~~~~
fbcode/pytorch/ao/torchao/csrc/cpu/aten_kernels/utils.h:84:5: note: expanded from macro 'AT_DISPATCH_BOOL'
   84 |     switch (VALUE) {                                         \
      |     ^       ~~~~~

This reverts commit 5e90c47.

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 24, 2025

Xia-Weiwen added the topic: not user facing Use this tag if you don't want this PR to show up in release notes label Sep 24, 2025

mingfeima approved these changes Sep 24, 2025

View reviewed changes

torchao/csrc/cpu/aten_kernels/float8_linear.cpp Outdated Show resolved Hide resolved

Xia-Weiwen marked this pull request as ready for review September 24, 2025 06:30

Xia-Weiwen requested review from msaroufim and jerryzh168 September 24, 2025 06:30

Xia-Weiwen added 2 commits September 24, 2025 14:00

[CPU] Add ops for float8 linear

5730278

Refine code

fd3d6b5

jerryzh168 approved these changes Sep 25, 2025

View reviewed changes

Xia-Weiwen merged commit 5e90c47 into pytorch:main Sep 25, 2025
18 checks passed

This was referenced Sep 26, 2025

[CPU] add Float8OpaqueTensor for dynamic float8 act float8 weight #3075

Open

[CPU] Add Float8OpaqueTensor for dynamic float8 act float8 weight #2505

Closed

jcaip added a commit that referenced this pull request Sep 29, 2025

Revert "[CPU] Add ops for float8 linear (#3052)"

fe841ad

This reverts commit 5e90c47.

jcaip added a commit that referenced this pull request Sep 29, 2025

Revert "[CPU] Add ops for float8 linear (#3052)" (#3095)

5cbbd73

This reverts commit 5e90c47.

Xia-Weiwen mentioned this pull request Sep 30, 2025

[Reland][CPU] Add ops for float8 linear #3100

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CPU] Add ops for float8 linear #3052

[CPU] Add ops for float8 linear #3052

Xia-Weiwen commented Sep 24, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Sep 24, 2025 •

edited

Loading

Uh oh!

Xia-Weiwen commented Sep 24, 2025 •

edited

Loading

Uh oh!

mingfeima left a comment

Uh oh!

Uh oh!

Uh oh!

jcaip commented Sep 29, 2025

Uh oh!

Uh oh!

[CPU] Add ops for float8 linear #3052

[CPU] Add ops for float8 linear #3052

Conversation

Xia-Weiwen commented Sep 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3052

✅ No Failures

Uh oh!

Xia-Weiwen commented Sep 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mingfeima left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

jcaip commented Sep 29, 2025

Uh oh!

Uh oh!

Xia-Weiwen commented Sep 24, 2025 •

edited

Loading

pytorch-bot bot commented Sep 24, 2025 •

edited

Loading

Xia-Weiwen commented Sep 24, 2025 •

edited

Loading