Add quantized embedding kernels to torchao #1018

Merged

facebook-github-bot merged 1 commit into pytorch:main from metascroy:export-D63839255

Oct 17, 2024

+609 −18

Contributor

metascroy commented Oct 4, 2024

Summary: This diff adds lowbit embedding kernels to torchao. These reuse the same bitpacking code as the linear kernels.

Differential Revision: D63839255
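The kernels themselves are not shown in this conversation. As a rough illustration of the idea in the summary, the sketch below packs low-bit values into a byte stream and looks up a single embedding row by unpacking and dequantizing only that row. Everything here is an assumption made for illustration: the names `pack_lowbit`, `unpack_lowbit`, and `QuantizedEmbedding` are hypothetical, and the little-endian packing layout and per-row min/scale quantization are not taken from torchao.

```python
from typing import List


def pack_lowbit(values: List[int], nbit: int) -> bytes:
    """Pack unsigned nbit-wide integers into a little-endian bit stream."""
    if not 1 <= nbit <= 8:
        raise ValueError("nbit must be between 1 and 8")
    acc = 0
    for i, v in enumerate(values):
        if not 0 <= v < (1 << nbit):
            raise ValueError(f"value {v} does not fit in {nbit} bits")
        acc |= v << (i * nbit)
    nbytes = (len(values) * nbit + 7) // 8
    return acc.to_bytes(nbytes, "little")


def unpack_lowbit(packed: bytes, nbit: int, count: int) -> List[int]:
    """Inverse of pack_lowbit: recover `count` nbit-wide integers."""
    acc = int.from_bytes(packed, "little")
    mask = (1 << nbit) - 1
    return [(acc >> (i * nbit)) & mask for i in range(count)]


class QuantizedEmbedding:
    """Hypothetical embedding table storing each row as packed low-bit codes.

    Assumed scheme (not torchao's): per-row affine quantization,
    x ~= row_min + q * scale, with q in [0, 2**nbit - 1].
    """

    def __init__(self, weights: List[List[float]], nbit: int) -> None:
        self.nbit = nbit
        self.dim = len(weights[0])
        self.rows: List[bytes] = []
        self.scales: List[float] = []
        self.mins: List[float] = []
        qmax = (1 << nbit) - 1
        for row in weights:
            lo, hi = min(row), max(row)
            # Guard against a constant row, where hi == lo.
            scale = (hi - lo) / qmax if hi > lo else 1.0
            codes = [min(qmax, max(0, round((x - lo) / scale))) for x in row]
            self.rows.append(pack_lowbit(codes, nbit))
            self.scales.append(scale)
            self.mins.append(lo)

    def lookup(self, index: int) -> List[float]:
        """Unpack and dequantize only the requested row."""
        codes = unpack_lowbit(self.rows[index], self.nbit, self.dim)
        return [self.mins[index] + q * self.scales[index] for q in codes]


table = QuantizedEmbedding([[0.0, 1.0, 2.0, 3.0], [-1.0, -0.5, 0.5, 1.0]], nbit=4)
row = table.lookup(1)
```

In this sketch the packing helpers know nothing about embeddings, so a linear kernel could share them unchanged; that is the kind of reuse the summary describes, though the real kernels may organize it differently.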

pytorch-bot bot commented Oct 4, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1018

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 2169c32 with merge base 7aaf0ff:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label

Contributor

facebook-github-bot commented Oct 4, 2024

This pull request was exported from Phabricator. Differential Revision: D63839255

facebook-github-bot added the fb-exported label

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

d596bcb

Summary:
Pull Request resolved: pytorch#1018

This diff adds lowbit embedding kernels to torchao.  These reuse the same bitpacking code as the linear kernels.

Differential Revision: D63839255

metascroy force-pushed the export-D63839255 branch from cceed9e to d596bcb

October 4, 2024 20:43

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

10ff165

metascroy force-pushed the export-D63839255 branch from d596bcb to 10ff165

October 4, 2024 21:12

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

99ca201

metascroy force-pushed the export-D63839255 branch from 10ff165 to 99ca201

October 7, 2024 15:17

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

49363f4

metascroy force-pushed the export-D63839255 branch from 99ca201 to 49363f4

October 7, 2024 15:23

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

aabe6db

metascroy force-pushed the export-D63839255 branch from 49363f4 to aabe6db

October 8, 2024 00:15

jerryzh168 approved these changes

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

17e2deb

metascroy force-pushed the export-D63839255 branch from aabe6db to 17e2deb

October 8, 2024 00:24

Contributor Author

metascroy commented Oct 8, 2024

@jerryzh168 if things look good to you, can you approve the diff too? D63839255

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

7ef09aa

metascroy force-pushed the export-D63839255 branch from 17e2deb to 7ef09aa

October 11, 2024 20:41

Contributor

jerryzh168 commented Oct 12, 2024

@metascroy I'm mostly just stamping the ao/experimental PR. Do you need a proper review, or just stamps for the diff?

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

8b62d71

metascroy force-pushed the export-D63839255 branch from 7ef09aa to 8b62d71

October 15, 2024 17:22

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

3b92449

metascroy force-pushed the export-D63839255 branch from 8b62d71 to 3b92449

October 15, 2024 17:33

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

8361f53

metascroy force-pushed the export-D63839255 branch from 3b92449 to 8361f53

October 15, 2024 23:58

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

a719b34

metascroy force-pushed the export-D63839255 branch from 8361f53 to a719b34

October 16, 2024 16:52

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

4aed4c2

metascroy force-pushed the export-D63839255 branch from a719b34 to 4aed4c2

October 16, 2024 20:10

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

e5368f2

metascroy force-pushed the export-D63839255 branch from 4aed4c2 to e5368f2

October 16, 2024 20:16

digantdesai approved these changes

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

cd0f40f

Reviewed By: digantdesai

metascroy force-pushed the export-D63839255 branch from e5368f2 to cd0f40f

October 17, 2024 18:14

metascroy added a commit to metascroy/ao that referenced this pull request

Add quantized embedding kernels to torchao (pytorch#1018)

7af0756

metascroy force-pushed the export-D63839255 branch from cd0f40f to 7af0756

October 17, 2024 18:22


Add quantized embedding kernels to torchao (pytorch#1018)

2169c32

Summary:
Pull Request resolved: pytorch#1018

This diff adds lowbit embedding kernels to torchao.  These reuse the same bitpacking code as the linear kernels.

Reviewed By: digantdesai

Differential Revision: D63839255

metascroy force-pushed the export-D63839255 branch from 7af0756 to 2169c32

October 17, 2024 20:22

facebook-github-bot merged commit 6653b45 into pytorch:main

17 of 19 checks passed

yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request

[AOTI] Add a --max-seq-length option for export (pytorch#1018)

5aed7ae

Summary: This improves best tokens/sec from 73 to 85.

Co-authored-by: Jack-Khuu <[email protected]>

Labels

CLA Signed fb-exported