
Conversation

jananisriram (Contributor)

Summary:
Move scaling logic for FP8 benchmarks to `get_input_iter()`.

This diff aligns our fp8_gemm benchmarking suite with real-world practice: input tensors are created in high-precision types (`bfloat16`, `float16`), scales are computed on those high-precision tensors, and the tensors are then cast to a lower-precision type (`float8_e4m3fn`).

This diff also avoids performing unsupported operations, such as `torch.max` and `torch.abs`, on low-precision data types.
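For context, here is a minimal sketch of the flow this description refers to, assuming simple per-tensor scaling. `fp8_quantize` is an illustrative helper rather than tritonbench's actual API, and `torch._scaled_mm` is a private PyTorch API whose signature has changed across releases; the single-tensor return below assumes roughly PyTorch 2.4+ on FP8-capable hardware.

```python
import torch

def fp8_quantize(x_hp: torch.Tensor, fp8_dtype=torch.float8_e4m3fn):
    """Hypothetical helper: compute a per-tensor scale on the high-precision
    input, then cast it down to FP8 (the flow this diff moves into
    get_input_iter())."""
    fp8_max = torch.finfo(fp8_dtype).max
    # amax/scale are computed on the bfloat16/float16 tensor, since torch.abs
    # and torch.max are not supported on float8 dtypes.
    amax = x_hp.abs().max().to(torch.float32)
    scale = fp8_max / torch.clamp(amax, min=1e-12)
    x_fp8 = (x_hp.to(torch.float32) * scale).clamp(-fp8_max, fp8_max).to(fp8_dtype)
    # Return the inverse scale; the matmul multiplies it back in to dequantize.
    return x_fp8, scale.reciprocal()

device = "cuda"  # FP8 matmul needs FP8-capable hardware (e.g. H100 / SM 8.9+)
a = torch.randn(1024, 1024, dtype=torch.bfloat16, device=device)
b = torch.randn(1024, 1024, dtype=torch.bfloat16, device=device)
a_fp8, a_inv_scale = fp8_quantize(a)
b_fp8, b_inv_scale = fp8_quantize(b)
# The second operand of torch._scaled_mm must be column-major; transposing a
# contiguous tensor satisfies that, so this computes a @ b.T in FP8.
y = torch._scaled_mm(
    a_fp8, b_fp8.t(),
    scale_a=a_inv_scale, scale_b=b_inv_scale,
    out_dtype=torch.bfloat16,
)
```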

Reviewed By: NikhilAPatel

Differential Revision: D80571223

@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D80571223

jananisriram added a commit to jananisriram/tritonbench that referenced this pull request Aug 20, 2025
Summary:
Pull Request resolved: meta-pytorch#338

Move scaling logic for FP8 benchmarks to `get_input_iter()`.

This diff aligns our fp8_gemm benchmarking suite with real-world practice: input tensors are created in high-precision types (`bfloat16`, `float16`), scales are computed on those high-precision tensors, and the tensors are then cast to a lower-precision type (`float8_e4m3fn`).

This diff also avoids performing unsupported operations, such as `torch.max` and `torch.abs`, on low-precision data types.

Reviewed By: NikhilAPatel

Differential Revision: D80571223
jananisriram added a commit to jananisriram/tritonbench that referenced this pull request Aug 21, 2025
Summary:
Move scaling logic for FP8 benchmarks to `get_input_iter()`.

This diff aligns our fp8_gemm benchmarking suite with real-world practice: input tensors are created in high-precision types (`bfloat16`, `float16`), scales are computed on those high-precision tensors, and the tensors are then cast to a lower-precision type (`float8_e4m3fn`).

This diff also avoids performing unsupported operations, such as `torch.max` and `torch.abs`, on low-precision data types.

Test Plan:
Imported from GitHub, without a `Test Plan:` line.

Rollback Plan:

Differential Revision: D80571223

Pulled By: jananisriram
@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D80571223

Summary:
Move scaling logic for FP8 benchmarks to `get_input_iter()`.

This diff aligns our fp8_gemm benchmarking suite with real-world practice: input tensors are created in high-precision types (`bfloat16`, `float16`), scales are computed on those high-precision tensors, and the tensors are then cast to a lower-precision type (`float8_e4m3fn`).

This diff also avoids performing unsupported operations, such as `torch.max` and `torch.abs`, on low-precision data types.

Pull Request resolved: meta-pytorch#338

Test Plan:
Imported from GitHub, without a `Test Plan:` line.

Rollback Plan:

Reviewed By: xuzhao9

Differential Revision: D80571223

Pulled By: jananisriram