Conversation


@CharlieFRuan CharlieFRuan commented Mar 3, 2025

Overview

This PR supports warp-level shuffle primitives using the newly introduced subgroups feature in WebGPU, and uses them in the allreduce lowering.

The introduced primitives are:

  • subgroupShuffle()
  • subgroupShuffleUp()
  • subgroupShuffleDown()
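To illustrate the warp-level allreduce pattern these primitives enable, here is a CPU-side TypeScript simulation of a shuffle-down tree sum. This is an illustrative sketch, not the PR's actual lowering; the helper name `subgroupSumViaShuffleDown` and the power-of-two subgroup size are assumptions.

```typescript
// CPU simulation of the shuffle-down allreduce pattern that
// subgroupShuffleDown() enables in WGSL. Each "lane" reads the value
// held by lane (id + delta), halving delta each step until lane 0
// holds the full sum.
function subgroupSumViaShuffleDown(lanes: number[]): number {
  const vals = lanes.slice();
  const size = vals.length; // assume a power-of-two subgroup size
  for (let delta = size >> 1; delta > 0; delta >>= 1) {
    // subgroupShuffleDown(v, delta): lane i receives lane (i + delta)'s value
    const shuffled = vals.map((_, i) => vals[(i + delta) % size]);
    for (let i = 0; i < size; i++) {
      vals[i] += shuffled[i];
    }
  }
  return vals[0]; // lane 0 now holds the reduction result
}

// Example: summing a 4-lane subgroup
subgroupSumViaShuffleDown([1, 2, 3, 4]); // → 10
```

On the GPU, each iteration is a single shuffle instruction per lane, avoiding the shared-memory round trips a workgroup-level reduction would need.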

This PR largely follows the Metal counterpart.

Tested end-to-end with WebLLM using Llama3.2-1B-q4f16_1 and Llama3.1-8B-q4f16_1. The dumped WebGPU kernels indeed contain subgroup shuffle primitives: https://gist.github.com/CharlieFRuan/cb54a8db0513ecbbc16c5de8df5ab845

Remaining TODOs

  • Benchmark the speedup
  • Parameterize whether to use subgroups when targeting WebGPU, since not all devices support them
  • Check GPUFeatureName's inclusion of subgroups in @webgpu/types
  • Some WebGPU devices allow more than 256 threads per block; support targeting these different limits
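For the parameterization TODO, the feature gating could look like the following sketch. The helper name `pickRequiredFeatures` is hypothetical; the `"subgroups"` feature name follows the WebGPU spec, and the browser usage in the trailing comment assumes a WebGPU-capable environment.

```typescript
// Only request the "subgroups" feature when the adapter reports it,
// so kernel generation can fall back to a shared-memory reduction on
// devices without support. The helper is pure so it can be exercised
// without a GPU; pickRequiredFeatures is a hypothetical name.
function pickRequiredFeatures(supported: ReadonlySet<string>): string[] {
  const requiredFeatures: string[] = [];
  if (supported.has("subgroups")) {
    requiredFeatures.push("subgroups");
  }
  return requiredFeatures;
}

// Browser usage (assumes navigator.gpu is available):
// const adapter = await navigator.gpu.requestAdapter();
// const device = await adapter!.requestDevice({
//   requiredFeatures: pickRequiredFeatures(adapter!.features) as GPUFeatureName[],
// });
```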

Resources

The review comment below is on this snippet:

```ts
}

const requiredFeatures: GPUFeatureName[] = [];
// TODO(Charlie): cannot type annotate because @webgpu/types
```
Contributor:

@webgpu/types 0.1.55 should work now. See gpuweb/types#167

Member Author:

Great, thanks!
