[TensorIR][Transform] Enable warp shuffling for `LowerWarpMemory` #14280

yzh119 · 2023-03-12T15:03:49Z

Motivation

The LowerWarpMemory pass cannot emit shfl_sync instructions because of an internal check introduced in #9727 . Actually if we load value from another lane in the warp, the local_index would inevitably carry the warp index, and this case would be disabled by the check.

This PR fix the issue by disabling the check and add an unit test for warp shuffling.

The PR depends on #14279 , I'll rebase to upstream/main after that PR is merged.

@Lunderberg @masahi @tqchen

tvm-bot · 2023-03-12T15:03:52Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

No users to tag found in teams: tensorir, transform _{See #10317 for details}

_{Generated by tvm-bot}

yzh119 · 2023-03-12T15:55:31Z

It seems the unit test still works if I add the ICHECK back, I'll close the PR first.

junrushao · 2023-03-12T19:01:01Z

Its fine to keep it open as a draft PR :-)

Lunderberg

Made a couple of comments as I was reading through, though your comment about the tests passing even with the ICHECK present is interesting. It looks like local_index in the case of warp shuffle is 0, and is used to build the A_warp[0] argument to tvm_warp_shuffle.

Lunderberg · 2023-03-13T13:15:11Z

tests/python/unittest/test_tir_transform_lower_warp_memory.py

+            B[vi] = A[(vi % 4) * 8 + vi // 4] + T.float32(1)
+
+
+def test_warp_shuffle_transform():


The test looks reasonable as-is, though there's also a tvm.testing.CompareBeforeAfter that you could use to further reduce the boilerplate.

class TestWarpShuffleTransform(tvm.testing.CompareBeforeAfter): transform = tvm.tir.transform.LowerWarpMemory() def before(A: T.handle("float32", "global"), B: T.handle("float32", "global")): ... def expected(A: T.handle("float32", "global"), B: T.handle("float32", "global")): ...

Lunderberg · 2023-03-13T13:34:06Z

tests/python/unittest/test_tir_transform_lower_warp_memory.py

+        def main(A: T.handle("float32", "global"), B: T.handle("float32", "global")):
+            blockIdx_x = T.env_thread("blockIdx.x")
+            threadIdx_x = T.env_thread("threadIdx.x")
+            T.func_attr(


It looks like the test case only requires the "target" attribute, and only requires "kind" and "thread_warp_size" within that. Can we remove the extra attributes from the unit test?

Lunderberg · 2023-03-13T13:36:14Z

tests/python/unittest/test_tir_transform_lower_warp_memory.py

+            B_warp = T.allocate([32], "float32", "warp")
+            T.launch_thread(threadIdx_x, 32)
+            A_warp_1 = T.Buffer((32,), data=A_warp, scope="warp")
+            A_1 = T.Buffer((32,), data=A)


Instead of having a separate A: T.handle and A_1: T.Buffer, the buffer could be declared as a parameter A_1: T.Buffer(32). It does result in slightly different TIR, as it follows the style from before MakePackedAPI is applied, but for a unit test would help to emphasize the change being tested.

Lunderberg · 2023-03-13T13:37:04Z

tests/python/unittest/test_tir_transform_lower_warp_memory.py

+            A_warp_1[threadIdx_x] = A_1[threadIdx_x]
+            B_warp_1 = T.Buffer((32,), data=B_warp, scope="warp")
+            T.tvm_storage_sync("warp")
+            B_warp_1[threadIdx_x] = A_warp_1[threadIdx_x % 4 * 8 + threadIdx_x // 4] + T.float32(1)


Could we add a comment here, indicating that this line is the one that should be updated correctly?

Lunderberg · 2023-03-13T13:45:10Z

(Also, it looks like the initial check dates back to PR#1050, just with different refactorings that touched that line along the way.)

yzh119 added 7 commits March 11, 2023 10:50

init

e70d8eb

upd

b3b98fa

upd

409f7c5

add tests

9b92252

upd

ec53460

remove _op_wrapper

4cc6681

fix

4a5396a

yzh119 added 6 commits March 12, 2023 08:08

flake

fdabd56

pylint

5c4228a

try

c75c233

upd

107b21e

fix

8ed3953

upd

7d8eff7

yzh119 force-pushed the enable-warp-shuffle branch from 3aea3ff to 7d8eff7 Compare March 12, 2023 15:40

yzh119 closed this Mar 12, 2023

Lunderberg reviewed Mar 13, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[TensorIR][Transform] Enable warp shuffling for `LowerWarpMemory` #14280

[TensorIR][Transform] Enable warp shuffling for `LowerWarpMemory` #14280

Uh oh!

yzh119 commented Mar 12, 2023

Uh oh!

tvm-bot commented Mar 12, 2023

Uh oh!

yzh119 commented Mar 12, 2023 •

edited

Loading

Uh oh!

junrushao commented Mar 12, 2023

Uh oh!

Lunderberg left a comment

Uh oh!

Lunderberg Mar 13, 2023

Uh oh!

Lunderberg Mar 13, 2023

Uh oh!

Lunderberg Mar 13, 2023

Uh oh!

Lunderberg Mar 13, 2023

Uh oh!

Lunderberg commented Mar 13, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		B[vi] = A[(vi % 4) * 8 + vi // 4] + T.float32(1)


		def test_warp_shuffle_transform():

[TensorIR][Transform] Enable warp shuffling for LowerWarpMemory #14280

[TensorIR][Transform] Enable warp shuffling for LowerWarpMemory #14280

Uh oh!

Conversation

yzh119 commented Mar 12, 2023

Motivation

Uh oh!

tvm-bot commented Mar 12, 2023

Uh oh!

yzh119 commented Mar 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

junrushao commented Mar 12, 2023

Uh oh!

Lunderberg left a comment

Choose a reason for hiding this comment

Uh oh!

Lunderberg Mar 13, 2023

Choose a reason for hiding this comment

Uh oh!

Lunderberg Mar 13, 2023

Choose a reason for hiding this comment

Uh oh!

Lunderberg Mar 13, 2023

Choose a reason for hiding this comment

Uh oh!

Lunderberg Mar 13, 2023

Choose a reason for hiding this comment

Uh oh!

Lunderberg commented Mar 13, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[TensorIR][Transform] Enable warp shuffling for `LowerWarpMemory` #14280

[TensorIR][Transform] Enable warp shuffling for `LowerWarpMemory` #14280

yzh119 commented Mar 12, 2023 •

edited

Loading