[Codegen][CUDA] Fix: cuda codegen vectorize cast #7561

kongroo · 2021-03-02T03:57:15Z

Data types such as float32x8 and int32x8 are not supported in CUDA, which will result in errors like "Cannot convert type float32x8 to CUDA type!" in code generation. I tried to fix this by storing 2 32-bits values in 1 64-bits value.
Could you please help review this fix? @Laurawly

Laurawly

Just a few comments. Also cc @wpan11nv @vinx13 .

Laurawly · 2021-03-02T06:13:35Z

src/target/source/codegen_cuda.cc

+        }
+        if (!fail) {
+          return;
+        }


Missing break here?

Laurawly · 2021-03-02T06:28:42Z

tests/python/unittest/test_target_codegen_cuda.py

        s = tvm.te.create_schedule(C.op)
-        ob, ib = s[C].split(s[C].op.axis[0], nparts=32)
-        _, iib = s[C].split(ib, factor=4)
+        ob, ib = s[C].split(s[C].op.axis[0], nparts=n // factor)


We can also directly say factor=factor here.

like this?

ob, ib = s[C].split(s[C].op.axis[0], factor=factor) # _, iib = s[C].split(ib, factor=factor) s[C].vectorize(ib)

vinx13 · 2021-03-02T18:46:19Z

Thanks @kongroo @Laurawly

* fix: cuda codegen vectorize cast * style: fix python coding style * fix: missing break * refactor: directly split by factor Co-authored-by: jiangchengquan <[email protected]>

kongroo added 2 commits March 2, 2021 11:43

fix: cuda codegen vectorize cast

4900849

style: fix python coding style

4a9aeec

kongroo force-pushed the fix_cuda_vectorized_cast branch from 0ecc450 to 4a9aeec Compare March 2, 2021 05:36

Laurawly reviewed Mar 2, 2021

View reviewed changes

kongroo added 2 commits March 2, 2021 14:43

fix: missing break

63d9a43

refactor: directly split by factor

4b2e1a3

tqchen assigned Laurawly Mar 2, 2021

tqchen added the status: need review label Mar 2, 2021

tqchen assigned vinx13 and unassigned Laurawly and vinx13 Mar 2, 2021

vinx13 approved these changes Mar 2, 2021

View reviewed changes

vinx13 merged commit 5d354e4 into apache:main Mar 2, 2021

vinx13 added status: accepted and removed status: need review labels Mar 2, 2021

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Codegen][CUDA] Fix: cuda codegen vectorize cast #7561

[Codegen][CUDA] Fix: cuda codegen vectorize cast #7561

Uh oh!

kongroo commented Mar 2, 2021

Uh oh!

Laurawly left a comment

Uh oh!

Laurawly Mar 2, 2021

Uh oh!

kongroo Mar 2, 2021

Uh oh!

Laurawly Mar 2, 2021

Uh oh!

kongroo Mar 2, 2021

Uh oh!

Laurawly Mar 2, 2021

Uh oh!

kongroo Mar 2, 2021

Uh oh!

vinx13 commented Mar 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Codegen][CUDA] Fix: cuda codegen vectorize cast #7561

[Codegen][CUDA] Fix: cuda codegen vectorize cast #7561

Uh oh!

Conversation

kongroo commented Mar 2, 2021

Uh oh!

Laurawly left a comment

Choose a reason for hiding this comment

Uh oh!

Laurawly Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

kongroo Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

Laurawly Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

kongroo Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

Laurawly Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

kongroo Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

vinx13 commented Mar 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants