[LLVM] Support atomic for GPU backend (NVPTX, ROCm) #7051

masahi · 2020-12-07T22:14:34Z

This adds a new tir builtin atomic_add and corresponding lowering rule for LLVM GPU backends. So far, atomic_add is introduced and used by CUDA topi, and LLVM based GPU backend cannot compile ops that use it (nms, scatter_add, argwhere).

Unfortunately I couldn't get atomic_add working for CPU backend. There is some pointer cast issue that llvm IR verifier rejects. I think it is related to implicit cast to i8* done by LLVM CPU backend, but I haven't looked into details. So for now, only GPU backends support lowering atomic_add.

Other restriction is I've only supported 32 bit atomics. Supporting int64 atomic would be desirable but it looks complicated (need to generate CAS loop etc).

Obviously I'm a complete noob to atomic issues, any help would be appreciated.

please review @tqchen @zhiics @yzhliu @yidawang

Laurawly

LGTM. Just a small comment that we can add a todo in the comment for CPU atomic.

zhiics · 2020-12-08T17:14:14Z

Thanks @masahi @Laurawly

* support atomic add on llvm * make atomic builtin intrin * test bincount on nvptx * use builtin::atomic_add * add atomic llvm codegen test, only works on int8 input somehow * supports fp32 atomic * drop support for cpu atomic * add comment * add atomic gpu unit test * reenable other tests * add doc string * run black * fix build with llvm 8 and older * fix format * do not run float32 atomic test on ci * do not run scatter_add 1d with float inputs on CI * fix typo * add todo comment for cpu backend * fix build on ci Co-authored-by: masa <[email protected]>

masa and others added 17 commits December 8, 2020 07:06

support atomic add on llvm

dce3438

make atomic builtin intrin

b8ae806

test bincount on nvptx

e604240

use builtin::atomic_add

30e88dc

add atomic llvm codegen test, only works on int8 input somehow

e37706f

supports fp32 atomic

faf20d6

drop support for cpu atomic

07c50fa

add comment

e48563e

add atomic gpu unit test

8dc9fd6

reenable other tests

9147f9e

add doc string

552aa5f

run black

337f209

fix build with llvm 8 and older

c9f413a

fix format

0d6b480

do not run float32 atomic test on ci

9dea22a

do not run scatter_add 1d with float inputs on CI

7551cba

fix typo

65498b0

Laurawly approved these changes Dec 8, 2020

View reviewed changes

masahi added 2 commits December 8, 2020 20:02

add todo comment for cpu backend

82a887c

fix build on ci

e4eda99

zhiics approved these changes Dec 8, 2020

View reviewed changes

zhiics merged commit 3144cec into apache:main Dec 8, 2020

masahi mentioned this pull request Dec 11, 2020

[ONNX] NMS in ONNX #6839

Merged

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[LLVM] Support atomic for GPU backend (NVPTX, ROCm) #7051

[LLVM] Support atomic for GPU backend (NVPTX, ROCm) #7051

Uh oh!

masahi commented Dec 7, 2020 •

edited

Loading

Uh oh!

Laurawly left a comment •

edited

Loading

Uh oh!

zhiics commented Dec 8, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[LLVM] Support atomic for GPU backend (NVPTX, ROCm) #7051

[LLVM] Support atomic for GPU backend (NVPTX, ROCm) #7051

Uh oh!

Conversation

masahi commented Dec 7, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Laurawly left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zhiics commented Dec 8, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

masahi commented Dec 7, 2020 •

edited

Loading

Laurawly left a comment •

edited

Loading