Conversation

@masahi (Member) commented Mar 14, 2023

This PR adds support for cuBLAS offloading in Relax via BYOC. In particular, we are targeting the cuBLASLt API, which has a limited but useful set of epilogue operations (bias / relu / gelu).
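For reference, a minimal sketch of how an epilogue is attached through the cuBLASLt C API (not the PR's exact code; the `op_desc` matmul descriptor, the device pointer `bias_ptr`, and the `CHECK_CUBLAS_ERROR` macro are assumed to be defined elsewhere):

```cpp
// Sketch only: fuse bias + relu into the matmul via the cuBLASLt epilogue
// attribute, assuming a cublasLtMatmulDesc_t `op_desc` created elsewhere and
// a device pointer `bias_ptr` holding the bias vector.
cublasLtEpilogue_t epilogue = CUBLASLT_EPILOGUE_RELU_BIAS;
CHECK_CUBLAS_ERROR(cublasLtMatmulDescSetAttribute(
    op_desc, CUBLASLT_MATMUL_DESC_EPILOGUE, &epilogue, sizeof(epilogue)));
CHECK_CUBLAS_ERROR(cublasLtMatmulDescSetAttribute(
    op_desc, CUBLASLT_MATMUL_DESC_BIAS_POINTER, &bias_ptr, sizeof(bias_ptr)));
// For gelu, CUBLASLT_EPILOGUE_GELU_BIAS would be used instead.
```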

Compared to the CUTLASS BYOC, the introduction of the cuBLAS BYOC is motivated by dynamic shape support: for dynamic shapes we cannot tune CUTLASS kernels, so at build time we either end up choosing a kernel that works for any shape (align1) or developing some runtime heuristics. I realized that cuBLAS doesn't differentiate between static and dynamic shapes, and it already ships with tons of heuristics that are likely better than anything we can come up with. So I believe cuBLAS is a better default solution for dynamic shapes.

cc @vinx13 @yelite @mbaret

@tvm-bot (Collaborator) commented Mar 14, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@github-actions github-actions bot requested review from mbaret and vinx13 March 14, 2023 07:54
```python
    return arg_idx

extract_func = tvm.get_global_func("relax.contrib.extract_arg_idx")
arg_indices = extract_func(pattern_name, f)
return {k: int(v) for k, v in arg_indices.items()}
```
@masahi (Member Author) commented on the diff above:

cc @yelite this has been ported to cpp

@masahi (Member Author) commented Mar 22, 2023

Need to wait for a rebase against main to get #14363 into the unity branch.

@vinx13 (Member) commented Mar 27, 2023

@masahi please rebase as the other PR is merged

@Hzfengsy (Member)

any updates?

@masahi (Member Author) commented Mar 30, 2023

waiting for the next rebase

```cpp
auto C_data = static_cast<char*>(C->data) + C->byte_offset;

CHECK_CUBLAS_ERROR(cublasLtMatmul(hdl, op_desc, alpha, B_data, A_desc, A_data, B_desc, beta,
                                  C_data, C_desc, C_data, C_desc, nullptr, nullptr, 0, nullptr));
```
@vinx13 (Member) commented Mar 30, 2023

The cuBLAS API has a default workspace pool, but it seems cublasLt always requires the workspace to be set explicitly. Does passing nullptr here impact performance? We may want to allocate a default workspace and store it as a thread-local in CublasThreadEntry.

@masahi (Member Author)

Yeah, there are some performance knobs that might be worth exploring, in terms of memory management and algorithm selection (see also https://docs.nvidia.com/cuda/cublas/#heuristics-cache). I haven't tested any of them; those are good items for future work if the cuBLAS BYOC gets more traction.
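The explicit workspace and heuristic-based algorithm selection discussed here could be wired up roughly as follows. This is a hedged sketch, not the PR's code: `hdl`, `op_desc`, the layout descriptors, the data pointers, and the `CHECK_CUBLAS_ERROR` / `CHECK_CUDA_ERROR` macros are assumed to exist, and the 32 MiB workspace size is an arbitrary choice.

```cpp
// Sketch: give cublasLtMatmul an explicit workspace and let the cuBLASLt
// heuristic pick an algorithm, instead of passing nullptr / 0 as above.
size_t workspace_size = 32 * 1024 * 1024;  // arbitrary; could live in CublasThreadEntry
void* workspace = nullptr;
CHECK_CUDA_ERROR(cudaMalloc(&workspace, workspace_size));

cublasLtMatmulPreference_t pref;
CHECK_CUBLAS_ERROR(cublasLtMatmulPreferenceCreate(&pref));
CHECK_CUBLAS_ERROR(cublasLtMatmulPreferenceSetAttribute(
    pref, CUBLASLT_MATMUL_PREF_MAX_WORKSPACE_BYTES, &workspace_size, sizeof(workspace_size)));

// Ask cuBLASLt for its best algorithm under the workspace constraint.
cublasLtMatmulHeuristicResult_t heuristic;
int returned = 0;
CHECK_CUBLAS_ERROR(cublasLtMatmulAlgoGetHeuristic(hdl, op_desc, A_desc, B_desc, C_desc, C_desc,
                                                  pref, /*requestedAlgoCount=*/1, &heuristic,
                                                  &returned));

// Same argument order as the snippet above, but with the chosen algorithm
// and the explicit workspace instead of nullptr / 0.
CHECK_CUBLAS_ERROR(cublasLtMatmul(hdl, op_desc, alpha, B_data, A_desc, A_data, B_desc, beta,
                                  C_data, C_desc, C_data, C_desc, &heuristic.algo, workspace,
                                  workspace_size, nullptr));
```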

@masahi (Member Author) commented Apr 3, 2023

Just realized that the unity branch specifies the CI setup in a different (and old?) way: https://github.com/apache/tvm/blob/unity/ci/jenkins/unity_jenkinsfile.groovy.

In particular, the GPU image is an outdated one: https://github.com/apache/tvm/blob/unity/ci/jenkins/unity_jenkinsfile.groovy#L34. That's why I'm still getting a build error even after updating the CUDA version on main in #14363.

Shouldn't unity be using the same set of CI image tags as main? @tqchen @driazati (UPDATE: just updated the GPU image tag in this PR for now)

@masahi masahi merged commit e54e04d into apache:unity Apr 4, 2023

6 participants