[RUNTIME][CLML] OpenCLML tuning and profiling enhanced #13843

srkreddy1238 · 2023-01-25T13:10:08Z

Tuning cache bin is serialized through DMLC::Stream to support multiple CLML sub graphs with in a tvm module. Individual tuning cache blobs are saved to same output file.

New API on OpenCLWorkspace to enable or disable profiling on command queue rather doing this only when Timer is invoked. This is required to perform CLML operator tuning.

CLML layer profiling now uses OpenCL Timer interface.

This PR also fix avoiding pad operator offloading at the very first layer (to be specific before at least one convolution layer) due to the limitation of CLML pad operator is concerned about layout. Please refer to CLML SDK documentation for more details.

Co-Authored-By: Krishna Raju Vegiraju [email protected]

tvm-bot · 2023-01-25T13:10:11Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @areusch _{See #10317 for details}

_{Generated by tvm-bot}

Tuning cache bin is serialized through DMLC::Stream to support multiple CLML sub graphs with in a tvm module. Individual tuning cache blobs are saved to same output file. New API on OpenCLWorkspace to enable or disable profiling on command queue rather doing this only when Timer is invoked. This is required to perform CLML operator tuning. CLML layer profiling now uses OpenCL Timer interface. This PR also fix avoiding pad operator offloading at the very first layer (to be specific before at least one convolution layer) due to the limitation of CLML pad operator is concerned about layout. Please refer to CLML SDK documentation for more details.

echuraev

Several comments

src/runtime/opencl/opencl_common.h

src/runtime/contrib/clml/clml_runtime.cc

Co-authored-by: Egor Churaev <[email protected]>

echuraev

LGTM. Thanks

* [RUNTIME][CLML] OpenCLML tuning and profiling enhanced Tuning cache bin is serialized through DMLC::Stream to support multiple CLML sub graphs with in a tvm module. Individual tuning cache blobs are saved to same output file. New API on OpenCLWorkspace to enable or disable profiling on command queue rather doing this only when Timer is invoked. This is required to perform CLML operator tuning. CLML layer profiling now uses OpenCL Timer interface. This PR also fix avoiding pad operator offloading at the very first layer (to be specific before at least one convolution layer) due to the limitation of CLML pad operator is concerned about layout. Please refer to CLML SDK documentation for more details. * Update src/runtime/opencl/opencl_common.h Co-authored-by: Egor Churaev <[email protected]> * * review comments --------- Co-authored-by: Egor Churaev <[email protected]>

srkreddy1238 force-pushed the clml_tuning branch 3 times, most recently from 199755d to 4f672d5 Compare January 26, 2023 02:09

srkreddy1238 force-pushed the clml_tuning branch from 4f672d5 to 9960020 Compare January 26, 2023 03:35

echuraev reviewed Jan 26, 2023

View reviewed changes

src/runtime/opencl/opencl_common.h Outdated Show resolved Hide resolved

src/runtime/opencl/opencl_common.h Outdated Show resolved Hide resolved

src/runtime/contrib/clml/clml_runtime.cc Outdated Show resolved Hide resolved

srkreddy1238 and others added 2 commits January 27, 2023 12:24

Update src/runtime/opencl/opencl_common.h

7e77a7e

Co-authored-by: Egor Churaev <[email protected]>

* review comments

535d1e0

srkreddy1238 force-pushed the clml_tuning branch from a8b79f5 to 535d1e0 Compare January 28, 2023 05:11

echuraev approved these changes Jan 30, 2023

View reviewed changes

echuraev merged commit 3c81d9b into apache:main Jan 30, 2023

ysh329 mentioned this pull request Apr 17, 2023

[Release] v0.12.0 Release Candidate Notes #14645

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RUNTIME][CLML] OpenCLML tuning and profiling enhanced #13843

[RUNTIME][CLML] OpenCLML tuning and profiling enhanced #13843

Uh oh!

srkreddy1238 commented Jan 25, 2023 •

edited

Loading

Uh oh!

tvm-bot commented Jan 25, 2023

Uh oh!

echuraev left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

echuraev left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[RUNTIME][CLML] OpenCLML tuning and profiling enhanced #13843

[RUNTIME][CLML] OpenCLML tuning and profiling enhanced #13843

Uh oh!

Conversation

srkreddy1238 commented Jan 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tvm-bot commented Jan 25, 2023

Uh oh!

echuraev left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

echuraev left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

srkreddy1238 commented Jan 25, 2023 •

edited

Loading