
@krishnaraj36
Contributor

@krishnaraj36 krishnaraj36 commented May 7, 2024

Enhanced the OpenCL thread limit and improved the GPU schedules for OpenCL targets.
This improves decode performance by about 20% for some MLC LLM models.

LLM model --- baseline (tok/sec) --- improved (tok/sec)
gemma-2b-it --- 22.4 --- 28.2
Qwen-7b-chat --- 11.0 --- 11.8
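The gain comes from letting schedules use larger workgroups once the OpenCL thread limit is raised. As a minimal sketch (plain Python, hypothetical helper and numbers, not the PR's actual TVM code), here is how a per-workgroup thread cap determines the split of a loop into workgroups: a higher cap yields fewer, larger groups, which can improve occupancy for decode-style kernels.

```python
def split_for_workgroup(total_work, max_threads):
    """Split `total_work` iterations into (num_groups, threads_per_group),
    capping threads per group at the device thread limit."""
    threads = min(total_work, max_threads)
    num_groups = (total_work + threads - 1) // threads  # ceiling division
    return num_groups, threads

# With a 256-thread cap, 4096 iterations become 16 groups of 256 threads.
print(split_for_workgroup(4096, 256))   # (16, 256)
# A larger cap (e.g. 1024) yields fewer, larger workgroups.
print(split_for_workgroup(4096, 1024))  # (4, 1024)
```

The actual limit used for OpenCL targets and the affected schedules are in the PR diff; the numbers above are illustrative only.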

@krishnaraj36
Contributor Author

@srkreddy1238 @tqchen: Please take a look at this PR and let me know your advice.

@tqchen
Member

tqchen commented May 9, 2024

cc @mengshyu: it would be nice to confirm the Metal improvement and see whether we want this for WebGPU/Metal as well.
