Conversation

@MasterJH5574
Contributor

This PR introduces MLA attention kernels written in TIR and implements the KV cache MLA computation logic.

A new unit test file is added to verify the correctness of the TIR kernels.

This PR also fixes the tile size initialization in a few TIR prefill kernels.
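
For readers unfamiliar with MLA (multi-head latent attention): unlike standard multi-head attention, all heads attend over a shared compressed KV latent, so the cache stores one latent row (plus a RoPE key part) per token instead of per-head keys and values. The NumPy sketch below illustrates the decode-step computation such a kernel performs and that a unit test would compare a TIR kernel against; all names, shapes, and the softmax scale here are illustrative assumptions, not taken from this PR's kernels.

```python
import numpy as np

def mla_decode_reference(q, kv_cache, kv_lora_rank):
    """Single-token MLA decode attention for one sequence (illustrative).

    q        : (num_heads, kv_lora_rank + rope_dim) per-head query, already
               projected into the shared latent ("weight-absorbed") space.
    kv_cache : (seq_len, kv_lora_rank + rope_dim) one row per cached token:
               the compressed KV latent concatenated with the RoPE key part.
    returns  : (num_heads, kv_lora_rank) per-head output in latent space
               (the caller applies the value up-projection afterwards).
    """
    scale = 1.0 / np.sqrt(q.shape[1])             # illustrative softmax scale
    scores = (q @ kv_cache.T) * scale             # (num_heads, seq_len)
    scores -= scores.max(axis=-1, keepdims=True)  # numerically stable softmax
    probs = np.exp(scores)
    probs /= probs.sum(axis=-1, keepdims=True)
    # The value for every head is the latent part of the shared cache row.
    return probs @ kv_cache[:, :kv_lora_rank]     # (num_heads, kv_lora_rank)

# Smoke test with made-up shapes.
rng = np.random.default_rng(0)
num_heads, kv_lora_rank, rope_dim, seq_len = 16, 512, 64, 128
q = rng.standard_normal((num_heads, kv_lora_rank + rope_dim)).astype(np.float32)
cache = rng.standard_normal((seq_len, kv_lora_rank + rope_dim)).astype(np.float32)
out = mla_decode_reference(q, cache, kv_lora_rank)
assert out.shape == (num_heads, kv_lora_rank)
```

A TIR unit test would typically evaluate the compiled kernel on random inputs like these and check its output against such a reference with a numerical tolerance.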

@MasterJH5574
Contributor Author

@tvm-bot rerun

@MasterJH5574 force-pushed the tvm-dev/2025-02-01-tir-mla branch from 0ab745e to dfa4d07 on February 5, 2025 at 14:12
@jinhongyii merged commit 3eb5ad6 into apache:main on Feb 5, 2025
19 checks passed
ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request on Aug 10, 2025