Skip to content

Conversation

@MasterJH5574
Copy link
Contributor

This PR fixes two bugs of the attention prefill ragged kernel.

  • The first bug is the unroll of loop ki, which causes the TIR build failure in the PointerValueTypeRewrite pass due to vector size.
  • The second is the tile sizes of tile_z and tile_y may violate the assertion check in get_tile_size.

This PR fixes two bugs of the attention prefill ragged kernel.

* The first bug is the unroll of loop `ki`, which causes the TIR build
failure in the PointerValueTypeRewrite pass due to vector size.
* The second is the tile sizes of `tile_z` and `tile_y` may violate
the assertion check in `get_tile_size`.
@tqchen tqchen merged commit 42b1e97 into apache:main Nov 21, 2024
18 checks passed
ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request Aug 10, 2025
…17539)

This PR fixes two bugs of the attention prefill ragged kernel.

* The first bug is the unroll of loop `ki`, which causes the TIR build
failure in the PointerValueTypeRewrite pass due to vector size.
* The second is the tile sizes of `tile_z` and `tile_y` may violate
the assertion check in `get_tile_size`.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants