Skip to content

Conversation

@MasterJH5574
Copy link
Contributor

This PR bumps the 3rdparty FlashInfer revision to include the efficient sampling function implementation on CUDA.

This PR bumps the 3rdparty FlashInfer revision to include the
efficient sampling function implementation on CUDA.
@tqchen
Copy link
Member

tqchen commented Apr 27, 2024

@tvm-bot rerun

@tqchen tqchen merged commit 0b09ed0 into apache:main Apr 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants