Skip to content

Conversation

@MasterJH5574
Copy link
Contributor

This PR supports a "None" Rotary Embedding mode in PagedKVCache. When the mode is None, the rotary embedding will not be applied to when computing attention.

This PR supports a "None" Rotary Embedding mode in
PagedKVCache. When the mode is None, the rotary embedding
will not be applied to when computing attention.
@MasterJH5574 MasterJH5574 force-pushed the tvm-dev/2024-02-15-kv-rope-none branch from 4564d82 to 880733a Compare February 16, 2024 00:19
@tqchen tqchen merged commit 6333d86 into apache:main Feb 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants