-
-
Notifications
You must be signed in to change notification settings - Fork 10.8k
[ROCm][Bugfix] Add missing parameter to ROCm backend #26029
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Gregory Shtrasberg <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request is a bugfix that aligns the get_kv_cache_shape method in RocmAttentionBackend with the base class interface by adding the cache_dtype_str parameter. This change is a follow-up to a broader update that was applied to other attention backends. The new parameter is currently unused within the method, which is consistent with other backends where the KV cache shape does not depend on the cache data type. The change is correct and addresses the interface inconsistency.
) Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: Gregory Shtrasberg <[email protected]> Signed-off-by: yewentao256 <[email protected]>
) Signed-off-by: Gregory Shtrasberg <[email protected]> Signed-off-by: Tomer Asida <[email protected]>
) Signed-off-by: Gregory Shtrasberg <[email protected]>
) Signed-off-by: Gregory Shtrasberg <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
) Signed-off-by: Gregory Shtrasberg <[email protected]>
) Signed-off-by: Gregory Shtrasberg <[email protected]>
) Signed-off-by: Gregory Shtrasberg <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
Follow up to #25896 that skipped ROCm attention backend when adding the new parameter to get_kv_cache_shape