Skip to content

Conversation

gshtras
Copy link
Collaborator

@gshtras gshtras commented Apr 3, 2025

Before we can start upstreaming this fusion for V0 or V1 (pending vllm-project#12591 and vllm-project#15734), we should have it as a reference point, and a performance optimization for V1 here

@gshtras gshtras merged commit 732455b into main Apr 7, 2025
2 of 4 checks passed
@gshtras gshtras deleted the fp8_attention_out_v1 branch April 7, 2025 15:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants