[Kernel][Performance] Add Triton kernel for Qwen3-VL interleaved MRoPE #25055
Signed-off-by: Isotr0py <[email protected]>
Benchmark: server, client
Results: Main, PR
Can you also check lm-eval?
@Isotr0py @DarkLight1337 BTW, let's not post the actual eval numbers since the model hasn't been released; just make sure the results match.
Oh, I just realized there are no generation tests for Qwen3-VL yet. Let me add some later today (after my seminar).
I have confirmed that the Qwen3-VL generation test added by #25185 still passes with the Triton kernel.
Purpose
Test Plan
Test Result
Test should pass