-
-
Notifications
You must be signed in to change notification settings - Fork 10.8k
[Easy][Model Registry] Add Llama4ForCausalLM in model registry #19580
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -78,6 +78,7 @@ | |
| "LlamaForCausalLM": ("llama", "LlamaForCausalLM"), | ||
| # For decapoda-research/llama-* | ||
| "LLaMAForCausalLM": ("llama", "LlamaForCausalLM"), | ||
| "Llama4ForCausalLM": ("llama4", "Llama4ForCausalLM"), | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. On a related note, I think the proper way to support the text-only usage of models that are released as "natively multimodal" like llama4 or mistral-small 3.1 is to add a There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Maybe we should just go with "--language-model-only" solution? @liuzijing2014 thoughts? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I see, I will try out this idea for Llama4. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @liuzijing2014 Happy to collaborate on this! This was one of the items that I'm planning to work on too :) |
||
| "MambaForCausalLM": ("mamba", "MambaForCausalLM"), | ||
| "FalconMambaForCausalLM": ("mamba", "MambaForCausalLM"), | ||
| "FalconH1ForCausalLM":("falcon_h1", "FalconH1ForCausalLM"), | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Currently our
basic-models-testalways assumes that the tested architectures have a corresponding huggingface model repository to test with.vllm/tests/models/registry.py
Lines 440 to 447 in ace5cda
Do you think it's possible to add a dummy model repo on HF with the architecture
Llama4ForCausalLM? Alternatively you will need to modifytest_registry.pyfor CI to pass.