
Conversation

@shawntan (Contributor)

Adds Granite model class to vLLM.

The model will be available on Hugging Face once huggingface/transformers#31502 is merged.

@github-actions

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which consists of a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of the default ones by unblocking the steps in your fast-check build in the Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

  • Comment /ready on the PR
  • Add ready label to the PR
  • Enable auto-merge.

🚀

@njhill (Member)

njhill commented Aug 12, 2024

Thanks @shawntan! Could you rebase on the latest main branch, and add a test in https://github.com/vllm-project/vllm/tree/main/tests/models similar to the other model architectures?
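
For reference, a minimal sketch of what such a test could look like, following the hf_runner/vllm_runner pattern used by the existing tests under tests/models (the checkpoint name below is a placeholder, not the actual Granite model ID):

```python
import pytest

# Placeholder checkpoint name, for illustration only.
MODELS = ["ibm/granite-base"]


@pytest.mark.parametrize("model", MODELS)
@pytest.mark.parametrize("dtype", ["bfloat16"])
@pytest.mark.parametrize("max_tokens", [64])
def test_models(hf_runner, vllm_runner, example_prompts, model, dtype,
                max_tokens):
    # Generate greedily with the Hugging Face reference implementation...
    with hf_runner(model, dtype=dtype) as hf_model:
        hf_outputs = hf_model.generate_greedy(example_prompts, max_tokens)

    # ...and with vLLM, then compare the decoded strings prompt by prompt.
    with vllm_runner(model, dtype=dtype) as vllm_model:
        vllm_outputs = vllm_model.generate_greedy(example_prompts, max_tokens)

    for i in range(len(example_prompts)):
        _, hf_str = hf_outputs[i]
        _, vllm_str = vllm_outputs[i]
        assert hf_str == vllm_str, f"Mismatch on prompt {i}"
```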

@njhill marked this pull request as draft on August 12, 2024, 22:34
@njhill (Member)

njhill commented Aug 12, 2024

@shawntan I have moved this to draft since it currently depends on a future version of the transformers library.

How about including the GraniteConfig class in its own file here in the meantime? Then we can clean that up later once vLLM moves to the necessary transformers version.
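
A rough sketch of what that vendored config could look like, assuming a Llama-like architecture plus the Granite scaling multipliers (attribute names follow the transformers PR but may differ in detail):

```python
from transformers import PretrainedConfig


class GraniteConfig(PretrainedConfig):
    """Temporary stand-in until vLLM depends on a transformers release
    that ships GraniteConfig."""

    model_type = "granite"

    def __init__(
        self,
        vocab_size=49152,
        hidden_size=4096,
        intermediate_size=11008,
        num_hidden_layers=32,
        num_attention_heads=32,
        num_key_value_heads=None,
        hidden_act="silu",
        max_position_embeddings=4096,
        rms_norm_eps=1e-6,
        # Granite-specific scaling factors; 1.0 recovers plain Llama behaviour.
        embedding_multiplier=1.0,
        attention_multiplier=1.0,
        residual_multiplier=1.0,
        logits_scaling=1.0,
        **kwargs,
    ):
        self.vocab_size = vocab_size
        self.hidden_size = hidden_size
        self.intermediate_size = intermediate_size
        self.num_hidden_layers = num_hidden_layers
        self.num_attention_heads = num_attention_heads
        self.num_key_value_heads = num_key_value_heads or num_attention_heads
        self.hidden_act = hidden_act
        self.max_position_embeddings = max_position_embeddings
        self.rms_norm_eps = rms_norm_eps
        self.embedding_multiplier = embedding_multiplier
        self.attention_multiplier = attention_multiplier
        self.residual_multiplier = residual_multiplier
        self.logits_scaling = logits_scaling
        super().__init__(**kwargs)
```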

@shawntan force-pushed the granite branch 2 times, most recently from b79bb1d to 19d622f on August 13, 2024, 21:17
@njhill (Member)

njhill commented Aug 16, 2024

@shawntan to avoid duplication and maintenance overhead, would it make more sense to just add the optional multipliers to llama.py?
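
The gist of that approach, as a toy sketch (attribute names are illustrative): read each multiplier off the config with a default of 1.0, so plain Llama checkpoints are unaffected, and apply it on the corresponding branch.

```python
import torch
import torch.nn as nn


class ScaledResidualBlock(nn.Module):
    """Toy block illustrating an optional residual multiplier; with the
    default of 1.0 it reduces to the standard Llama residual connection."""

    def __init__(self, hidden_size: int, residual_multiplier: float = 1.0):
        super().__init__()
        self.linear = nn.Linear(hidden_size, hidden_size)
        self.residual_multiplier = residual_multiplier

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Scale only the sub-layer output, then add the residual.
        return x + self.linear(x) * self.residual_multiplier


# In llama.py the multiplier would come from the model config, e.g.:
#     residual_multiplier = getattr(config, "residual_multiplier", 1.0)
# which keeps configs without the attribute working unchanged.
```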

@shawntan (Contributor, Author)

@njhill HF PR merged.

@njhill (Member) left a comment


Thanks @shawntan, looks great.

However, we may need to keep the config class here for the time being, until a new transformers version is released containing it and vLLM moves to that version. Could you reinstate that?

And take it out of draft if it's now ready to be merged?

@njhill marked this pull request as ready for review on August 29, 2024, 17:45
@njhill added the ready label (ONLY add when PR is ready to merge/full CI is needed) on Aug 29, 2024
@njhill merged commit f8d6014 into vllm-project:main on Sep 2, 2024
Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024
LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Mar 26, 2025
