[Misc] Update `GPTQ` to use `vLLMParameters` #7976

dsikka · 2024-08-29T00:52:09Z

Summary

Update GPTQ to use `vLLMParameters`.

github-actions · 2024-08-29T00:52:22Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which consists of a small, essential subset of CI tests to quickly catch errors. You can run other CI tests on top of the default ones by unblocking the steps in your fast-check build on the Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

dsikka · 2024-08-29T00:52:42Z

/ready

mgoin · 2024-09-03T21:11:59Z

tests/weight_loading/models.txt

+gptq, robertgshaw2/zephyr-7b-beta-channelwise-gptq, main
+gptq, TheBloke/Llama-2-7B-GPTQ, main


It seems like these are probably covered by the TinyLlama models. Do you think we can remove them?

github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) Aug 29, 2024

maxdebayser mentioned this pull request Sep 3, 2024

[Bug]: Loading GPTQ-quantized GPTBigCode fails in weight_loader_v2 of qptq_marlin #8116

Closed

1 task

dsikka added 3 commits September 3, 2024 15:56

fix loading for unfused pathway

5fa8250

update gptq parameters

3043b80

update lm head; try test fix

e666b22

dsikka force-pushed the update_gptq branch from 7bb6dd3 to e666b22 September 3, 2024 16:05

mgoin approved these changes Sep 3, 2024

mgoin merged commit 2188a60 into vllm-project:main Sep 3, 2024

mgoin deleted the update_gptq branch September 3, 2024 21:21

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Misc] Update GPTQ to use vLLMParameters (vllm-project#7976)

ee73bea

Signed-off-by: Alvant <[email protected]>

LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Mar 26, 2025

[Misc] Update GPTQ to use vLLMParameters (vllm-project#7976)

5bd62ff

Signed-off-by: LeiWang1999 <[email protected]>
