Can vLLM use an AutoGPTQ-compressed model like llama-2-70b-chat-gptq? #2077

Closed

Opened on Dec 13, 2023

I want to use llama-2-70b-chat-gptq. Can it be used with vLLM?
