[INTEGRATION] Add GPTQModel support into transformers + optimum + peft

## Function Uptreaming

#### optimum
[2064](https://github.com/huggingface/optimum/pull/2064) <--  MERGED 
#### transformers
[35012](https://github.com/huggingface/transformers/pull/35012) <-- MERGED
#### peft
[2247](https://github.com/huggingface/peft/pull/2247) <-- MERGED


## Tests

#### optimum
[test_quantization](https://github.com/huggingface/optimum/blob/main/tests/gptq/test_quantization.py)
`RUN_SLOW=1 pytest tests/gptq/test_quantization.py`

- [x] cpu tests
- [x] cuda tests

#### transformers
[test_gptq](https://github.com/huggingface/transformers/blob/main/tests/quantization/gptq/test_gptq.py)
`RUN_SLOW=1 pytest tests/quantization/gptq/test_gptq.py`

- [x] cpu tests
- [x] cuda tests

#### peft
[PeftGPTQGPUTests](https://github.com/huggingface/peft/blob/main/tests/test_gpu_examples.py#L1376)
`pytest tests/test_gpu_examples.py::PeftGPTQTests` and `pytest tests/test_common_gpu.py::PeftCommonTests::test_lora_gptq_quantization_from_pretrained_safetensors`

- [x] cpu tests
- [x] cuda tests

I suppose we don't need new unit tests for gptq in HF, just need to pass all gptq tests with gptqmodel lib. Please help to confirm it. 
cc @Qubitium  @SunMarc 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[INTEGRATION] Add GPTQModel support into transformers + optimum + peft #729

Function Uptreaming

optimum

transformers

peft

Tests

optimum

transformers

peft

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[INTEGRATION] Add GPTQModel support into transformers + optimum + peft #729

Description

Function Uptreaming

optimum

transformers

peft

Tests

optimum

transformers

peft

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions