Function Uptreaming
optimum
2064 <-- MERGED
transformers
35012 <-- MERGED
peft
2247 <-- MERGED
Tests
optimum
test_quantization
RUN_SLOW=1 pytest tests/gptq/test_quantization.py
transformers
test_gptq
RUN_SLOW=1 pytest tests/quantization/gptq/test_gptq.py
peft
PeftGPTQGPUTests
pytest tests/test_gpu_examples.py::PeftGPTQTests and pytest tests/test_common_gpu.py::PeftCommonTests::test_lora_gptq_quantization_from_pretrained_safetensors
I suppose we don't need new unit tests for gptq in HF, just need to pass all gptq tests with gptqmodel lib. Please help to confirm it.
cc @Qubitium @SunMarc