modify context length for GPTQ + version bump #25899
Conversation
The documentation is not available anymore as the PR was closed or merged.
Amazing work @SunMarc , thanks a lot! 🔥
Thanks, left a few nits on the doc. Could you link to / detail what `exllama` and `act_order` are?
Co-authored-by: Arthur <[email protected]>
It works with the new model path :)
Thanks! (make sure to rebase on main before merging for the failing tests)
* add new arg for gptq
* add tests
* add min version autogptq
* fix order
* skip test
* fix
* Update src/transformers/modeling_utils.py

Co-authored-by: Arthur <[email protected]>

* fix style
* change model path

---------

Co-authored-by: Arthur <[email protected]>
What does this PR do?
This PR adds the possibility to change the max input length when using the exllama backend with act_order. We also bump the required version of auto-gptq to 0.4.2. The GPTQ tests pass; I skipped one test because we need to wait for a release on the optimum side.