
Conversation

@matthewdouglas (Member)

What does this PR do?

Fixes a test failure that resulted from the introduction of OPTSdpaAttention as the default attention implementation in #33298. Discussed internally on Slack.
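For context, the failing test manually wraps the quantized model's attention projections in small trainable adapter layers before running a training step. A minimal sketch of such an adapter, assuming a standard PyTorch setup (illustrative, not the verbatim test helper):

```python
import torch.nn as nn

class LoRALayer(nn.Module):
    """Illustrative LoRA-style adapter: a frozen base linear layer plus a
    small trainable low-rank update, which supplies the only trainable
    parameters in the training test."""

    def __init__(self, base: nn.Linear, rank: int = 8):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # only the low-rank update trains
        self.lora_A = nn.Linear(base.in_features, rank, bias=False)
        self.lora_B = nn.Linear(rank, base.out_features, bias=False)
        nn.init.zeros_(self.lora_B.weight)  # adapter starts as a no-op

    def forward(self, x):
        return self.base(x) + self.lora_B(self.lora_A(x))
```

The test only attaches these adapters to modules whose class name matches the OPT attention layer, which is where the check discussed below comes in.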

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@BenjaminBossan @SunMarc

@matthewdouglas added the Tests label Oct 25, 2024
@BenjaminBossan (Member) left a comment

Thanks for identifying the issue and providing a fix. I tested the change locally and the tests now pass for me.

@matthewdouglas changed the title from "Fix te" to "Fix bnb training test failure" Oct 25, 2024

@matthewdouglas merged commit e447185 into main Oct 25, 2024
13 checks passed
@matthewdouglas deleted the fix-test-bnb-opt-sdpa branch October 25, 2024 14:23
@SunMarc (Member) commented Oct 25, 2024

I saw that the issue was with OPTSdpaAttention, but why doesn't it work with it? Any ideas? cc @matthewdouglas

@BenjaminBossan (Member)

The issue was that the old check `"OPTAttention" in repr(type(module))` no longer matched any layer, so no LoRA layers were added, hence no trainable parameters were found, resulting in the error.
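In other words, `"OPTAttention"` is not a substring of `"OPTSdpaAttention"`, so the substring test silently matched nothing. A minimal sketch of a check that covers both implementations, reusing the `LoRALayer` sketch above (the exact condition in the PR may differ, and `attach_adapters` is a hypothetical helper name):

```python
def attach_adapters(model, rank: int = 16):
    """Wrap the q/k/v projections of every OPT attention block, whichever
    attention implementation (eager or SDPA) the model was loaded with."""
    matched = 0
    for module in model.modules():
        # Compare the class name exactly rather than substring-matching repr():
        # "OPTAttention" does not occur inside "OPTSdpaAttention".
        if type(module).__name__ in ("OPTAttention", "OPTSdpaAttention"):
            module.q_proj = LoRALayer(module.q_proj, rank=rank)
            module.k_proj = LoRALayer(module.k_proj, rank=rank)
            module.v_proj = LoRALayer(module.v_proj, rank=rank)
            matched += 1
    # Guard against the silent no-match failure mode described above.
    assert matched > 0, "no attention modules matched; nothing would be trainable"
```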

@SunMarc (Member) commented Oct 25, 2024

Oh, I read the PR too fast. Thanks! It makes sense now.

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024
* Fix bnb training test: compatibility with OPTSdpaAttention
ylacombe added a commit that referenced this pull request Dec 10, 2024
* Support BatchNorm in Hubert pos_conv_emb as in fairseq
* Fix bnb training test failure (#34414)
* Fix bnb training test: compatibility with OPTSdpaAttention