fix a typo in flax T5 attention - attention_mask variable is misnamed #26663

giganttheo · 2023-10-07T17:32:44Z

What does this PR do?

Fixes a typo in the Flax code for T5 model.

There is a typo in the Attention module of the Flax version of T5, where the attention_mask updated by the _concatenate_to_cache method should override the previous attention_mask but does not because of a misnamed variable.

Fixes #26564

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).

Who can review?

@sanchit-gandhi

sanchit-gandhi

Very nice @giganttheo! Thanks for identifying the bug and proposing the fix 🤗 Confirming that the slow tests pass following the fix? As per #26564 (comment) If so, then this all LGTM!

HuggingFaceDocBuilderDev · 2023-10-09T16:56:33Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

giganttheo · 2023-10-10T13:49:20Z

Very nice @giganttheo! Thanks for identifying the bug and proposing the fix 🤗 Confirming that the slow tests pass following the fix? As per #26564 (comment) If so, then this all LGTM!

The slow tests are passing for t5 and longt5:

RUN_SLOW=1 pytest -sv tests/models/t5/test_modeling_flax_t5.py::FlaxT5ModelIntegrationTests

outputs: ================== 6 passed, 4 warnings in 331.38s (0:05:31) ===================

and for the longT5 version:

RUN_SLOW=1 pytest -sv tests/models/longt5/test_modeling_flax_longt5.py::FlaxLongT5ModelIntegrationTests

outputs: =================== 1 passed, 1 warning in 401.61s (0:06:41) ===================

sanchit-gandhi · 2023-10-10T15:58:20Z

Awesome - thanks for confirming! Requesting a final review from @ArthurZucker

ArthurZucker

LGTM! Thanks for catching 🤗

giganttheo added 2 commits October 7, 2023 19:21

fix a typo in flax t5 attention

62fc06a

fix the typo in flax longt5 attention

5257602

sanchit-gandhi approved these changes Oct 9, 2023

View reviewed changes

sanchit-gandhi requested a review from ArthurZucker October 10, 2023 15:58

ArthurZucker approved these changes Oct 10, 2023

View reviewed changes

ArthurZucker merged commit 975003e into huggingface:main Oct 10, 2023

giganttheo deleted the fix_typo_flax_t5_attn branch May 17, 2024 11:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix a typo in flax T5 attention - attention_mask variable is misnamed #26663

fix a typo in flax T5 attention - attention_mask variable is misnamed #26663

Uh oh!

giganttheo commented Oct 7, 2023 •

edited by sanchit-gandhi

Loading

Uh oh!

sanchit-gandhi left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Oct 9, 2023

Uh oh!

giganttheo commented Oct 10, 2023

Uh oh!

sanchit-gandhi commented Oct 10, 2023

Uh oh!

ArthurZucker left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix a typo in flax T5 attention - attention_mask variable is misnamed #26663

fix a typo in flax T5 attention - attention_mask variable is misnamed #26663

Uh oh!

Conversation

giganttheo commented Oct 7, 2023 • edited by sanchit-gandhi Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

sanchit-gandhi left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Oct 9, 2023

Uh oh!

giganttheo commented Oct 10, 2023

Uh oh!

sanchit-gandhi commented Oct 10, 2023

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

giganttheo commented Oct 7, 2023 •

edited by sanchit-gandhi

Loading