
Conversation

@vasqu
Contributor

@vasqu vasqu commented Oct 20, 2025

CLIP used the old mask APIs, leading to confused usage:

  • A causal mask (the usual triu mask)
  • A padding mask (encoder mask, i.e. only accounting for padding)
  • The sum of both == final mask --> a causal mask with padding

This only works for interfaces that support 4D masks, which disabled FA usage in general.

This PR changes this to the new API, which handles padding automatically. We additionally have to pass the is_causal kwarg to dynamically switch between modality types (text == causal, image == full). This is only enabled through recent PRs (fa #39707, sdpa #41692).

Closes #41673
Fixes #41668
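For illustration, here is a minimal pure-Python sketch of the old scheme described above (not the actual transformers code; names, shapes, and the batch/head dims are simplified away): a causal triu mask and a padding mask are built separately and added into one final additive mask, which only works for interfaces that accept a full 4D mask:

```python
NEG_INF = float("-inf")

def causal_mask(seq_len):
    # additive causal (triu) mask: 0.0 where query q may attend key k (k <= q),
    # -inf where attention is disallowed
    return [[0.0 if k <= q else NEG_INF for k in range(seq_len)]
            for q in range(seq_len)]

def padding_mask(attention_mask):
    # expand a per-token 0/1 padding mask into additive [seq_q, seq_k] form:
    # padded key positions get -inf for every query
    seq_len = len(attention_mask)
    return [[0.0 if attention_mask[k] else NEG_INF for k in range(seq_len)]
            for _ in range(seq_len)]

def combined_mask(attention_mask):
    # the old CLIP approach: add the two masks into one final additive mask
    seq_len = len(attention_mask)
    c, p = causal_mask(seq_len), padding_mask(attention_mask)
    return [[c[q][k] + p[q][k] for k in range(seq_len)]
            for q in range(seq_len)]
```

With the new API, no such precomputed 4D mask is needed: padding is handled from the 1D attention mask, and causality is just the is_causal flag, which keeps FA paths usable.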

@vasqu
Contributor Author

vasqu commented Oct 20, 2025

cc @yonigozlan when you come across models like these in the vision refactors

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@zucchini-nlp zucchini-nlp left a comment


Didn't see that we're changing only the text model. LGTM, as long as the slow tests are passing

@vasqu
Contributor Author

vasqu commented Oct 20, 2025

run-slow: clip

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/clip']
quantizations: [] ...

@vasqu
Contributor Author

vasqu commented Oct 20, 2025

@molbap @zucchini-nlp I changed a few things to align the kwargs with our modern practices, i.e. see 764e63f

This makes the kwargs easy to type properly; otherwise we would probably need to type them as FA kwargs 🤔
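As a generic sketch of that typing pattern (hypothetical names for illustration, not the exact CLIP code): a TypedDict groups the attention kwargs so they can be typed in one place rather than as loose FA kwargs:

```python
from typing import TypedDict

class AttentionKwargs(TypedDict, total=False):
    # hypothetical kwarg names for illustration only
    is_causal: bool
    dropout: float

def attention_forward(hidden_states, **kwargs):
    # on Python 3.11+ the signature could instead be typed as
    # `**kwargs: Unpack[AttentionKwargs]` for full type-checker support
    is_causal = kwargs.get("is_causal", True)
    dropout = kwargs.get("dropout", 0.0)
    return hidden_states, is_causal, dropout
```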

@vasqu vasqu changed the title [Clip] Fix masking and enable flash attention on all model types 🚨 [Clip] Fix masking and enable flash attention on all model types Oct 20, 2025
Contributor

@molbap molbap left a comment


Much better with the typing, thanks!

@vasqu
Contributor Author

vasqu commented Oct 20, 2025

CI has issues today, will probably check again tomorrow (and propagate the changes to metaclip 2 + mlcd) and merge then

@vasqu vasqu added the Vision label Oct 20, 2025
@vasqu
Contributor Author

vasqu commented Oct 20, 2025

run-slow: clip, metaclip_2, mlcd, llava

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/clip', 'models/llava', 'models/metaclip_2', 'models/mlcd']
quantizations: [] ...

@vasqu
Contributor Author

vasqu commented Oct 20, 2025

run-slow: clip, metaclip_2, mlcd, llava

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/clip', 'models/llava', 'models/metaclip_2', 'models/mlcd']
quantizations: [] ...

@vasqu
Contributor Author

vasqu commented Oct 20, 2025

Yeah, the CI is not having a good day :D Locally all the relevant tests passed, especially the integration tests; checking tomorrow

@vasqu
Contributor Author

vasqu commented Oct 21, 2025

run-slow: clip, metaclip_2, mlcd, llava

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/clip', 'models/llava', 'models/metaclip_2', 'models/mlcd']
quantizations: [] ...

@vasqu
Contributor Author

vasqu commented Oct 21, 2025

run-slow: clip, metaclip_2, mlcd, llava

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/clip', 'models/llava', 'models/metaclip_2', 'models/mlcd']
quantizations: [] ...

@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: clip, metaclip_2, mlcd

@vasqu
Contributor Author

vasqu commented Oct 21, 2025

run-slow: clip, metaclip_2, mlcd, llava

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/clip', 'models/llava', 'models/metaclip_2', 'models/mlcd']
quantizations: [] ...

Collaborator

@ArthurZucker ArthurZucker left a comment


thanks

- attn_output = attn_output.reshape(batch_size, seq_length, embed_dim).contiguous()
+ attn_output = attn_output.reshape(batch_size, seq_length, -1).contiguous()
Collaborator


Usually we use -1 for the batch size, as text can be ragged, but not an issue
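For context, reshape infers a single -1 dimension from the total element count, so the choice is only about which size to leave implicit. A small sketch of that inference rule, assuming the usual reshape semantics (illustrative helper, not a transformers API):

```python
def infer_dim(shape, numel):
    # emulate how reshape infers at most one -1 dimension from the others
    known = 1
    neg_idx = None
    for i, d in enumerate(shape):
        if d == -1:
            neg_idx = i
        else:
            known *= d
    out = list(shape)
    if neg_idx is not None:
        out[neg_idx] = numel // known
    return tuple(out)
```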

@vasqu vasqu merged commit 7a833d1 into huggingface:main Oct 24, 2025
19 checks passed
@vasqu vasqu deleted the fix-clip-fa branch October 24, 2025 18:44
i3hz pushed a commit to i3hz/transformers that referenced this pull request Oct 30, 2025
…uggingface#41750)

* fix

* make kwargs fully passed and adjust with outputs xxx

* propagate metaclip 2

* propagate mlcd and fix test

* style

* fix repo consistency, need to add ignore rules as those are building blocks

* style

* oops

* fix mlcd

Successfully merging this pull request may close these issues:

CLIP incompatible with Flash Attention 3

5 participants