[fix] CLIPTextModel with transformers >= 5.6 and from_single_file by asomoza · Pull Request #13843 · huggingface/diffusers

asomoza · 2026-05-30T03:16:54Z

What does this PR do?

This PR fixes the issue when using recent transformers versions and from_single_file with CLIPTextModel . This prevents apps and libraries to pin transformers to more recent versions when they supports this.

Tested with an older version of transformers without the fix and with a newer one (latest as today) with the fix, using SD1.5 and SD2.1 and the images are identical.

SD 1.5

older without fix	newer with fix

SD 2.1

older without fix	newer with fix

fixes #13833

Who can review?

@kappacommit

asomoza · 2026-05-30T03:28:07Z

@askserge review this PR

github-actions

🤗 Serge says:

Clean, correct fix for CLIPTextModel compatibility with transformers ≥5.6 where the text_model wrapper was flattened.

Correctness

The hasattr(model, "text_model") check correctly distinguishes between the old (CLIPTextModel with wrapper) and new (flattened) layouts, while leaving CLIPTextModelWithProjection (which still has text_model) unaffected.
Both convert_ldm_clip_checkpoint and convert_open_clip_checkpoint produce keys prefixed with text_model. in the diffusers format. The removeprefix("text_model.") call correctly strips this for the flattened model, and is a no-op for keys that don't carry the prefix (e.g. text_projection.weight).
removeprefix (Python 3.9+) is already used elsewhere in the codebase.

No issues found. LGTM.

13 LLM turns · 14 tool calls · 66.3s · 148568 in / 2630 out tokens

HuggingFaceDocBuilderDev · 2026-05-30T03:29:34Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ghunkins · 2026-05-31T18:43:30Z

Thanks for the work @asomoza, would love to see this on main 🙏

fix

40a2ea7

github-actions Bot added fixes-issue single-file size/S PR with diff < 50 LOC and removed fixes-issue labels May 30, 2026

asomoza added 2 commits May 29, 2026 23:19

code quality

3c5843d

Merge branch 'main' into clip-text-encoder-transformers-fix

93c7e4f

github-actions Bot added the fixes-issue label May 30, 2026

github-actions Bot reviewed May 30, 2026

View reviewed changes

asomoza requested a review from DN6 May 30, 2026 03:32

kappacommit mentioned this pull request May 30, 2026

feat - Migrate to Transformers 5.5.4 invoke-ai/InvokeAI#9248

Open

5 tasks

DN6 approved these changes Jun 1, 2026

View reviewed changes

asomoza added 2 commits June 1, 2026 12:04

Merge branch 'main' into clip-text-encoder-transformers-fix

4df34ff

Merge branch 'main' into clip-text-encoder-transformers-fix

9036ab1

DN6 merged commit b95637a into main Jun 1, 2026
15 of 16 checks passed

asomoza deleted the clip-text-encoder-transformers-fix branch June 1, 2026 17:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[fix] CLIPTextModel with transformers >= 5.6 and from_single_file#13843

[fix] CLIPTextModel with transformers >= 5.6 and from_single_file#13843
DN6 merged 5 commits into
mainfrom
clip-text-encoder-transformers-fix

asomoza commented May 30, 2026 •

edited

Loading

Uh oh!

asomoza commented May 30, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

HuggingFaceDocBuilderDev commented May 30, 2026

Uh oh!

ghunkins commented May 31, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

asomoza commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

SD 1.5

SD 2.1

Who can review?

Uh oh!

asomoza commented May 30, 2026

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented May 30, 2026

Uh oh!

ghunkins commented May 31, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

asomoza commented May 30, 2026 •

edited

Loading