Fix #12116: preserve boolean dtype for attention masks in ChromaPipeline #12263
DN6 merged 8 commits into huggingface:main
Conversation
- Convert attention masks to bool and prevent dtype corruption (sketched below)
- Fix both positive and negative mask handling in _get_t5_prompt_embeds
- Remove the float conversion in the _prepare_attention_mask method

Fixes huggingface#12116
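For illustration, here is a minimal sketch of the kind of change described above. The helper name and arguments are hypothetical, not the actual diffusers code; it only assumes the mask comes from the tokenizer output.

```python
import torch

def build_t5_attention_mask(text_inputs, device, dtype=None):
    # Hypothetical helper illustrating the intent of the fix, not the
    # actual ChromaPipeline code.
    attention_mask = text_inputs.attention_mask.clone()

    # Before the fix (problematic): a 0/1 mask cast to float16/bfloat16 is
    # treated by scaled_dot_product_attention as an additive bias.
    # attention_mask = attention_mask.to(device=device, dtype=dtype)

    # After the fix: a boolean mask is interpreted as keep/ignore.
    return attention_mask.to(device=device, dtype=torch.bool)
```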
Hello @DN6, just checking in to see if you've had a chance to look at the PR above. If you're not the right person or are busy, would you mind pointing me to someone who could review it? Thanks!
Thanks @akshay-babbar
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
DN6
left a comment
Thanks @akshay-babbar. Minor changes requested.
| ) | ||
| text_input_ids = text_inputs.input_ids | ||
| attention_mask = text_inputs.attention_mask.clone() | ||
| attention_mask = attention_mask.bool() |
Don't think we need the type conversion here. We can just convert the final mask to bool.
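For clarity, a hypothetical sketch of that suggestion (the helper and variable names are illustrative, not the actual pipeline code): intermediate masks keep the tokenizer's integer dtype, and only the mask ultimately passed to the attention call is converted.

```python
import torch

def finalize_attention_mask(attention_mask: torch.Tensor, device: torch.device) -> torch.Tensor:
    # Single bool conversion at the end, applied to the final mask only.
    return attention_mask.to(device).bool()
```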
    assert (output_height, output_width) == (expected_height, expected_width)


class ChromaPipelineAttentionMaskTests(unittest.TestCase):
Dedicated tests aren't needed here. The existing tests should catch changes in numerical output if they are significant.
Hello @DN6, thanks for the review! I have made the changes; please let me know your feedback and the next steps. Thanks!
Sorry for not seeing this earlier; I was only just notified by the commit. Are you sure about passing the attention mask as boolean...? It seems to work, I guess, but it's documented to require a FloatTensor (text encoder: float between 0 and 1).






Problem
Fixes #12116
Short prompts generate corrupted images due to an attention mask dtype conversion bug.
Root Cause
Attention masks were converted from bool to float16/bfloat16, but PyTorch's scaled_dot_product_attention treats a float mask as an additive bias rather than a keep/ignore mask, so padding tokens still contribute to attention; the mask needs to stay boolean.
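A toy demonstration of why the mask dtype matters (not ChromaPipeline code; shapes and values are made up for illustration):

```python
import torch
import torch.nn.functional as F

q = k = v = torch.randn(1, 1, 4, 8)              # (batch, heads, seq, dim)
keep = torch.tensor([True, True, False, False])  # last two tokens are padding

bool_mask = keep.view(1, 1, 1, 4)                # True = attend, False = ignore
float_mask = bool_mask.to(q.dtype)               # 1.0 / 0.0, added to the logits

out_bool = F.scaled_dot_product_attention(q, k, v, attn_mask=bool_mask)
out_float = F.scaled_dot_product_attention(q, k, v, attn_mask=float_mask)

# With the boolean mask the padded keys are excluded; with the 0/1 float
# mask they only get a +0/+1 bias and still contribute to the output.
print(torch.allclose(out_bool, out_float))       # typically False
```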
Solution
Keep the attention mask boolean in _get_t5_prompt_embeds and remove the float conversion in _prepare_attention_mask.
Testing
✅ Added @slow unit tests for dtype preservation
✅ Verified fix with prompts: "man", "cat"
✅ All tests pass locally
Please review when you have a chance. Thank you for your time and consideration!