
Conversation

@Abdennacer-Badaoui
Member

What does this PR do?

Fixes Flash Attention 2 generation with left-padding in GPT-2 by ensuring is_causal=True is set even when padding masks are provided.

Flash Attention 2 handles causal masking and padding independently, but the original code incorrectly disabled causal masking when any attention mask was present.
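
A minimal sketch of the behaviour change, assuming a Flash Attention 2 integration similar to the GPT-2 one (the helper name `_fa2_causal_flag` is illustrative, not the actual transformers source):

```python
# Sketch only: illustrative helper, not the exact transformers code.
# Flash Attention 2 handles padding (via unpadding / cu_seqlens) and causal
# masking (via its `causal` flag) independently, so the presence of a padding
# mask must not switch causal masking off.

def _fa2_causal_flag(attention_mask, query_length, is_causal=True):
    # Buggy behaviour: causal masking was dropped whenever a padding mask was
    # supplied, so left-padded prompts could attend to future positions:
    #   return is_causal and attention_mask is None and query_length > 1

    # Fixed behaviour: keep causal masking even with a padding mask; only the
    # single-token decode step (query_length == 1) may skip it.
    return is_causal and query_length > 1
```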

Fixes: test_flash_attn_2_generate_padding_left

@Abdennacer-Badaoui force-pushed the fix/test_flash_attn_2_generate_padding_left branch from 2877ab9 to 4f20a1f on October 31, 2025 at 15:13
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: decision_transformer, gpt2

@Abdennacer-Badaoui
Member Author

@vasqu, ready for review :)

@remi-or requested review from Cyrilvallez and vasqu and removed the request for Cyrilvallez on November 3, 2025 at 14:00