
[Inductor] support mixed dtype in the native_layer_norm_backward meta function#159830

Closed
markc-614 wants to merge 3 commits into pytorch:main from markc-614:mashaobin/inductor/fix-ln-decompostion

Conversation

@markc-614
Contributor

Fixes #159829
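For context, a minimal repro sketch of the mixed-dtype case this change targets (illustrative only, not part of the original PR description; the shapes, dtypes, and device are assumptions based on the linked issue): a layer norm whose input is bf16 while its weight and bias stay fp32, run through torch.compile so the native_layer_norm_backward meta function is exercised.

```python
import torch
import torch.nn.functional as F

# Assumed mixed-dtype setup: bf16 activations with fp32 affine parameters.
x = torch.randn(8, 16, dtype=torch.bfloat16, requires_grad=True)
w = torch.randn(16, dtype=torch.float32, requires_grad=True)
b = torch.randn(16, dtype=torch.float32, requires_grad=True)

def fn(x, w, b):
    return F.layer_norm(x, (16,), w, b)

out = torch.compile(fn)(x, w, b)
out.sum().backward()

# With this fix, each gradient keeps the dtype of the tensor it belongs to:
# x.grad is bf16, while w.grad and b.grad stay fp32 (matching eager mode).
print(x.grad.dtype, w.grad.dtype, b.grad.dtype)
```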

@pytorch-bot

pytorch-bot bot commented Aug 5, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/159830

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 1 Pending

As of commit b257578 with merge base df4ebdd:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@linux-foundation-easycla

linux-foundation-easycla bot commented Aug 5, 2025

CLA Signed

The committers listed above are authorized under a signed CLA.

@markc-614 markc-614 force-pushed the mashaobin/inductor/fix-ln-decompostion branch 2 times, most recently from b10aba6 to 8da3c4f on August 5, 2025 01:41
@markc-614 markc-614 force-pushed the mashaobin/inductor/fix-ln-decompostion branch from 8da3c4f to 51fa8ee on August 5, 2025 01:53
@markc-614
Contributor Author

@pytorchbot label "topic: not user facing"

@pytorch-bot pytorch-bot bot added the "topic: not user facing" label (topic category) on Aug 5, 2025
@markc-614 markc-614 changed the title fix(inductor): support native_layer_norm_backward mixed dtype for privateuse1 [INDUCTOR] support native_layer_norm_backward mixed dtype for privateuse1 Aug 5, 2025
@markc-614 markc-614 changed the title [INDUCTOR] support native_layer_norm_backward mixed dtype for privateuse1 [Inductor] support native_layer_norm_backward mixed dtype for privateuse1 Aug 5, 2025
@markc-614
Contributor Author

@masnesral please help review this, thank you.

@jerryzh168 jerryzh168 added the "triaged" label (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) on Aug 13, 2025
@markc-614 markc-614 marked this pull request as draft August 13, 2025 09:41
@markc-614 markc-614 marked this pull request as ready for review August 13, 2025 09:44
@markc-614
Contributor Author

@eellison @ysiraichi please help review this, thank you.

@eellison eellison requested a review from albanD August 25, 2025 12:32
_maybe_cast(d_input, input.dtype),
_maybe_cast(d_weight, input.dtype),
_maybe_cast(d_bias, input.dtype),
_maybe_cast(d_weight, weight.dtype if weight is not None else None),
Contributor

@albanD is this right?

Collaborator
Yes, it is correct.
It's because the weight and bias are optional; when they're not there, both of these gradients will be None anyway, so no cast is needed.
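To illustrate the point, here is a simplified sketch of the helper pattern being discussed (hypothetical code, not the exact PyTorch source): the cast is skipped whenever the gradient or the target dtype is None, which is why passing weight.dtype if weight is not None else None is safe even when weight is absent.

```python
from typing import Optional

import torch


def _maybe_cast(t: Optional[torch.Tensor], dtype) -> Optional[torch.Tensor]:
    # Simplified sketch: only cast when there is a tensor to cast and a
    # target dtype to cast it to; otherwise pass the value (possibly None) through.
    if t is not None and dtype is not None:
        return t.to(dtype)
    return t


# When weight is None, d_weight is also None, so the call below is a no-op:
# _maybe_cast(d_weight, weight.dtype if weight is not None else None)
```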

Contributor Author

@albanD thanks for your reply. @eellison please help review this, thanks.

Collaborator

@albanD albanD left a comment

SGTM


@albanD
Collaborator

albanD commented Sep 16, 2025

@markc-614 can you rebase on top of the latest main branch so we get CI signal please?

@albanD
Collaborator

albanD commented Sep 16, 2025

Also @markc-614 I think it would be great to test the privateuse1 integration in compile a bit more in core to make it more stable.
We have a few items in #158917 related to the compiler and I think it would be a great addition to the OpenReg in-core testing!
FYI @fffrog

@markc-614 markc-614 force-pushed the mashaobin/inductor/fix-ln-decompostion branch from 6fe8bc7 to 72d0714 on September 17, 2025 01:30
@markc-614
Contributor Author

@markc-614 can you rebase on top of the latest main branch so we get CI signal please?

@albanD done

@markc-614
Contributor Author

Also @markc-614 I think it would be great to test the privateuse1 integration in compile a bit more in core to make it more stable. We have a few items in #158917 related to the compiler and I think it would be a great addition to the OpenReg in-core testing! FYI @fffrog

@albanD do we need to add test cases to torch_openreg?

@fffrog
Collaborator

fffrog commented Sep 17, 2025

We have a few items in #158917 related to the compiler and I think it would be a great addition to the OpenReg in-core testing!
FYI @fffrog

Thank you for the reminder; I will add this to the OpenReg to-do list.

@fffrog
Collaborator

fffrog commented Sep 17, 2025

@albanD do we need to add test cases to torch_openreg?

@markc-614 These changes aren't specific to PrivateUse1; they apply to the general logic for all accelerators in PyTorch, right? Also, torch_openreg currently doesn't support torch.compiler, so you can merge this PR into the main branch first. I'll keep track of this issue and add it to the torch_openreg to-do list.

Any other suggestions, @albanD? :D

@markc-614
Contributor Author

@fffrog Yes, exactly.

@fffrog
Collaborator

fffrog commented Sep 17, 2025

@fffrog Yes, exactly.

Thank you. It would be even better if you could update the title of this PR to remove the "for privateuse1" suffix.

@markc-614 markc-614 changed the title [Inductor] support native_layer_norm_backward mixed dtype for privateuse1 [Inductor] support native_layer_norm_backward mixed dtype Sep 17, 2025
@markc-614 markc-614 changed the title [Inductor] support native_layer_norm_backward mixed dtype [Inductor] support mixed dtype in the native_layer_norm_backward meta function Sep 17, 2025
@markc-614
Contributor Author

@fffrog Yes, exactly.

Thank you. It would be even better if you could update the title of this PR to remove the "for privateuse1" suffix.

@fffrog Done. I have updated the PR title.

Collaborator

@albanD albanD left a comment

Sounds good thanks!

@albanD
Collaborator

albanD commented Sep 17, 2025

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the "ciflow/trunk" label (Trigger trunk jobs on your pull request) on Sep 17, 2025
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

mansiag05 pushed a commit to mansiag05/pytorch that referenced this pull request Sep 22, 2025
cleonard530 pushed a commit to cleonard530/pytorch that referenced this pull request Sep 22, 2025
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025

Labels

ciflow/inductor
ciflow/trunk (Trigger trunk jobs on your pull request)
Merged
open source
topic: not user facing (topic category)
triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

native_layer_norm_backward supports mixed precision for PrivateUse1

7 participants