Skip to content

[inductor] Support out_dtype arg to matmul#163393

Closed
jansel wants to merge 8 commits intogh/jansel/536/basefrom
gh/jansel/536/head
Closed

[inductor] Support out_dtype arg to matmul#163393
jansel wants to merge 8 commits intogh/jansel/536/basefrom
gh/jansel/536/head

Conversation

[ghstack-poisoned]
@pytorch-bot
Copy link

pytorch-bot bot commented Sep 20, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163393

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit b799ad4 with merge base 51152ef (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
Copy link
Contributor

@eellison eellison left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cc @coconutruben to review as well

Lowering for autotuning aten.mm with different backends (Aten, Triton, CUTLASS, etc.)
"""
if out_dtype is not None:
input_dtype = mat1.get_dtype()
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: would expect kernel error checking to already occur in meta registration.

@@ -2238,7 +2238,7 @@ def meta__fused_moving_avg_obs_fq_helper(

@register_meta(aten.mm)
@out_wrapper(exact_dtype=True)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is exact_dtype=True still correct ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure is this referring to matching the reference?

[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@pytorchmergebot
Copy link
Collaborator

Starting merge as part of PR stack under #163422

1 similar comment
@pytorchmergebot
Copy link
Collaborator

Starting merge as part of PR stack under #163422

pytorchmergebot pushed a commit that referenced this pull request Sep 23, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 23, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 24, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 24, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 24, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 24, 2025
This reverts commit a8cd437.

See #163481 (comment)

This PR might also cause issues with cudagraphs.

Pull Request resolved: #163737
Approved by: https://github.com/ezyang
ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481, #163520, #163482
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
jainapurva pushed a commit that referenced this pull request Sep 29, 2025
jainapurva pushed a commit that referenced this pull request Sep 29, 2025
jainapurva pushed a commit that referenced this pull request Sep 29, 2025
jainapurva pushed a commit that referenced this pull request Sep 29, 2025
This reverts commit a8cd437.

See #163481 (comment)

This PR might also cause issues with cudagraphs.

Pull Request resolved: #163737
Approved by: https://github.com/ezyang
ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481, #163520, #163482
@github-actions github-actions bot deleted the gh/jansel/536/head branch October 24, 2025 02:09
Khanaksahu pushed a commit to Khanaksahu/pytorch that referenced this pull request Nov 17, 2025
Fixes #163275


ghstack-source-id: 9fc01dd
Pull-Request: pytorch/pytorch#163393
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants