Skip to content

fixes a few issues with out_dtype overload for addmm/baddbmm#167931

Closed
ngimel wants to merge 3 commits intomainfrom
ngimel/addmm_dtype
Closed

fixes a few issues with out_dtype overload for addmm/baddbmm#167931
ngimel wants to merge 3 commits intomainfrom
ngimel/addmm_dtype

Conversation

@ngimel
Copy link
Collaborator

@ngimel ngimel commented Nov 16, 2025

Per title

  1. allows self argument to have the same precision as output
  2. fixes broadcasting of self argument - it used to allocate incorrectly sized output and resize it later, causing a warning, in addmm, and error out in baddbmm
  3. fixes out handling for out baddbmm overload, where the implementation used uninitialized memory in out instead of copying self to out.
  4. removes couple unneeded iife patterns
    cc @PaulZhang12 @nikitaved

@pytorch-bot
Copy link

pytorch-bot bot commented Nov 16, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/167931

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 146a4d9 with merge base e5eb89e (image):

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added ciflow/b200 ciflow/h100 ciflow/rocm Trigger "default" config CI on ROCm labels Nov 16, 2025
@ngimel ngimel added the release notes: cuda release notes category label Nov 16, 2025
@ngimel ngimel force-pushed the ngimel/addmm_dtype branch from 68b0050 to c64d82e Compare November 17, 2025 00:18
@ngimel ngimel force-pushed the ngimel/addmm_dtype branch from c64d82e to 1ffde37 Compare November 17, 2025 00:19
Copy link
Contributor

@PaulZhang12 PaulZhang12 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the fixes!

@ngimel ngimel added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 17, 2025
@ngimel
Copy link
Collaborator Author

ngimel commented Nov 17, 2025

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@github-actions github-actions bot deleted the ngimel/addmm_dtype branch December 18, 2025 02:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/b200 ciflow/h100 ciflow/rocm Trigger "default" config CI on ROCm ciflow/trunk Trigger trunk jobs on your pull request Merged release notes: cuda release notes category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants