[MPS] Add linalg.householder_product for MPS #166090
kurtamohler wants to merge 3 commits into gh/kurtamohler/57/base
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166090
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit 4da24cb with merge base 9038a30. This comment was automatically generated by Dr. CI and updates every 15 minutes.
Attention! native_functions.yaml was changed. If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs: one which adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info. Caused by:
threadgroup_barrier(mem_flags::mem_threadgroup);

T H_prod_0_to_i_rc =
    calc_matmul_rc(H_prod, H, H_stride_r, H_stride_c, m, r, c);
At the moment, performance is much worse than that of the CPU impl, except in some cases where the number of batches is greater than the number of elements in the A matrix times the number of elements in the tau vector.
The vast majority of the runtime is spent in this matrix multiplication. I'm using a naive implementation of matmul, so we should be able to get much better performance if I change it to a tiled matmul. I suppose it should be possible to just reuse the tiled matmul defined earlier in this file, so I will look into that.
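For context, here's a minimal PyTorch-level sketch (not the Metal kernel) of what the op computes and why the kernel reduces to a chain of matmuls. The helper name `householder_product_reference` is hypothetical; it follows the reflector convention documented for `torch.linalg.householder_product`:

```python
import torch

def householder_product_reference(A, tau):
    # Hypothetical reference, for illustration only:
    #   Q = H_1 @ H_2 @ ... @ H_k, where H_i = I - tau[i] * v_i v_i^H
    # and v_i is column i of A with v_i[:i] = 0 and v_i[i] = 1.
    m, n = A.shape
    Q = torch.eye(m, dtype=A.dtype)
    for i in range(tau.shape[0]):
        v = A[:, i].clone()
        v[:i] = 0
        v[i] = 1
        H = torch.eye(m, dtype=A.dtype) - tau[i] * torch.outer(v, v.conj())
        Q = Q @ H  # one full m x m matmul per reflector
    return Q[:, :n]

A = torch.randn(5, 3, dtype=torch.float64)
Aq, tau = torch.geqrf(A)
print(torch.allclose(householder_product_reference(Aq, tau),
                     torch.linalg.householder_product(Aq, tau)))
```

Each of the k reflectors contributes a dense m×m matmul in this formulation, which is why the matmul dominates the runtime.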
I attempted to improve performance (in this branch) by changing the kernel to just generate the Householder matrices and use the existing do_metal_bmm for the matrix multiply. It improved performance slightly in some cases and decreased it in others, but overall it didn't make much of a difference. Maybe the CPU impl isn't actually doing a series of full matrix multiplies and instead uses some simplified formula. I'll have to take a look at the LAPACK impl. But I guess this is probably somewhat low priority.
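In case it helps with the follow-up: one candidate "simplified formula" is to apply each reflector as a rank-1 update instead of materializing H_i and doing a full matmul, using Q·H_i = Q − tau_i (Q v_i) v_iᴴ. This is only a sketch of the idea (I believe LAPACK's orgqr actually uses a blocked variant of this via larft/larfb); the helper name here is hypothetical:

```python
import torch

def householder_product_rank1(A, tau):
    # Hypothetical sketch: accumulate Q with a rank-1 update per reflector,
    #   Q <- Q - tau[i] * (Q @ v_i) v_i^H,
    # which avoids forming H_i and the full m x m matmul.
    m, n = A.shape
    Q = torch.eye(m, dtype=A.dtype)
    for i in range(tau.shape[0]):
        v = A[:, i].clone()
        v[:i] = 0
        v[i] = 1
        Q = Q - tau[i] * torch.outer(Q @ v, v.conj())
    return Q[:, :n]
```

Each step is then a matrix-vector product plus an outer-product update (roughly O(m²) work per reflector) rather than an O(m³) matmul.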
Fixes pytorch#166089
ghstack-source-id: 1d7b4a8
Pull-Request: pytorch#166090
I will follow up with a performance improvement PR once I understand how to do it.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
- linalg.householder_product for MPS #166090

Fixes #166089