[MPS] Fix conv layout handling (#162776)
⏳ No Failures, 38 Pending — as of commit 337d8ee with merge base eb3fbf5. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
@pytorchbot merge -f "Lint + MPS tests are green"
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
What started as a simple fix for `mps_convolution_backward_input` turned into a fairly significant refactor with several fixes:

- Updated `mps_conv_use_channels_last` to return channels-last output if either the input or the weights are channels last
- Use the same primitive throughout `Convolution.mm` to determine whether the output should be allocated in channels-last format or not

But doing only those two resulted in a crash in `test_memory_format_nn_Conv2d_mps_float32` when the weights are channels last and a bias is present:
```
% python -c "import torch;print(torch.nn.functional.conv2d(torch.rand(2, 4, 3, 4,device='mps'), torch.rand(5, 4, 3, 3,device='mps').to(memory_format=torch.channels_last), torch.rand(5,device='mps')))"
/AppleInternal/Library/BuildRoots/4~B5E4ugDCh2RsPWAjMEoPu8LC5w1yXEwd7XweDhg/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShadersGraph/mpsgraph/MetalPerformanceShadersGraph/Core/Files/MPSGraphExecutable.mm:3619: failed assertion `Error: MLIR pass manager failed'
zsh: abort      python -c
```
which required a more thorough redesign/cleanup, namely:

- Do not alter the layout based on the macOS version; instead, perform additional copies on macOS 14 if the inputs, output, or weights are in channels-last format (done by defining `std::optional<Tensor> output_c;` that holds a contiguous copy of the output tensor)
- Introduced `input_suggested_layout`, which is set to ChannelsLast if and only if the input is channels last and the code is running on macOS 15+
- Deleted the unused `memory_layout` and `group` arguments from `fill_depthwise_conv_desc`
- Fixed the bias broadcasting logic for channels last

As a result, in addition to adding one more regression test, this change removes `expectedFailure` from:

- `TestModule.test_memory_format` for `Conv2d`, `ConvTranspose2d`, `LazyConv1d`, `LazyConvTranspose1d`
- `test_require_stride_expanded_dynamic_shapes`
- `test_mutable_custom_op_fixed_layout2` for macOS 14

Fixes #161905

Pull Request resolved: #162776
Approved by: https://github.com/Skylion007
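For context on why layout handling matters here: a channels-last tensor keeps the logical NCHW shape but stores channels with stride 1 in memory, so a kernel that assumes contiguous (NCHW) strides will misread the data. The sketch below is plain Python (no PyTorch required) showing the two stride layouts for the shapes used in the repro above; the helper names are illustrative and not part of this PR.

```python
# Sketch of the two memory layouts for a 4-D tensor of logical shape (N, C, H, W).
def contiguous_strides(n, c, h, w):
    # NCHW (torch.contiguous_format): W varies fastest in memory
    return (c * h * w, h * w, w, 1)

def channels_last_strides(n, c, h, w):
    # NHWC in memory (torch.channels_last): the channel dim has stride 1,
    # while the logical shape stays (N, C, H, W)
    return (h * w * c, 1, w * c, c)

# Shapes from the repro: input (2, 4, 3, 4)
print(contiguous_strides(2, 4, 3, 4))     # (48, 12, 4, 1)
print(channels_last_strides(2, 4, 3, 4))  # (48, 1, 16, 4)
```

This is why a channels-last weight combined with a contiguous input needs either an explicit layout decision (the `input_suggested_layout` introduced here) or a contiguous copy (the `output_c` fallback on macOS 14).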
cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @jerryzh168 @voznesenskym @penguinwu @EikanWang @Guobing-Chen @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben