[BE][Ez]: Prevent copies of std::vector in CUDA ForeachOps by Skylion007 · Pull Request #163416 · pytorch/pytorch

Skylion007 · 2025-09-20T18:59:13Z

No need for unnecessary copy of std::vectors. This Tensor list is copied throughout the foreach paths and this code is on a hot path for torch optimizers. Auto move elision will not happen on the return statement since it's a subelement of a vector that needs to be copied out before the std::vector is dtor'd. This should reduce quite a few list copies along this path.

…rList

pytorch-bot · 2025-09-20T18:59:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163416

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 502e3cf with merge base d70c0ba ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ezyang · 2025-09-21T02:39:09Z

@pytorchbot merge

pytorchmergebot · 2025-09-21T02:41:01Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…63416) No need for unnecessary copy of std::vectors. This Tensor list is copied throughout the foreach paths and this code is on a hot path for torch optimizers. Auto move elision will not happen on the return statement since it's a subelement of a vector that needs to be copied out before the std::vector is dtor'd. This should reduce quite a few list copies along this path. Pull Request resolved: pytorch#163416 Approved by: https://github.com/ezyang

@ezyang

@ezyang A follow up where I found a few more missing returns of this style in the codebase. Follow up to #163416 Pull Request resolved: #163456 Approved by: https://github.com/cyyever, https://github.com/albanD

…63416) No need for unnecessary copy of std::vectors. This Tensor list is copied throughout the foreach paths and this code is on a hot path for torch optimizers. Auto move elision will not happen on the return statement since it's a subelement of a vector that needs to be copied out before the std::vector is dtor'd. This should reduce quite a few list copies along this path. Pull Request resolved: pytorch#163416 Approved by: https://github.com/ezyang

@ezyang

@ezyang A follow up where I found a few more missing returns of this style in the codebase. Follow up to pytorch#163416 Pull Request resolved: pytorch#163456 Approved by: https://github.com/cyyever, https://github.com/albanD

inspired by #163416 Pull Request resolved: #163599 Approved by: https://github.com/Skylion007

[BE]: Prevent unnecessary copy of std::vector in ForeachBinaryOpScala…

9a09788

…rList

Skylion007 requested review from albanD, ezyang, malfet and ngimel September 20, 2025 18:59

Skylion007 requested review from eqy and syed-ahmed as code owners September 20, 2025 18:59

pytorch-bot bot added the release notes: foreach_frontend release notes category label Sep 20, 2025

pytorchbot added the open source label Sep 20, 2025

Update other Foreach kernels

502e3cf

Skylion007 changed the title ~~[BE]: Prevent copy of std::vector in ForeachBinaryOpScalarList~~ [BE]: Prevent copies of std::vector in CUDA ForeachOps Sep 20, 2025

Skylion007 changed the title ~~[BE]: Prevent copies of std::vector in CUDA ForeachOps~~ [BE][Ez]: Prevent copies of std::vector in CUDA ForeachOps Sep 20, 2025

Skylion007 added the better-engineering Relatively self-contained tasks for better engineering contributors label Sep 20, 2025

Skylion007 requested review from atalman and jansel September 20, 2025 20:09

ezyang approved these changes Sep 21, 2025

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 21, 2025

pytorchmergebot added the merging label Sep 21, 2025

pytorchmergebot added the Merged label Sep 21, 2025

pytorchmergebot closed this in 1ca9445 Sep 21, 2025

pytorchmergebot removed the merging label Sep 21, 2025

Skylion007 mentioned this pull request Sep 21, 2025

[BE]: Add a few more missing move from return indices #163456

Closed

thenumberouscode mentioned this pull request Sep 23, 2025

[BE] Using std::move to reduce copy constructor calls by one. #163599

Closed

pytorchmergebot pushed a commit that referenced this pull request Nov 2, 2025

[BE] Using std::move to reduce copy constructor calls by one. (#163599)

7c203b8

inspired by #163416 Pull Request resolved: #163599 Approved by: https://github.com/Skylion007

pytorch-bot bot pushed a commit that referenced this pull request Nov 4, 2025

[BE] Using std::move to reduce copy constructor calls by one. (#163599)

0ee163c

inspired by #163416 Pull Request resolved: #163599 Approved by: https://github.com/Skylion007

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BE][Ez]: Prevent copies of std::vector in CUDA ForeachOps#163416

[BE][Ez]: Prevent copies of std::vector in CUDA ForeachOps#163416
Skylion007 wants to merge 2 commits intopytorch:mainfrom
Skylion007:skylion007/for-each-binary-op-move-2025-09-20

Skylion007 commented Sep 20, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Sep 20, 2025 •

edited

Loading

Uh oh!

ezyang commented Sep 21, 2025

Uh oh!

pytorchmergebot commented Sep 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Skylion007 commented Sep 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163416

✅ No Failures

Uh oh!

ezyang commented Sep 21, 2025

Uh oh!

pytorchmergebot commented Sep 21, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Skylion007 commented Sep 20, 2025 •

edited

Loading

pytorch-bot bot commented Sep 20, 2025 •

edited

Loading