[optim] Widen the cases for defaulting to foreach #95820
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/95820
Note: Links to docs will display an error until the docs builds have been completed.
✅ No failures as of commit d9f191b.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
albanD left a comment:
Sounds good!
Did the doc PR land? Does the phrasing on when we use foreach by default need to be changed?
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Doc PR landed last week; the phrasing could be updated slightly from "tensors" to "parameters", but it's not really significant.
Stack from ghstack (oldest at bottom):

Big OOP correction continued. Also added a test this time to verify that the defaulting was as expected.

The key here is realizing that the grouping for foreach already assumes that the non-param tensorlists follow suit in dtype and device, so it is too narrow to check that all tensors were on CUDA. The main leeway this allowed was state_steps, which are sometimes CPU tensors. Since foreach can handle CPU tensors, this should not introduce breakage.
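For readers skimming this from the changelog, here is a minimal sketch of the widened defaulting rule described above. The helper name `default_to_foreach` and its signature are made up for illustration; the actual decision is made by a private helper inside `torch/optim`, and the details there may differ.

```python
import torch
from torch import Tensor
from typing import List


def default_to_foreach(params: List[Tensor], state_steps: List[Tensor]) -> bool:
    """Illustrative sketch only, not the helper torch.optim actually uses.

    Old rule (too narrow): every tensor, state_steps included, had to be on
    CUDA before the multi-tensor (foreach) path was chosen by default; since
    state_steps are often CPU scalar tensors, the default rarely kicked in.

    Widened rule from this PR: the tensorlists grouped with the params already
    share their dtype and device, so checking the params is enough, and CPU
    state_steps are allowed because the foreach kernels can handle them.
    """
    if not torch.cuda.is_available():
        return False
    # Checking the params alone stands in for checking every grouped tensorlist.
    return len(params) > 0 and all(p.is_cuda for p in params)


# Example: CUDA params with CPU step counters should now default to foreach.
if torch.cuda.is_available():
    params = [torch.randn(3, device="cuda"), torch.randn(3, device="cuda")]
    state_steps = [torch.tensor(0.0), torch.tensor(0.0)]  # CPU scalar steps
    assert default_to_foreach(params, state_steps)
```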