[FSDP()][13/N] Refactor unshard/reshard/grads #87926
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/87926
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit d827723
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This PR is not too complicated. We just move unshard/reshard/grads out to `_runtime_utils.py` and make them take `state: _State` instead of `self`.
ghstack-source-id: 019c863
Pull Request resolved: pytorch#87926
ghstack-source-id: 7b33348
Pull Request resolved: pytorch#87926
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
This PR is not too complicated. We just move unshard/reshard/grads out to `_runtime_utils.py` and make them take `state: _State` instead of `self`.
Pull Request resolved: pytorch#87926
Approved by: https://github.com/mrshenli
Stack from ghstack:
- #88123 [FSDP] Rename `unflat_param_name` -> `fqn` for consistency
- #88122 [FSDP] Simplify `_get_buffer_names()`
- #88121 [FSDP] Remove unneeded `torch.no_grad()` context when offloading to CPU
- #87941 [FSDP()][26/N] Move `_lazy_init()` into `_fsdp_root_pre_forward()`
- #87940 [FSDP()][25/N] Add `_post_forward_reshard()`
- #87939 [FSDP()][24/N] Refactor `_lazy_init()`
- #87937 [FSDP] Simplify `_reset_lazy_init()`
- #87936 [FSDP()][22/N] Refactor `_cast_buffers()` in `_lazy_init()`
- #87935 [FSDP()][21/N] Refactor `_buffer_name_to_orig_dtype` computation
- #87934 [FSDP] Rename `dtype` to `buffer_name_to_dtype`
- #87933 [FSDP] Remove `device` arg from `_cast_buffers()`
- #87931 [FSDP()][18/N] Refactor `pre_forward_unshard()`
- #87930 [FSDP()][17/N] Refactor `_fsdp_root_pre_forward()`
- #87928 [FSDP()][15/N] Refactor `_init_streams()`

This PR is not too complicated. We just move unshard/reshard/grads out to `_runtime_utils.py` and make them take `state: _State` instead of `self`.
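For readers skimming the diff, the shape of the change is the standard "method to free function" refactor: logic that used to live as methods on the FSDP module and read everything off `self` now lives at module level in `_runtime_utils.py` and reads it off an explicit `state` argument. Here is a minimal runnable sketch of that pattern; the `_State` fields and function bodies below are hypothetical stand-ins for illustration, not the actual FSDP code:

```python
from dataclasses import dataclass, field
from typing import List


# Hypothetical stand-in for FSDP's _State: it carries the attributes the
# runtime functions need, decoupled from the nn.Module subclass.
@dataclass
class _State:
    sharded_params: List[float] = field(default_factory=list)
    unsharded: bool = False


# Free functions in the spirit of _runtime_utils.py: they take `state`
# explicitly instead of being methods that take `self`.
def _unshard(state: _State) -> None:
    # In real FSDP this would all-gather the flat parameter across ranks;
    # here we just flip a flag to show the state-passing shape.
    state.unsharded = True


def _reshard(state: _State) -> None:
    # In real FSDP this would free the unsharded storage, keeping only
    # the local shard.
    state.unsharded = False


if __name__ == "__main__":
    state = _State(sharded_params=[0.1, 0.2])
    _unshard(state)
    assert state.unsharded
    _reshard(state)
    assert not state.unsharded
```

Keeping the runtime logic dependent only on `_State` rather than on `self` appears to be the common thread of this [N/N] refactor series.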