Conversation

@awgu awgu (Collaborator) commented Oct 27, 2022

Stack from ghstack:

  • This PR defines a new `api.py` meant to hold the public API for FSDP (minus `FullyShardedDataParallel` itself). This is needed because several of the `_<...>_utils.py` files rely on the public API, and they cannot import it from `torch.distributed.fsdp.fully_sharded_data_parallel` without creating a circular import. Naming the file `api.py` follows the convention used by `ShardedTensor`. (See the import sketch after this list.)
  • This PR cleans up the wording in the `BackwardPrefetch`, `ShardingStrategy`, `MixedPrecision`, and `CPUOffload` docstrings.
  • This PR adds the aforementioned classes to `fsdp.rst` so that they render in the public docs.
  • To abide by the public bindings contract (`test_public_bindings.py`), the aforementioned classes are removed from `fully_sharded_data_parallel.py`'s `__all__`. This is technically BC breaking for anyone who uses `from torch.distributed.fsdp.fully_sharded_data_parallel import *`; however, none of our own external or internal code does so.
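
To make the circular-import fix concrete, here is a minimal sketch of the layout this PR moves toward. The class bodies are abbreviated, and `_runtime_utils.py` is only a stand-in name for one of the `_<...>_utils.py` files:

```python
# torch/distributed/fsdp/api.py (sketch): dependency-light public
# definitions live here, so both the internal `_<...>_utils.py` modules
# and fully_sharded_data_parallel.py can import them.
from dataclasses import dataclass
from enum import Enum, auto

class BackwardPrefetch(Enum):
    BACKWARD_PRE = auto()
    BACKWARD_POST = auto()

@dataclass
class CPUOffload:
    offload_params: bool = False

# A utils module (hypothetical `_runtime_utils.py`) now depends only on
# api.py rather than on the module defining FullyShardedDataParallel:
#
#     from torch.distributed.fsdp.api import BackwardPrefetch
#
# fully_sharded_data_parallel.py imports from api.py the same way, so no
# import edge points back into it and the cycle disappears.
```

Since `api.py` imports nothing from the rest of the FSDP package, every other module in the package can safely depend on it.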


pytorch-bot bot commented Oct 27, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/87917

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit e688429:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

This PR is easy. I moved `BackwardPrefetch` to a new file `common_utils.py` and reworded the docs a bit.

[ghstack-poisoned]
awgu pushed a commit to awgu/pytorch that referenced this pull request Oct 29, 2022
ghstack-source-id: 6dc3b46
Pull Request resolved: pytorch#87917
@awgu awgu added the topic: bc breaking topic category label Oct 29, 2022
@awgu awgu changed the title [FSDP()][3/N] Refactor BackwardPrefetch enum [FSDP()][3/N] Refactor public APIs Oct 29, 2022
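
For reference, the BC break behind the `topic: bc breaking` label is limited to star-imports from the submodule; package-level imports are untouched. A short sketch of the before/after behavior, assuming the re-exports described above:

```python
# Still supported: package-level imports are unchanged by this PR.
from torch.distributed.fsdp import (
    BackwardPrefetch,
    CPUOffload,
    MixedPrecision,
    ShardingStrategy,
)

prefetch = BackwardPrefetch.BACKWARD_PRE
offload = CPUOffload(offload_params=True)

# No longer supported: a star-import from the submodule stops binding
# these names once they leave its `__all__`.
#
#     from torch.distributed.fsdp.fully_sharded_data_parallel import *
#     BackwardPrefetch  # NameError after this PR
```
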
Andrew Gu added 4 commits October 29, 2022 21:35
awgu pushed a commit to awgu/pytorch that referenced this pull request Oct 30, 2022
ghstack-source-id: 0ba00bf
Pull Request resolved: pytorch#87917
kulinseth pushed a commit to kulinseth/pytorch that referenced this pull request Nov 5, 2022
Pull Request resolved: pytorch#87917
Approved by: https://github.com/mrshenli
kulinseth pushed a commit to kulinseth/pytorch that referenced this pull request Dec 10, 2022
Pull Request resolved: pytorch#87917
Approved by: https://github.com/mrshenli
@facebook-github-bot facebook-github-bot deleted the gh/awgu/147/head branch June 8, 2023 15:23

Labels

  • ciflow/trunk (Trigger trunk jobs on your pull request)
  • release notes: distributed (fsdp) (release notes category)
  • topic: bc breaking (topic category)
