Update DDP docs for Dynamo/DDPOptimizer #89096

wconstab · 2022-11-15T22:28:54Z

Stack from ghstack (oldest at bottom):

-> Update DDP docs for Dynamo/DDPOptimizer #89096

[ghstack-poisoned]

pytorch-bot · 2022-11-15T22:28:56Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89096

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 Failures

As of commit 55d73a6:

The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 558da2a Pull Request resolved: #89096

msaroufim · 2022-11-15T23:10:30Z

docs/source/notes/ddp.rst

+.. code::
+
+        ddp_model = DDP(model, device_ids=[rank])
+        ddp_model = torch.compile(ddp_model)


So we can't merge this quite yet since won't exist until sometime next week, in the meantime you can can use the optimize API if you'd rather merge this now

no rush, i can just land it once its ready. keep me posted.

msaroufim · 2022-11-15T23:11:17Z

docs/source/notes/ddp.rst

+------------------------
+
+DDP's performance advantage comes from overlapping allreduce collectives with computations during backwards.
+AotAutograd prevents this overlap when used with TorchDynamo for compiling a whole forward and whole backward graph,


I would imagine the DDP audience may not know what AotAutograd is, I'd rather expanding on this a bit more

Maybe a picture would help make things clearer

Do you think it's better to duplicate the picture/explanation here, or would a link out to @davidberard98's blog suffice? He explains it well and has pictures

Link is fine

ok. link is a paragraph below, but i could move it up if you think it would help.

[ghstack-poisoned]

ghstack-source-id: b4d0a1a Pull Request resolved: #89096

[ghstack-poisoned]

ghstack-source-id: a847973 Pull Request resolved: #89096

[ghstack-poisoned]

ghstack-source-id: a8e4fde Pull Request resolved: #89096

[ghstack-poisoned]

ghstack-source-id: 464ccb5 Pull Request resolved: #89096

[ghstack-poisoned]

ghstack-source-id: 43a8e57 Pull Request resolved: #89096

[ghstack-poisoned]

ghstack-source-id: 88cc651 Pull Request resolved: #89096

wconstab · 2022-11-29T23:50:10Z

@pytorchbot merge

pytorchmergebot · 2022-11-29T23:51:45Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2022-11-30T01:32:43Z

Merge failed

Reason: 4 additional jobs have failed, first few of them are: windows-binary-libtorch-debug ,windows-binary-libtorch-debug / libtorch-cpu-shared-with-deps-debug-test ,trunk ,trunk / win-vs2019-cuda11.6-py3 / test (force_on_cpu, 1, 1, windows.4xlarge)

Details for Dev Infra team

Raised by workflow job

wconstab · 2022-11-30T05:48:16Z

@pytorchbot merge -f "unrelated CI fail"

pytorchmergebot · 2022-11-30T05:50:06Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Pull Request resolved: pytorch#89096 Approved by: https://github.com/msaroufim

Update DDP docs for Dynamo/DDPOptimizer

0ffdcc5

[ghstack-poisoned]

This was referenced Nov 15, 2022

Enable DDPOptimizer by default in dynamo #88523

Closed

[dynamo] NNModuleVariable traces into call_function #89015

Closed

wconstab added a commit that referenced this pull request Nov 15, 2022

Update DDP docs for Dynamo/DDPOptimizer

41aad6d

ghstack-source-id: 558da2a Pull Request resolved: #89096

wconstab requested a review from msaroufim November 15, 2022 22:54

msaroufim requested changes Nov 15, 2022

View reviewed changes

Update on "Update DDP docs for Dynamo/DDPOptimizer"

8908710

[ghstack-poisoned]

wconstab mentioned this pull request Nov 16, 2022

Use torchrun for dynamo/distributed.py #89149

Closed

wconstab added a commit that referenced this pull request Nov 16, 2022

Update DDP docs for Dynamo/DDPOptimizer

89c85bc

ghstack-source-id: b4d0a1a Pull Request resolved: #89096

Update on "Update DDP docs for Dynamo/DDPOptimizer"

89ebda0

[ghstack-poisoned]

Update on "Update DDP docs for Dynamo/DDPOptimizer"

bb0ee6e

[ghstack-poisoned]

This was referenced Nov 16, 2022

Fix typo in dist_util.py #89167

Closed

Add torchvis support to dist bench #89324

Closed

[don't land] - debug fsdp stuff #89325

Closed

Special-case fsdp wrapped modules to be Unspecialized #89330

Closed

Update on "Update DDP docs for Dynamo/DDPOptimizer"

e545c53

[ghstack-poisoned]

This was referenced Nov 21, 2022

Add gpu memory profiler script for FSDP bench #89457

Closed

WIP debug fsdp inductor backward hook issue #89458

Closed

Update on "Update DDP docs for Dynamo/DDPOptimizer"

b95a2fa

[ghstack-poisoned]

wconstab mentioned this pull request Nov 21, 2022

Add limited FSDP correctness to torchdynamo benchmark #89469

Closed

Update on "Update DDP docs for Dynamo/DDPOptimizer"

3d7b928

[ghstack-poisoned]

wconstab added a commit that referenced this pull request Nov 22, 2022

Update DDP docs for Dynamo/DDPOptimizer

6f2ce38

ghstack-source-id: a847973 Pull Request resolved: #89096

Update on "Update DDP docs for Dynamo/DDPOptimizer"

fb3af91

[ghstack-poisoned]

Update on "Update DDP docs for Dynamo/DDPOptimizer"

d78bb46

[ghstack-poisoned]

Update on "Update DDP docs for Dynamo/DDPOptimizer"

6400a80

[ghstack-poisoned]

wconstab added a commit that referenced this pull request Nov 29, 2022

Update DDP docs for Dynamo/DDPOptimizer

e39f222

ghstack-source-id: a8e4fde Pull Request resolved: #89096

msaroufim approved these changes Nov 29, 2022

View reviewed changes

Update on "Update DDP docs for Dynamo/DDPOptimizer"

ad8b9f6

[ghstack-poisoned]

Update on "Update DDP docs for Dynamo/DDPOptimizer"

d43bad5

[ghstack-poisoned]

wconstab added a commit that referenced this pull request Nov 29, 2022

Update DDP docs for Dynamo/DDPOptimizer

468bb2f

ghstack-source-id: 464ccb5 Pull Request resolved: #89096

Update on "Update DDP docs for Dynamo/DDPOptimizer"

2ee4db3

[ghstack-poisoned]

wconstab added a commit that referenced this pull request Nov 29, 2022

Update DDP docs for Dynamo/DDPOptimizer

4241bff

ghstack-source-id: 43a8e57 Pull Request resolved: #89096

Update on "Update DDP docs for Dynamo/DDPOptimizer"

49ffe5f

[ghstack-poisoned]

Update on "Update DDP docs for Dynamo/DDPOptimizer"

55d73a6

[ghstack-poisoned]

wconstab added a commit that referenced this pull request Nov 29, 2022

Update DDP docs for Dynamo/DDPOptimizer

c74b4ab

ghstack-source-id: 88cc651 Pull Request resolved: #89096

wconstab added the release notes: distributed (ddp) release notes category label Nov 29, 2022

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 29, 2022

pytorchmergebot added the Merged label Nov 30, 2022

pytorchmergebot closed this in 4472837 Nov 30, 2022

kulinseth pushed a commit to kulinseth/pytorch that referenced this pull request Dec 10, 2022

Update DDP docs for Dynamo/DDPOptimizer (pytorch#89096)

26864b0

Pull Request resolved: pytorch#89096 Approved by: https://github.com/msaroufim

facebook-github-bot deleted the gh/wconstab/38/head branch June 8, 2023 19:17

Update DDP docs for Dynamo/DDPOptimizer #89096

Update DDP docs for Dynamo/DDPOptimizer #89096

Uh oh!

Conversation

wconstab commented Nov 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89096

❌ 2 Failures

Uh oh!

msaroufim Nov 15, 2022

Choose a reason for hiding this comment

Uh oh!

wconstab Nov 21, 2022

Choose a reason for hiding this comment

Uh oh!

msaroufim Nov 15, 2022

Choose a reason for hiding this comment

Uh oh!

wconstab Nov 16, 2022

Choose a reason for hiding this comment

Uh oh!

msaroufim Nov 16, 2022

Choose a reason for hiding this comment

Uh oh!

wconstab Nov 21, 2022

Choose a reason for hiding this comment

Uh oh!

wconstab commented Nov 29, 2022

Uh oh!

pytorchmergebot commented Nov 29, 2022

Merge started

Uh oh!

pytorchmergebot commented Nov 30, 2022

Merge failed

Uh oh!

wconstab commented Nov 30, 2022

Uh oh!

pytorchmergebot commented Nov 30, 2022

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

wconstab commented Nov 15, 2022 •

edited

Loading

pytorch-bot bot commented Nov 15, 2022 •

edited

Loading