-
Notifications
You must be signed in to change notification settings - Fork 26.3k
[DDP] Generalize activation checkpoint tests #74130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
enable these tests to run for all dist backends not just nccl. Differential Revision: [D34281684](https://our.internmc.facebook.com/intern/diff/D34281684/) [ghstack-poisoned]
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit c809226 (more details on the Dr. CI page): 💚 💚 Looks good so far! There are no failures yet. 💚 💚 This comment was automatically generated by Dr. CI (expand for details).Please report bugs/suggestions to the (internal) Dr. CI Users group. |
CI Flow Status⚛️ CI FlowRuleset - Version:
|
enable these tests to run for all dist backends not just nccl. Differential Revision: [D34281684](https://our.internmc.facebook.com/intern/diff/D34281684/) ghstack-source-id: 151201192 Pull Request resolved: #74130
- Enables activation checkpointing tests to run for GLOO backend in addition to NCCL. - Follow up changes will add support for non reentrant-based checkpointing and CheckpointWrapper. - Differential Revision: [D34281684](https://our.internmc.facebook.com/intern/diff/D34281684/) [ghstack-poisoned]
Pull Request resolved: #74130 enable these tests to run for all dist backends not just nccl. ghstack-source-id: 151220384 Differential Revision: [D34281684](https://our.internmc.facebook.com/intern/diff/D34281684/)
- Enables activation checkpointing tests to run for GLOO backend in addition to NCCL. - Follow up changes will add support for non reentrant-based checkpointing and CheckpointWrapper. - Differential Revision: [D34281684](https://our.internmc.facebook.com/intern/diff/D34281684/) [ghstack-poisoned]
Pull Request resolved: #74130 enable these tests to run for all dist backends not just nccl. ghstack-source-id: 151275305 Differential Revision: [D34281684](https://our.internmc.facebook.com/intern/diff/D34281684/)
awgu
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left some minor questions.
- Enables activation checkpointing tests to run for GLOO backend in addition to NCCL. - Follow up changes will add support for non reentrant-based checkpointing and CheckpointWrapper. - Changes AbstractDDPTest to CommonDDPTest to better indicate that it now has tests that are common to both backends - Differential Revision: [D34281684](https://our.internmc.facebook.com/intern/diff/D34281684/) [ghstack-poisoned]
Test wasn't actually doing weight sharing as pointed out by Andrew in #74130 (comment) Differential Revision: [D34904535](https://our.internmc.facebook.com/intern/diff/D34904535/) [ghstack-poisoned]
Test wasn't actually doing weight sharing as pointed out by Andrew in #74130 (comment) Differential Revision: [D34904535](https://our.internmc.facebook.com/intern/diff/D34904535/) ghstack-source-id: 151429409 Pull Request resolved: #74252
awgu
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Summary: Pull Request resolved: #74130 enable these tests to run for all dist backends not just nccl. ghstack-source-id: 151429410 Test Plan: CI Reviewed By: awgu Differential Revision: D34281684 fbshipit-source-id: 956c1b0cafe0502b593dd42b157d518e89a47d8e
Summary: Pull Request resolved: #74252 Test wasn't actually doing weight sharing as pointed out by Andrew in #74130 (comment) ghstack-source-id: 151429409 Test Plan: CI Reviewed By: awgu Differential Revision: D34904535 fbshipit-source-id: 62d1aa9d9fd618176ecd898ca9364a6fa478be6d
Summary: Pull Request resolved: #74252 Test wasn't actually doing weight sharing as pointed out by Andrew in #74130 (comment) ghstack-source-id: 151429409 Test Plan: CI Reviewed By: awgu Differential Revision: D34904535 fbshipit-source-id: 62d1aa9d9fd618176ecd898ca9364a6fa478be6d (cherry picked from commit 03e7b3b)
Stack from ghstack (oldest at bottom):
Differential Revision: D34281684