-
Notifications
You must be signed in to change notification settings - Fork 26.3k
[Reland][FSDP] Mixed precision enablement" #75024
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Original commit changeset: 99295ea4ff02 Original Phabricator Diff: D35000703 Differential Revision: [D35287501](https://our.internmc.facebook.com/intern/diff/D35287501/) [ghstack-poisoned]
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit 01e3c46 (more details on the Dr. CI page):
🕵️ 1 new failure recognized by patternsThe following CI failures do not appear to be due to upstream breakages
|
Original commit changeset: 99295ea4ff02 Original Phabricator Diff: D35000703 Differential Revision: [D35287501](https://our.internmc.facebook.com/intern/diff/D35287501/) ghstack-source-id: 152704893 Pull Request resolved: #75024
mrshenli
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Issue was older nccl version does not support bf16. Will take an approach similar to #67843 to ensure test only runs with later nccl versions.
Do we need to mention this in our docs?
Reland #74452 Issue was older nccl version does not support bf16. Will take an approach similar to #67843 to ensure test only runs with later nccl versions. Original commit changeset: 99295ea4ff02 Original Phabricator Diff: D35000703 Differential Revision: [D35287501](https://our.internmc.facebook.com/intern/diff/D35287501/) [ghstack-poisoned]
Reland #74452 Issue was older nccl version does not support bf16. Will take an approach similar to #67843 to ensure test only runs with later nccl versions. Original commit changeset: 99295ea4ff02 Original Phabricator Diff: D35000703 Differential Revision: [D35287501](https://our.internmc.facebook.com/intern/diff/D35287501/) [ghstack-poisoned]
Reland #74452 Issue was older nccl version does not support bf16. Will take an approach similar to #67843 to ensure test only runs with later nccl versions. Original commit changeset: 99295ea4ff02 Original Phabricator Diff: D35000703 Differential Revision: [D35287501](https://our.internmc.facebook.com/intern/diff/D35287501/) [ghstack-poisoned]
Pull Request resolved: #75024 Original commit changeset: 99295ea4ff02 Original Phabricator Diff: D35000703 ghstack-source-id: 152989929 Differential Revision: [D35287501](https://our.internmc.facebook.com/intern/diff/D35287501/)
Reland #74452 Issue was older nccl version does not support bf16. Will take an approach similar to #67843 to ensure test only runs with later nccl versions. Original commit changeset: 99295ea4ff02 Original Phabricator Diff: D35000703 Differential Revision: [D35287501](https://our.internmc.facebook.com/intern/diff/D35287501/) [ghstack-poisoned]
…75024) Summary: Pull Request resolved: #75024 Original commit changeset: 99295ea4ff02 Original Phabricator Diff: D35000703 (6b0b088) ghstack-source-id: 153059190 (Note: this ignores all push blocking failures!) Test Plan: CI Reviewed By: pbelevich Differential Revision: D35287501 fbshipit-source-id: c6c9ada039de27cf9cc477561f92a7f888bdf5f7
|
Hey @rohan-varma. |
Stack from ghstack (oldest at bottom):
Reland #74452
Issue was older nccl version does not support bf16. Will take an approach similar to #67843 to ensure test only runs with later nccl versions.
Original commit changeset: 99295ea4ff02
Original Phabricator Diff: D35000703
Differential Revision: D35287501