Skip to content

Conversation

@mrshenli
Copy link
Contributor

@mrshenli mrshenli commented Dec 10, 2022

@pytorch-bot
Copy link

pytorch-bot bot commented Dec 10, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90620

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failures

As of commit 85a007b:

The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the release notes: distributed (fsdp) release notes category label Dec 10, 2022
mrshenli added a commit that referenced this pull request Dec 10, 2022
ghstack-source-id: a6d90b9
Pull Request resolved: #90620
Copy link
Contributor

@rohan-varma rohan-varma left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, but curious what's the use case to skip inputs?

@mrshenli
Copy link
Contributor Author

Thanks @rohan-varma, one model I am working on has one forward argument, which is sensitive to precision. So have to keep it in fp32, instead of converting it to bfloat16 and convert back.

@mrshenli mrshenli added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 10, 2022
@awgu awgu changed the title [FSDP] Allos MixedPrecision to skip inputs [FSDP] Allow MixedPrecision to skip inputs Dec 10, 2022
Copy link
Collaborator

@awgu awgu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall, this looks good to me. Sorry for leaving all of the comments in separate reviews instead of just one.

@awgu
Copy link
Collaborator

awgu commented Dec 11, 2022

My bad for suggesting to un-default cast_forward_inputs. I did not see that it already existed in past tests.

I think you need to add a value for cast_forward_inputs in this line:

model = SaveForwardInputsModel(forward_inputs).cuda()

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 80542ad.

mrshenli added a commit to mrshenli/pytorch that referenced this pull request Dec 12, 2022
ghstack-source-id: d9e229f
Pull Request resolved: pytorch#90620
@facebook-github-bot facebook-github-bot deleted the gh/mrshenli/355/head branch June 8, 2023 18:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/trunk Trigger trunk jobs on your pull request Merged release notes: distributed (fsdp) release notes category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants