Cast tensors when loading optimizer state dicts #3658
Merged
Right now optimizers can load state dicts of other optimizers only if all parameters match in type and device (in contrast to nn.Modules). This is too strict for many use cases, and is addressed in this patch.

The only problem is that optimizer state isn't typed in any way, so the code in this PR tries to make reasonable guesses: only state that's bound to particular parameters is cast (with the parameter serving as the template), and we assume that floating point tensors in the state should match the type of the parameter (I can't think of a better way to handle load_state_dict across sets of parameters with different fp types). All other tensors are only moved to the matching device, not cast. A sketch of this heuristic is shown below the issue references.
Fixes #2830, #1442.
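
For illustration, here is a minimal sketch of the casting heuristic described above. The helper name `_cast_state_value` is hypothetical and not the actual code in the patch; it only shows the per-parameter casting rules.

```python
import torch

def _cast_state_value(param, value):
    # Hypothetical helper illustrating the heuristic: state bound to a
    # parameter uses that parameter as the casting template.
    if isinstance(value, torch.Tensor):
        if value.is_floating_point():
            # Floating point state (e.g. running averages) is assumed to
            # follow the dtype and device of its parameter.
            return value.to(dtype=param.dtype, device=param.device)
        # Non-floating-point tensors (e.g. integer counters) are only
        # moved to the parameter's device, never cast.
        return value.to(device=param.device)
    if isinstance(value, dict):
        # Recurse into nested state dictionaries.
        return {k: _cast_state_value(param, v) for k, v in value.items()}
    # Plain Python values are left untouched.
    return value
```

With rules like these, an optimizer state dict saved from an fp16 or CUDA run can be loaded into an fp32 or CPU copy of the same model, since each state tensor is converted to match the parameter it belongs to rather than being required to match exactly.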