Fix MSELoss when target.requires_grad is True. #44437

gchanan · 2020-09-09T22:38:02Z

Stack from ghstack:

#44507 Simplify target handling in nn gradcheck.
#44486 Fix SmoothL1Loss when target.requires_grad is True.
#44471 Fix L1Loss when target.requires_grad is True.
#44437 Fix MSELoss when target.requires_grad is True.
#44398 Merge criterion_tests and new_criterion_tests.
#43958 Combine criterion and new criterion tests in test_jit.

MSELoss had a completely different (and incorrect, see #43228) path when target.requires_grad was True.

This PR does the following:

adds derivative support for target via the normal derivatives.yaml route
kills the different (and incorrect) path taken when target.requires_grad was True
modifies the MSELoss CriterionTests to verify that the target derivative is checked.
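For readers who want to see what "derivative support for target" amounts to numerically, here is a minimal sketch in plain Python rather than PyTorch. It assumes the usual definition of MSE, under which the gradient with respect to target is the negative of the gradient with respect to input. The helper names (`mse_loss`, `mse_grad_target`, `numerical_grad_target`) are hypothetical and are not the functions touched by this PR; the finite-difference comparison only mirrors the idea behind a gradient check.

```python
def mse_loss(inp, tgt, reduction="mean"):
    """Scalar MSE over two equal-length lists of floats."""
    total = sum((x - t) ** 2 for x, t in zip(inp, tgt))
    return total / len(inp) if reduction == "mean" else total


def mse_grad_target(inp, tgt, reduction="mean"):
    """Analytic d(loss)/d(target): -2 * (input - target), scaled by 1/n for 'mean'."""
    scale = 1.0 / len(inp) if reduction == "mean" else 1.0
    return [-2.0 * (x - t) * scale for x, t in zip(inp, tgt)]


def numerical_grad_target(inp, tgt, reduction="mean", eps=1e-6):
    """Central finite differences w.r.t. each element of target."""
    grads = []
    for i in range(len(tgt)):
        plus, minus = list(tgt), list(tgt)
        plus[i] += eps
        minus[i] -= eps
        diff = mse_loss(inp, plus, reduction) - mse_loss(inp, minus, reduction)
        grads.append(diff / (2 * eps))
    return grads


inp = [1.0, 2.0, 3.0]
tgt = [0.5, 2.5, 2.0]
for reduction in ("mean", "sum"):
    analytic = mse_grad_target(inp, tgt, reduction)
    numeric = numerical_grad_target(inp, tgt, reduction)
    # The analytic and numerical target gradients should agree closely.
    assert all(abs(a - n) < 1e-6 for a, n in zip(analytic, numeric))
```

The sketch covers only the scalar reductions 'mean' and 'sum'; reduction='none' returns a per-element loss and would need a Jacobian rather than a gradient.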

TODO:

do we still need check_criterion_jacobian when we run grad/gradgrad checks?
ensure the Module tests cover the case where target.requires_grad is True
do we actually test when reduction='none' and reduction='mean'?

Differential Revision: D23612166

dr-ci · 2020-09-09T23:04:21Z

💊 CI failures summary and remediations

As of commit c472bb2 (more details on the Dr. CI page):

1/1 failures possibly* introduced in this PR
- 1/1 non-CircleCI failure(s)

1 failure confirmed as flaky and can be ignored:

pytorch_bazel_test

ci.pytorch.org: 1 failed

Failed: pr/pytorch-linux-bionic-rocm3.7-py3.6

gchanan · 2020-09-10T14:46:42Z

~~also need to fix the cpp API.~~
Look at the detached cases here: https://github.com/pytorch/pytorch/blob/af9cad761ad1dd215470ebce58932b82d6ba8bee/test/test_nn.py#L9787

codecov · 2020-09-10T14:49:51Z

Codecov Report

Merging #44437 into gh/gchanan/322/base will decrease coverage by 0.00%.
The diff coverage is 100.00%.

@@                   Coverage Diff                   @@
##           gh/gchanan/322/base   #44437      +/-   ##
=======================================================
- Coverage                68.00%   67.99%   -0.01%     
=======================================================
  Files                      382      382              
  Lines                    49379    49373       -6     
=======================================================
- Hits                     33578    33572       -6     
  Misses                   15801    15801

| Impacted Files | Coverage Δ | |
|---|---|---|
| torch/nn/functional.py | `92.18% <100.00%> (-0.04%)` | ⬇️ |
| torch/testing/_internal/common_nn.py | `83.06% <100.00%> (ø)` | |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4f72e7c...c472bb2. Read the comment docs.

albanD

lgtm
Not sure why we didn't do that before... :D

albanD · 2020-09-10T19:17:06Z

torch/testing/_internal/common_nn.py

-        # currently compute the gradient w.r.t. target for loss functions.
-        gradcheck(apply_fn, inputs)
+        if target.requires_grad:
+            inputs = inputs + (target,)


I think this works but it is a bit fragile.
Your apply_fn above actually ignores the target that is given in *params. It works only because it captures the same target from the parent scope, and gradcheck ensures that you can do that. But I think it would be clearer to just get the target from the apply_fn inputs properly, no?
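To make the fragility concern concrete, here is a toy sketch in plain Python. It is not how torch.autograd.gradcheck is implemented: `numerical_grads` is a hypothetical checker that evaluates the function on perturbed copies of its arguments, and `apply_fn_captured` / `apply_fn_explicit` are illustrative stand-ins for the two styles discussed. Under that assumption, a function that ignores its target argument and reads the captured one reports a zero target gradient, while the explicit version does not.

```python
def mse_mean(inp, tgt):
    """Mean squared error over two equal-length lists of floats."""
    return sum((x - t) ** 2 for x, t in zip(inp, tgt)) / len(inp)


def numerical_grads(fn, args, eps=1e-6):
    """Central differences w.r.t. every element of every argument.

    Each evaluation receives fresh, perturbed copies of the arguments,
    so the caller's lists are never modified.
    """
    grads = []
    for a, arg in enumerate(args):
        row = []
        for i in range(len(arg)):
            plus = [list(v) for v in args]
            minus = [list(v) for v in args]
            plus[a][i] += eps
            minus[a][i] -= eps
            row.append((fn(*plus) - fn(*minus)) / (2 * eps))
        grads.append(row)
    return grads


inp = [1.0, 2.0, 3.0]
target = [0.5, 2.5, 2.0]


def apply_fn_captured(inp_arg, *params):
    # Ignores *params and reads `target` from the enclosing scope.
    return mse_mean(inp_arg, target)


def apply_fn_explicit(inp_arg, tgt_arg):
    # Uses the target that the checker actually passes in.
    return mse_mean(inp_arg, tgt_arg)


captured = numerical_grads(apply_fn_captured, (inp, target))
explicit = numerical_grads(apply_fn_explicit, (inp, target))

# The captured version never sees the perturbed target copies.
assert all(g == 0.0 for g in captured[1])
# The explicit version recovers a non-zero target gradient.
assert any(abs(g) > 0.1 for g in explicit[1])
```

Taking the target from the function's own inputs removes the dependence on how the checker delivers its perturbations, which is the clearer form suggested in the comment above.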

facebook-github-bot · 2020-09-11T16:18:16Z

@gchanan merged this pull request in d07d25a.

Summary: Pull Request resolved: #44437

MSELoss had a completely different (and incorrect, see #43228) path when target.requires_grad was True.

This PR does the following:
1) adds derivative support for target via the normal derivatives.yaml route
2) kill the different (and incorrect) path for when target.requires_grad was True
3) modify the MSELoss CriterionTests to verify that the target derivative is checked.

TODO:
1) do we still need check_criterion_jacobian when we run grad/gradgrad checks?
2) ensure the Module tests check when target.requires_grad
3) do we actually test when reduction='none' and reduction='mean'?

Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23612166
Pulled By: gchanan
fbshipit-source-id: 4f74d38d8a81063c74e002e07fbb7837b2172a10

gchanan requested a review from apaszke as a code owner September 9, 2020 22:38

This was referenced Sep 9, 2020

Combine criterion and new criterion tests in test_jit. #43958

Closed

Merge criterion_tests and new_criterion_tests. #44398

Closed

gchanan requested review from ebetica, goldsborough and yf225 as code owners September 10, 2020 15:01

gchanan mentioned this pull request Sep 10, 2020

Fix L1Loss when target.requires_grad is True. #44471

Closed

gchanan requested a review from albanD September 10, 2020 16:57

albanD approved these changes Sep 10, 2020

gchanan mentioned this pull request Sep 10, 2020

Fix SmoothL1Loss when target.requires_grad is True. #44486

Closed

albanD reviewed Sep 10, 2020

albanD approved these changes Sep 10, 2020

gchanan mentioned this pull request Sep 10, 2020

Simplify target handling in nn gradcheck. #44507

Closed

facebook-github-bot closed this in d07d25a Sep 11, 2020

facebook-github-bot added the merged label Sep 11, 2020

gchanan mentioned this pull request Sep 11, 2020

F.mse_loss(a, b, reduction='elementwise_mean') value is incorrect and doesn't show deprecation warning when 2nd argument requires gradient #43228

Closed

glaringlee mentioned this pull request Sep 11, 2020

pytorch cpp api the derivative for 'target' is not implemented #16830

Closed

facebook-github-bot deleted the gh/gchanan/322/head branch September 15, 2020 14:16

mruberry added the Merged label Oct 28, 2020
