Conversation

@t-vi (Collaborator) commented Apr 27, 2018

Sometimes, people are surprised that things cannot be differentiated w.r.t. integer parameters such as indices.
The following patch takes some steps to prevent them from requiring gradients of non-floating-point Tensors:

  • by setting tensor.requires_grad = True
  • by using tensor.requires_grad_(True)
  • by using factory functions with requires_grad=True

Of course, applying the above with False still needs to be allowed.

As requested by Adam in #7021, this is done at the Python interface level.
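
A minimal sketch of the new behavior from the Python side (assuming, as in current PyTorch, that the rejection surfaces as a RuntimeError; the exact message is illustrative):

```python
import torch

x = torch.zeros(3, dtype=torch.int64)

# all three ways of requesting gradients are rejected for integer tensors
attempts = [
    lambda: setattr(x, 'requires_grad', True),                      # tensor.requires_grad = True
    lambda: x.requires_grad_(True),                                 # tensor.requires_grad_(True)
    lambda: torch.zeros(3, dtype=torch.int64, requires_grad=True),  # factory function
]
for attempt in attempts:
    try:
        attempt()
    except RuntimeError as e:
        print('rejected:', e)

x.requires_grad = False  # setting False remains allowed
```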

@t-vi (Collaborator, Author) commented Apr 27, 2018

Something is up with the pytorch-linux-xenial-py3-clang5-asan test. It seems to hang for 30 minutes in test_multi_drop (test_utils.TestDataLoader).
Is that me or the test?

@t-vi force-pushed the set_requires_grad_only_for_float branch from c7e2383 to 04fcbd6 on April 27, 2018 13:02
@zou3519 (Contributor) commented Apr 27, 2018

@t-vi Probably just the test; I've seen intermittent timeouts there.

@t-vi (Collaborator, Author) commented Apr 27, 2018 via email

```python
for f in [f1, f2, f3]:
    a = torch.ones(1, dtype=dt, device='cuda' if cuda else 'cpu')
    if dt.is_floating_point:
        f()
```


@t-vi (Collaborator, Author) commented Apr 28, 2018

So now macOS has a "CI changed" failure, but I think it works.

@ezyang (Contributor) commented Apr 30, 2018

OS X failure is unrelated, and I think fixed on master. @pytorchbot retest this please

@apaszke apaszke merged commit 8fbab83 into pytorch:master Apr 30, 2018
@apaszke (Contributor) commented Apr 30, 2018

Thanks @t-vi!

@gchanan (Contributor) commented May 2, 2018

This didn't change the constructors in tensor_new.cpp, e.g. `torch.tensor`.

If you implemented this in those constructors, it would get a little awkward when combined with type inference, because you don't know the type of the tensor that will come out, e.g.:

```python
def convert_to_tensors(data0, data1):
    return torch.tensor(data0, requires_grad=True), torch.tensor(data1, requires_grad=True)
```

would not throw an error on `convert_to_tensors([0., 1.], [2., 3.])` but would on `convert_to_tensors([0., 1.], [2, 3])`. Sometimes you want this fail-fast behavior, but sometimes not.
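
A sketch of that asymmetry (assuming, hypothetically, that `torch.tensor` enforced the same check, which per the above it currently does not):

```python
import torch

# dtype is inferred from the data, so the same call site may or may not error:
torch.tensor([0., 1.], requires_grad=True)  # inferred float32 -> fine
torch.tensor([2, 3], requires_grad=True)    # inferred int64 -> would raise
```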

justindujardin added a commit to justindujardin/thinc that referenced this pull request Jan 31, 2020
Only float tensors can be backpropped and PyTorch throws an error if you try: pytorch/pytorch#7034
