Conversation

@kshitij12345
Collaborator

@kshitij12345 kshitij12345 commented Jun 20, 2020

Reference: #38349 #9190

TODO

  • Add Tests
  • Update Docs

@kshitij12345 kshitij12345 changed the title torch.where : Scalar Support [WIP] torch.where : Scalar Support Jun 20, 2020
@dr-ci

dr-ci bot commented Jun 20, 2020

💊 CI failures summary and remediations

As of commit ea11981 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


@vadimkantorov
Contributor

A lot of real-world use cases can be found in https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/Loss.cpp

It would be good if they were refactored to not allocate zeros tensors.

Another question is that they often allocate indicator tensors (usually something like x == 1, x == -1, x > 0, x == 0). Without automatic fusion, or a comparison-op argument indicating how to interpret a float tensor (or a default convention, e.g. 0 is false and non-zero is true, like in C), it's impossible to get rid of them and maybe not worth it - also discussed in #9190
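
As a rough illustration of the refactor being suggested, here is a minimal Python sketch of a hinge-embedding-style loss (hypothetical names, not code from the PR), assuming the scalar overload added here:

import torch

def hinge_embedding_loss_sketch(input, target, margin=1.0):
    # Before: an explicit zeros tensor had to be allocated just to feed where()
    #   zeros = torch.zeros_like(input)
    #   output_margin = torch.where(target != 1, (margin - input).clamp_min(0), zeros)
    # With scalar support the allocation goes away:
    margin_clamp = (margin - input).clamp_min(0)
    output_margin = torch.where(target != 1, margin_clamp, 0)
    output_self = torch.where(target != -1, input, 0)
    return (output_margin + output_self).mean()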

@vadimkantorov
Contributor

Will type promotion also be supported, i.e. using 0 or 1 in place of 0.0 and 1.0?

@kshitij12345
Collaborator Author

@vadimkantorov

A lot of real-world use cases can be found in https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/Loss.cpp
It would be good if they were refactored to not allocate zeros tensors.

Thanks! Would be a good test as well.

Another question is that they often allocate indicator tensors (usually something like x == 1, x == -1, x > 0, x == 0). Without automatic fusion, or a comparison-op argument indicating how to interpret a float tensor (or a default convention, e.g. 0 is false and non-zero is true, like in C), it's impossible to get rid of them and maybe not worth it - also discussed in #9190

Sounds interesting but surely out-of-scope for this PR.

Will type promotion also be supported, i.e. using 0 or 1 in place of 0.0 and 1.0?

I am not planning to do it in this PR as it would also affect the behaviour of Tensor-Tensor overload.

@vadimkantorov
Contributor

vadimkantorov commented Jun 20, 2020

I am not planning to do it in this PR as it would also affect the behaviour of Tensor-Tensor overload.

Got it. For the future, type promotion would be useful for the tensor-tensor overload too (although not as pressing, given the new scalar support).

auto margin_clamp = (margin - self).clamp_min_(0);
auto output_margin = at::where(target != 1, margin_clamp, zeros);
auto output_self = at::where(target != -1, self, zeros);
auto output_margin = at::where(target != 1, margin_clamp, 0);
Contributor

Will there not be a problem because of the int scalar type? margin_clamp probably has a float tensor dtype - or will ATen promote types here?

Collaborator Author

At the moment, if there is one scalar and one tensor, the scalar is cast to the tensor's dtype; this felt natural when implementing. But looking back, I guess this isn't good behaviour, as it may lead to the scalar's type being promoted or demoted based on the tensor.

Contributor

Ah, cool. So there is scalar -> tensor type promotion. For me that's what I would expect.
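
A quick illustration of the casting behaviour described in this thread (a sketch, not from the PR):

import torch

x = torch.rand(3)                  # float32 tensor
out = torch.where(x > 0.5, x, 0)   # int literal as the "other" value
print(out.dtype)                   # torch.float32: the scalar is cast to x's dtype

# Per the discussion above, at the time of this PR a float scalar paired with an
# integer tensor would likewise be cast down to the tensor's dtype; full type
# promotion was left for a follow-up.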

@vadimkantorov
Contributor

Also, not sure if torch.lerp currently supports scalars or not

@vadimkantorov
Contributor

vadimkantorov commented Jun 20, 2020

Also, at some point there were problems with torch.where producing NaN gradients; it would be good to have a note in the docs about the current state of affairs (the canonical erroring example is entropy computation in the presence of 0: torch.where(p > 0, p*p.log(), 0).sum(-1)):
#18287 #23395

Many of these issues were closed circularly (one links to another); the only currently open issue is #23156

@kshitij12345 kshitij12345 marked this pull request as ready for review July 11, 2020 12:28
@kshitij12345 kshitij12345 changed the title [WIP] torch.where : Scalar Support torch.where : Scalar Support Jul 11, 2020
@kshitij12345
Collaborator Author

@mruberry Please review :)

@vadimkantorov
Contributor

vadimkantorov commented Jul 11, 2020

Will something like

x = torch.rand(3, 4, 5).to('cuda')
y = torch.where(x > 0.5, x, torch.tensor(0))

work?
i.e. cpu->cuda scalar copy

@kshitij12345
Collaborator Author

This PR doesn't actually touch the current version.

>>> a = torch.randn(3,4).to('cuda')
>>> torch.where(a > 0.5, a, torch.tensor(0.))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
RuntimeError: Expected condition, x and y to be on the same device, but condition is on cuda:0 and x and y are on cuda:0 and cpu respectively

However, after this PR:

>>> a = torch.randn(3,4).to('cuda')
>>> torch.where(a > 0.5, a, 0)
tensor([[0.0000, 0.0000, 0.5484, 0.0000],
        [0.8243, 0.0000, 1.0878, 1.2405],
        [0.0000, 0.0000, 0.0000, 1.7502]], device='cuda:0')

@mruberry mruberry self-requested a review July 12, 2020 07:16
@mruberry mruberry added the module: numpy Related to numpy support, and also numpy compatibility of our operators label Jul 12, 2020
@kshitij12345
Collaborator Author

@mruberry
Gentle ping :)


namespace {

inline at::Tensor scalar_to_tensor_default_dtype(
Collaborator

Better add a comment explaining what this function is for

Collaborator

Originally I had a few suggestions for how to modernize this function, but now I have a different idea: what about copying it and putting it next to "scalar_to_tensor"?

inline at::Tensor scalar_to_tensor(Scalar s, const Device device = at::kCPU) {

Except the dtypes would be different, as you're acquiring them.

Collaborator Author

Have moved the function and added some comments.

Please review.

@mruberry mruberry self-requested a review July 21, 2020 17:56
Collaborator

@mruberry mruberry left a comment

Cool. Nice work, @kshitij12345. I made a few small comments about code organization and docs. I'm curious to see how type promotion will affect the tests, and wondering if in the next iteration we could get some tests that compare directly with np.where, too.

Just ping me when this is ready.

@kshitij12345
Collaborator Author

@mruberry

Have addressed the changes.
Do let me know if there is a better way to phrase the comment in ScalarOps.h.

For next iteration (type-promotion), we would most likely have tests directly against numpy!

@kshitij12345
Collaborator Author

@mruberry Gentle Ping:)

@mruberry
Collaborator

@mruberry

Have addressed the changes.
Do let me know if there is a better way to phrase the comment in ScalarOps.h.

For next iteration (type-promotion), we would most likely have tests directly against numpy!

Sorry to keep you waiting, @kshitij12345, it's been a very busy week! Thank you for your patience.

Let's not hold this PR up anymore on small issues. We can look at improving that comment, if we want to, in the future.

Contributor

@facebook-github-bot facebook-github-bot left a comment

@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@mruberry
Collaborator

Internal tests flagged a few potential issues / lint problems, @kshitij12345. I'll sort through them tomorrow. Should be a few simple fixes.

@kshitij12345
Collaborator Author

@mruberry
Sure. Thank You!
Do let me know if I can help with anything.

@facebook-github-bot
Contributor

@mruberry merged this pull request in 31d41f9.

@kshitij12345 kshitij12345 deleted the develop/where/scalar branch July 31, 2020 08:19
@vadimkantorov
Contributor

Also, at some point there were problems with torch.where producing NaN gradients; it would be good to have a note in the docs about the current state of affairs (the canonical erroring example is entropy computation in the presence of 0: torch.where(p > 0, p*p.log(), 0).sum(-1)):
#18287 #23395

Many of these issues were closed circularly (one links to another); the only currently open issue is #23156

Does the entropy case work now? Is torch.where NaN-safe for this case?

@mruberry
Collaborator

mruberry commented Aug 4, 2020

Does the entropy case work now? Is torch.where NaN-safe for this case?

No. The behavior of torch.where is unchanged. This is just sugar allowing scalars to be interpreted as tensor arguments.
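
Roughly, the sugar amounts to wrapping the scalar in a tensor with the other argument's dtype and device yourself; a sketch of the semantics (not the internal implementation):

import torch

cond = torch.rand(3) > 0.5
x = torch.rand(3)
out = torch.where(cond, x, 0)
same = torch.where(cond, x, torch.tensor(0., dtype=x.dtype, device=x.device))
assert torch.equal(out, same)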

@vadimkantorov
Contributor

vadimkantorov commented Aug 4, 2020

Does the entropy case work now? Is torch.where NaN-safe for this case?

No. The behavior of torch.where is unchanged. This is just sugar allowing scalars to be interpreted as tensor arguments.

It would be good to have this as a test, even if it's known to fail at the moment. A note in the docs would also be good. This is a very common mistake and source of surprises (and the entropy computation use case specifically is a good/common example of this, I think)...

import torch

def where_is_not_nan_safe():
  p = torch.zeros(4).requires_grad_()
  # The forward pass is fine (the zeros branch is selected), but the backward
  # pass still differentiates p*p.log() at p == 0, and the masked gradient
  # becomes 0 * (log(0) + 1) = 0 * (-inf) = nan.
  e = torch.where(p > 0, p*p.log(), torch.zeros_like(p)).sum(-1)
  assert torch.isnan(torch.autograd.grad(e, (p,))[0]).all()

@mruberry
Collaborator

mruberry commented Aug 4, 2020

The linked issue is less about torch.where and more about how gradients are represented, so I don't think we'd take a test for this in particular.

@vadimkantorov
Contributor

You're right. Given the current gradient processing and no special handling in torch.where, this is just how things are. But if torch.where did special handling, it could be a way around the current gradient processing. All I wanted to point out is that trying to implement entropy and work around the naive p * p.log() with torch.where is quite frequent (and currently failing), to the point that it might deserve a note in the docs, and at best be fixed by special handling in torch.where (if technically possible).
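
For reference, a commonly used workaround for this pattern (only a sketch, not part of this PR) is the double-where trick: feed log() a safe value where p == 0 so the unselected branch never produces an inf for the chain rule to multiply by zero:

import torch

p = torch.tensor([0.0, 0.25, 0.75], requires_grad=True)

# Naive version (NaN gradient where p == 0, as in the snippet above):
#   e = torch.where(p > 0, p * p.log(), torch.zeros_like(p)).sum()

safe_p = torch.where(p > 0, p, torch.ones_like(p))
e = torch.where(p > 0, p * safe_p.log(), torch.zeros_like(p)).sum()
e.backward()
print(p.grad)   # finite everywhere, e.g. tensor([ 0.0000, -0.3863,  0.7123])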
