
Conversation

@xuhdev (Collaborator) commented Jul 20, 2020

Fix #40580
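
For context, issue #40580 reports that multiplying a bfloat16 tensor with a double tensor raises a RuntimeError instead of promoting. A minimal illustration of the behavior this PR enables (a sketch, assuming the standard promotion rule that the wider floating-point dtype wins):

import torch

a = torch.tensor([1.0, 2.0], dtype=torch.bfloat16)
b = torch.tensor([3.0, 4.0], dtype=torch.float64)

# Previously this raised a RuntimeError; with this PR the result
# promotes to the wider floating-point dtype.
c = a * b
print(c.dtype)  # torch.float64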

@xuhdev xuhdev requested a review from mruberry July 20, 2020 22:16
@xuhdev xuhdev force-pushed the bfloat16-promotion branch from a9141ee to ad2f22a Compare July 20, 2020 22:17
@dr-ci (bot) commented Jul 20, 2020

💊 CI failures summary and remediations

As of commit fb8fbd7 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚



@xuhdev xuhdev requested review from anjali411, nairbv and ngimel July 20, 2020 22:34
@gchanan gchanan added the triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) label Jul 21, 2020
@xuhdev xuhdev force-pushed the bfloat16-promotion branch 2 times, most recently from 783cb49 to 642bb8e Compare July 21, 2020 18:17
@mruberry (Collaborator) left a comment

Hey @xuhdev! Thanks for the PR! I think there are still some tests to sort out, and maybe we should leave fp16 x bfloat16 undefined for the moment?

Let me know your thoughts.
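
For context, leaving fp16 x bfloat16 undefined would mean that mixing the two 16-bit float dtypes raises rather than silently promoting. A hypothetical check of that behavior (an assumption about what "undefined" means here, not code from this PR):

import torch

h = torch.tensor([1.0], dtype=torch.float16)
bf = torch.tensor([1.0], dtype=torch.bfloat16)

# If fp16 x bfloat16 stays undefined, mixing the dtypes should raise.
try:
    h * bf
except RuntimeError as e:
    print("fp16 x bfloat16 rejected:", e)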

@nairbv (Collaborator) commented Jul 22, 2020

What devices support bfloat16?

In torch/testing/__init__.py there's a function get_all_math_dtypes(device), but it doesn't appear to set include_bfloat16=True for any device type. If we fix that to handle bfloat16 too, I think it would enable some other arithmetical mixed-type testing.
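
A sketch of the change being suggested, with an assumed dtype list (illustrative only, not the actual contents of torch/testing/__init__.py):

import torch

def get_all_math_dtypes(device):
    dtypes = [torch.uint8, torch.int8, torch.int16, torch.int32,
              torch.int64, torch.float32, torch.float64]
    if str(device).startswith('cuda'):
        dtypes.append(torch.float16)
    # The suggested fix: also include bfloat16 for device types that
    # support it, instead of never passing include_bfloat16=True.
    dtypes.append(torch.bfloat16)
    return dtypes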

@xuhdev xuhdev force-pushed the bfloat16-promotion branch 3 times, most recently from ede9551 to 5684589 Compare July 23, 2020 02:03
@xuhdev xuhdev requested a review from mruberry July 23, 2020 05:06
@mruberry mruberry self-requested a review July 23, 2020 06:52
@mruberry (Collaborator) left a comment

New changes look good. The test in test_type_promotion still needs to be expanded, though.

To answer @nairbv's question about device support: at least CPUs, CUDA devices, and XLA devices support bfloat16 today. The new test suite I'm developing actually validates bfloat16 pretty well. It found a few issues but overall bfloat16 seems to be working as expected where we've implemented the dispatch for it.
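
For reference, the promotion rules under discussion can be inspected directly with torch.promote_types and torch.result_type (a quick illustration, not part of the PR):

import torch

# bfloat16 promotes with the wider float, like the other float dtypes.
print(torch.promote_types(torch.bfloat16, torch.float32))  # torch.float32
print(torch.promote_types(torch.bfloat16, torch.float64))  # torch.float64
print(torch.promote_types(torch.bfloat16, torch.int64))    # torch.bfloat16

# result_type also handles tensor/scalar mixes; a Python float scalar
# does not change a floating-point tensor's dtype.
bf = torch.tensor(1.0, dtype=torch.bfloat16)
print(torch.result_type(bf, 2.2))  # torch.bfloat16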

@xuhdev xuhdev force-pushed the bfloat16-promotion branch from 5684589 to fc5f236 Compare July 23, 2020 17:53
@xuhdev xuhdev requested a review from mruberry July 23, 2020 17:54
@mruberry (Collaborator) left a comment

You don't really need this test with the default tensor (it will just be float32 or float64, which you're already testing).

This test looks good. One small change: also test that bf + a complex number errors (you can construct complex Python numbers using complex(<real>, <imag>)), and flip the tests so you're also adding the number / other tensor + the bfloat16 tensor. Something like this:

bf = torch.tensor(5.5, dtype=torch.bfloat16, device=device)
scalars = (2.2, 5, complex(1, -1))
for scalar in scalars:
  if isinstance(scalar, complex):
    with self.assertRaises(RuntimeError):
      bf + scalar
    with self.assertRaises(RuntimeError):
      scalar + bf
  else:
    self.assertEqual(bf + scalar, scalar + bf)
    self.assertEqual((bf + scalar).dtype, torch.bfloat16)

And then in the next section you want to create the tensor of the dtype upfront:

for dtype in torch.testing.get_all_dtypes():
  t = torch.tensor(1, device=device, dtype=dtype)
  if dtype in (torch.float32, torch.float64):
    self.assertEqual(bf + t, t + bf)
    self.assertEqual((bf + t).dtype, dtype)
...

How does that sound?

@xuhdev (Collaborator, Author) left a comment

Yeah, I agree. I adapted your proposal a bit.

@xuhdev xuhdev force-pushed the bfloat16-promotion branch from fc5f236 to 0d1ac22 Compare July 23, 2020 18:32
@xuhdev xuhdev requested a review from mruberry July 23, 2020 18:32
@xuhdev xuhdev force-pushed the bfloat16-promotion branch 2 times, most recently from c1739cc to 4ce0bd2 Compare July 23, 2020 18:36
@xuhdev xuhdev force-pushed the bfloat16-promotion branch from 4ce0bd2 to fb8fbd7 Compare July 24, 2020 00:24
@xuhdev (Collaborator, Author) commented Jul 29, 2020

Any update on this?

@mruberry (Collaborator) commented

Hey @xuhdev, sorry to keep you waiting. I had a chance to talk to @nairbv offline and we're satisfied. Nice job!

@facebook-github-bot (Contributor) left a comment

@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot (Contributor) commented

@mruberry merged this pull request in 344defc.

@xuhdev xuhdev deleted the bfloat16-promotion branch August 18, 2020 23:47

Labels

Merged · open source · triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)


Development

Successfully merging this pull request may close these issues.

Multiply a bfloat16 tensor with a double tensor leads to RuntimeError

6 participants