[NNC] fix Half conversion of immediates in Cuda backend #45213

nickgg · 2020-09-23T18:00:19Z

The Cuda HalfChecker casts up all loads and stores of Half to Float, so we do math in Float on the device. It didn't cast up HalfImmediate (ie. constants) so they could insert mixed-size ops. Fix is to do that.

bertmaher · 2020-09-23T18:06:33Z

torch/csrc/jit/tensorexpr/cuda_half_support.h

So this will get us float{v[i]} < float{0.} right, but won't the dtype of the compare op still be half? Does that cause problems?

The default implementation of each operator in IRMutator will recreate it with the correct types if child nodes are modified.

I can modify the test to reach into the generated IR and check the dtype of the Max op if you like?

bertmaher · 2020-09-23T18:11:27Z

Oh, also this is the last blocker for enabling float16 in test_jit_fuser_te.py::test_unary_ops. Could you re-enable that dtype there?

bertmaher

Awesome, thanks for the quick fix.

facebook-github-bot

@nickgg has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

dr-ci · 2020-09-24T01:28:33Z

💊 CI failures summary and remediations

As of commit 3dcc84e (more details on the Dr. CI page):

1/1 failures possibly* introduced in this PR
- 1/1 non-CircleCI failure(s)

ci.pytorch.org: 1 failed

Failed: pr/pytorch-linux-bionic-rocm3.7-py3.6

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions on the GitHub issue tracker or post in the (internal) Dr. CI Users group.

See how this bot performed.

facebook-github-bot · 2020-09-25T18:14:56Z

@nickgg merged this pull request in d1d9017.

nickgg requested a review from bertmaher September 23, 2020 18:00

nickgg requested a review from apaszke as a code owner September 23, 2020 18:00

facebook-github-bot added the oncall: jit Add this issue/PR to JIT oncall triage queue label Sep 23, 2020

nickgg force-pushed the fixHalfImm branch from e8a7518 to ee072a4 Compare September 23, 2020 18:01

bertmaher reviewed Sep 23, 2020

View reviewed changes

[NNC] fix Half conversion of immediates in Cuda backend

3dcc84e

nickgg force-pushed the fixHalfImm branch from ee072a4 to 3dcc84e Compare September 23, 2020 18:19

bertmaher approved these changes Sep 23, 2020

View reviewed changes

facebook-github-bot reviewed Sep 23, 2020

View reviewed changes

facebook-github-bot closed this in d1d9017 Sep 25, 2020

facebook-github-bot added the merged label Sep 25, 2020

mruberry added the Merged label Oct 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[NNC] fix Half conversion of immediates in Cuda backend #45213

[NNC] fix Half conversion of immediates in Cuda backend #45213

Uh oh!

nickgg commented Sep 23, 2020

Uh oh!

bertmaher Sep 23, 2020

Uh oh!

nickgg Sep 23, 2020

Uh oh!

bertmaher commented Sep 23, 2020

Uh oh!

bertmaher left a comment

Uh oh!

facebook-github-bot left a comment

Uh oh!

dr-ci bot commented Sep 24, 2020

Uh oh!

facebook-github-bot commented Sep 25, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[NNC] fix Half conversion of immediates in Cuda backend #45213

[NNC] fix Half conversion of immediates in Cuda backend #45213

Uh oh!

Conversation

nickgg commented Sep 23, 2020

Uh oh!

bertmaher Sep 23, 2020

Choose a reason for hiding this comment

Uh oh!

nickgg Sep 23, 2020

Choose a reason for hiding this comment

Uh oh!

bertmaher commented Sep 23, 2020

Uh oh!

bertmaher left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

dr-ci bot commented Sep 24, 2020

💊 CI failures summary and remediations

ci.pytorch.org: 1 failed

Uh oh!

facebook-github-bot commented Sep 25, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants