[NNC] fix support for FP16 in CudaCodegen #44209
Conversation
💊 CI failures summary: as of commit 4409f54, ci.pytorch.org reported 1 failed job (more details on the Dr. CI page).
bertmaher left a comment:
I'm still a little nervous here, because the HalfChecker mutation is going to turn Loads of halfs into floats, but the expression tree was constructed assuming all the arithmetic is half-typed. So this probably ends up working syntactically, but the types in the IR are potentially out of sync. I'm not really sure anything will break as a result of this though, so maybe we can push it onto a TODO stack.
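To make the concern concrete, here is a minimal, self-contained sketch (toy classes, not the actual NNC IR types) of how a HalfChecker-style mutation can leave a parent node's dtype stale:

```cuda
#include <iostream>
#include <memory>

// Toy stand-ins for IR nodes; the real NNC classes differ.
enum class Dtype { Half, Float };

struct Expr {
  Dtype dtype;
  explicit Expr(Dtype d) : dtype(d) {}
  virtual ~Expr() = default;
};

struct Load : Expr {
  using Expr::Expr;
};

struct Cast : Expr {
  std::shared_ptr<Expr> src;
  Cast(Dtype d, std::shared_ptr<Expr> s) : Expr(d), src(std::move(s)) {}
};

struct Add : Expr {
  std::shared_ptr<Expr> lhs, rhs;
  // The Add's dtype is fixed at construction from its operands.
  Add(std::shared_ptr<Expr> l, std::shared_ptr<Expr> r)
      : Expr(l->dtype), lhs(std::move(l)), rhs(std::move(r)) {}
};

int main() {
  // Tree built assuming half-typed arithmetic: Add inherits Half.
  auto add = std::make_shared<Add>(std::make_shared<Load>(Dtype::Half),
                                   std::make_shared<Load>(Dtype::Half));

  // HalfChecker-style mutation: wrap each Load in a cast to float...
  add->lhs = std::make_shared<Cast>(Dtype::Float, add->lhs);
  add->rhs = std::make_shared<Cast>(Dtype::Float, add->rhs);

  // ...but the Add still reports Half, so the IR types are out of sync.
  std::cout << "operands: Float, Add dtype: "
            << (add->dtype == Dtype::Half ? "Half" : "Float") << "\n";
}
```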
facebook-github-bot left a comment:
@nickgg has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Codecov Report
@@            Coverage Diff             @@
##           master   #44209      +/-   ##
==========================================
- Coverage   69.24%   69.24%   -0.01%
==========================================
  Files         381      381
  Lines       47573    47573
==========================================
- Hits        32943    32942       -1
- Misses      14630    14631       +1

Continue to review the full report at Codecov.
Fixes a bug where FP16 values could be incorrectly cast to a half type that doesn't have a cast operator. The CUDA-specific cast to float is now inserted while handling the Cast node, rather than as a wrapper around printing Loads and Stores. Two main changes: the HalfChecker now inserts the casts to float explicitly in the IR, and the PrioritizeLoad mutator now consumes both a Load and any Cast that immediately wraps it, as sketched below.
Tested with test_jit_fuser_te.py and test_tensorexpr.py, plus the C++ tests.
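As a rough illustration of the intent (a hand-written sketch, not actual CudaCodegen output; the kernel name and shape are assumptions), half values should be cast to float as they are loaded, the arithmetic done in float, and the result cast back to half only at the store:

```cuda
#include <cuda_fp16.h>

// Hypothetical kernel sketching the code shape the fix aims for.
__global__ void add_half(const __half* x, const __half* y, __half* out, int n) {
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) {
    float xf = __half2float(x[i]);   // load hoisted into a float temporary
    float yf = __half2float(y[i]);
    out[i] = __float2half(xf + yf);  // single cast back to half on the store
  }
}
```

Hoisting the Cast together with the Load is what keeps the temporaries float-typed; hoisting the Load alone would leave a half-typed temporary with no usable cast operator.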