fix multihead attention for half #21658

ngimel · 2019-06-11T22:01:26Z

Currently multihead attention for half type is broken

  File "/home/ngimel/pytorch/torch/nn/functional.py", line 3279, in multi_head_attention_forward
    attn_output = torch.bmm(attn_output_weights, v)
RuntimeError: Expected object of scalar type Float but got scalar type Half for argument #2 'mat2'

because softmax converts half inputs into fp32 inputs. This is unnecessary - all the computations in softmax will be done in fp32 anyway, and the results need to be converted into fp16 for the subsequent batch matrix multiply, so nothing is gained by writing them out in fp32. This PR gets rid of type casting in softmax, so that half works.

zhangguanheng66

The failed tests are not relevant. Ready to Merge. Thanks for the contribution @ngimel

facebook-github-bot

@zhangguanheng66 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2019-06-14T02:35:02Z

@zhangguanheng66 merged this pull request in efd20de.

fix multihead attention for half

0cae1c1

ngimel requested a review from zhangguanheng66 June 11, 2019 22:01

pytorchbot added the module: nn Related to torch.nn label Jun 11, 2019

ezyang added the open source label Jun 11, 2019

zhangguanheng66 approved these changes Jun 12, 2019

View reviewed changes

facebook-github-bot reviewed Jun 13, 2019

View reviewed changes

facebook-github-bot closed this in efd20de Jun 13, 2019

facebook-github-bot added the merged label Jun 14, 2019

mruberry added the Merged label Oct 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix multihead attention for half #21658

fix multihead attention for half #21658

Uh oh!

ngimel commented Jun 11, 2019

Uh oh!

zhangguanheng66 left a comment •

edited

Loading

Uh oh!

facebook-github-bot left a comment

Uh oh!

facebook-github-bot commented Jun 14, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

fix multihead attention for half #21658

fix multihead attention for half #21658

Uh oh!

Conversation

ngimel commented Jun 11, 2019

Uh oh!

zhangguanheng66 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Jun 14, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

zhangguanheng66 left a comment •

edited

Loading