register parameters correctly in c++ MultiheadAttention #42037

glaringlee · 2020-07-24T21:36:02Z

Stack from ghstack:

#42037 register parameters correctly in c++ MultiheadAttention

This is to fix #41951

Differential Revision: D22764717

[ghstack-poisoned]

ghstack-source-id: bae3a2d Pull Request resolved: #42037

glaringlee · 2020-07-24T21:55:03Z

This corrects the parameter registration in MultiheadAttention. The root cause is described in #41951.
cc @pbelevich @yf225

zhangguanheng66 · 2020-07-26T16:54:54Z

test/cpp/api/modules.cpp

      } else {
        std::uniform_int_distribution<int> d(5, 20);
        kv_dim = d(generator);
+        while (kv_dim == d_model) {


what's this for?

This branch is meant to test the kv_dim != d_model case, but the distribution d can produce the same value as d_model (e.g. 9), so I added this while loop to resample until the two differ.
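For illustration, the resampling idea can be sketched outside the test as a small standalone helper. This is only a sketch under assumptions: `sample_kv_dim` and its `lo`/`hi` parameters are hypothetical names that do not appear in the PR, and the loop body is assumed to simply redraw from the same distribution, since the diff excerpt above only shows the loop condition.

```cpp
#include <cassert>
#include <random>

// Hypothetical helper (not part of the PR): draw kv_dim uniformly from
// [lo, hi], redrawing until it differs from d_model.
int sample_kv_dim(std::mt19937& generator, int d_model, int lo = 5, int hi = 20) {
  // The range must hold at least two values; otherwise the loop below
  // could never terminate when the only value equals d_model.
  assert(lo < hi);
  std::uniform_int_distribution<int> d(lo, hi);
  int kv_dim = d(generator);
  while (kv_dim == d_model) {
    kv_dim = d(generator);  // assumed loop body: resample on collision
  }
  return kv_dim;
}
```

With d_model = 9, any value returned by this helper lies in [5, 20] and is never 9, so the kv_dim != d_model branch is always the one exercised.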

zhangguanheng66 · 2020-07-26T16:57:34Z

torch/csrc/api/src/nn/modules/activation.cpp

+    bias_k = register_parameter("bias_k", torch::empty({1, 1, options.embed_dim()}));
+    bias_v = register_parameter("bias_v", torch::empty({1, 1, options.embed_dim()}));
  } else {
    bias_k = {};


Do we need to register_parameter('bias_k', {}, /*requires_grad=*/false)?

Based on the Python implementation, no. It only registers bias_k and bias_v in the non-empty case.

pytorch/torch/nn/modules/activation.py

Line 837 in 4121d34

self.bias_k = self.bias_v = None

zhangguanheng66

Not relevant. We merged two PRs in H1 for nn.MHA on the Python side.
#31996
#33763

zhangguanheng66

LGTM for the MHA functionality.

facebook-github-bot · 2020-07-27T22:20:19Z

@glaringlee merged this pull request in 5246bc4.

register parameters correctly in c++ MultiheadAttention

99b2a3e

[ghstack-poisoned]

glaringlee requested review from ebetica, goldsborough and yf225 as code owners July 24, 2020 21:36

glaringlee pushed a commit that referenced this pull request Jul 24, 2020

register parameters correctly in c++ MultiheadAttention

747b889

ghstack-source-id: bae3a2d Pull Request resolved: #42037

glaringlee requested review from anjali411, pbelevich and zhangguanheng66 and removed request for ebetica and goldsborough July 24, 2020 21:37

zhangguanheng66 reviewed Jul 26, 2020

zhangguanheng66 approved these changes Jul 27, 2020

yf225 approved these changes Jul 27, 2020

facebook-github-bot closed this in 5246bc4 Jul 27, 2020

facebook-github-bot added the merged label Jul 27, 2020

facebook-github-bot deleted the gh/glaringlee/24/head branch July 31, 2020 14:17

mruberry added the Merged label Oct 28, 2020
