[Quant] Add fused LinearTanh module for onednn backend #88923
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/88923
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit ab6b724. This comment was automatically generated by Dr. CI and updates every 15 minutes.
jerryzh168
left a comment
Looks good overall, but it will probably need some updates after the previous PRs are updated, I think.
jgong5
left a comment
Not sure if we need to rename the files from `linear_relu` to something like `linear_activation`, since we added linear+tanh into them.
Good idea. Thanks. I will do that after clarification from Jerry about PT 2.0.
**Summary**
This PR adds a fused `QLinearTanh` module for the onednn backend, which will be used for int8 inference. Calling this module with other quantization backends throws an error.

**Test plan**
python test_quantization.py TestStaticQuantizedModule
Hi @jerryzh168, I have made the changes per your comments. Could you take a look again? Thanks!
Hi @jerryzh168. Is it OK to land this? Thanks!
Hi @jerryzh168, do you have more comments on this PR? Thanks!
torch/nn/intrinsic/__init__.py
Outdated
from torch.ao.nn.intrinsic import LinearLeakyReLU
from torch.ao.nn.intrinsic import LinearTanh
Please revert these; we don't want to add new things to the torch/nn/intrinsic folder, as this is a folder that we plan to deprecate in the future.
OK, they are reverted.
jerryzh168
left a comment
Please revert the changes to the torch/nn/intrinsic folder.
OK, they are reverted.
test/allowlist_for_publicAPI.json
Outdated
"torch.nn.intrinsic.quantized.modules.bn_relu": "torch.ao.nn.intrinsic.quantized.modules.bn_relu",
"torch.nn.intrinsic.quantized.modules.conv_relu": "torch.ao.nn.intrinsic.quantized.modules.conv_relu",
"torch.nn.intrinsic.quantized.modules.linear_relu": "torch.ao.nn.intrinsic.quantized.modules.linear_relu",
"torch.nn.intrinsic.quantized.modules.linear_activation": "torch.ao.nn.intrinsic.quantized.modules.linear_activation",
Is this change still valid? We don't have torch.nn.intrinsic.quantized.modules.linear_activation now.
It's still needed, but the file name has changed:
"torch.nn.intrinsic.quantized.modules.linear_relu": "torch.ao.nn.intrinsic.quantized.modules.linear_activation",
I reverted the file name change due to CI check failures.
I reverted the change due to a CI check failure.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):

**Summary**
This PR adds a fused `QLinearTanh` module for the onednn backend, which will be used for int8 inference with the onednn backend. Calling this module with other quantization backends throws an error.

**Test plan**
python test_quantization.py TestStaticQuantizedModule

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10
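As background, the benefit of fusing a quantized linear with its following activation can be sketched in a backend-agnostic way: the integer accumulator is dequantized once, tanh is applied in float, and the result is requantized once, instead of round-tripping the linear output through an intermediate int8 tensor before the activation. The sketch below is an illustration of that idea in plain Python (the helper names are hypothetical and not part of this PR or of the onednn kernels):

```python
import math

def quantize(x, scale, zero_point, qmin=-128, qmax=127):
    # Affine int8 quantization: q = clamp(round(x / scale) + zero_point).
    q = int(round(x / scale)) + zero_point
    return max(qmin, min(qmax, q))

def fused_linear_tanh(x_q, w_q, bias, x_scale, x_zp, w_scale, out_scale, out_zp):
    # Integer dot product between quantized input and weights
    # (this would be an int32 accumulator in a real kernel;
    # weights are assumed symmetrically quantized, i.e. weight zp = 0).
    acc = sum((xq - x_zp) * wq for xq, wq in zip(x_q, w_q))
    # Dequantize the accumulator once, apply the activation in float,
    # then requantize once. An unfused pipeline would instead quantize
    # the linear output to int8 and dequantize it again before tanh,
    # costing extra memory traffic and an extra rounding step.
    y = math.tanh(acc * x_scale * w_scale + bias)
    return quantize(y, out_scale, out_zp)
```

For example, with input `[1.0, 2.0]` quantized at scale 0.1 (giving `x_q = [10, 20]`) and weights `[0.5, -0.25]` at scale 0.05 (giving `w_q = [10, -5]`), the linear output is 0.0 and tanh(0.0) requantizes to 0.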