[Quant] Add fused linear-tanh op for onednn backend #88879
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/88879
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit fc5ce60.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
jerryzh168 left a comment
lg, please add "Summary" and "Test Plan" as well
What is the motivation behind tanh-linear fusion?
Fusing activations with root ops can reduce overhead and improve inference performance. Linear-tanh is found in models like CGAN.
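To make that concrete, here is a minimal sketch of the eager-mode pattern the fusion targets (the CGAN-style head is a hypothetical illustration, not taken from this PR): without fusion, the linear output makes a round trip through memory before tanh reads it back, while a fused kernel applies tanh as a post-op on the GEMM result.

```python
import torch
import torch.nn as nn

# Hypothetical CGAN-style output head illustrating the pattern that
# linear-tanh fusion targets. Without fusion, the linear output is
# written to memory and read back by tanh; a fused kernel applies
# tanh as a post-op while the GEMM result is still hot in cache.
class GeneratorHead(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(128, 784)
        self.act = nn.Tanh()

    def forward(self, x):
        return self.act(self.fc(x))  # two kernels, one extra memory round trip
```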
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
**Summary**
Post-op fusion can reduce data movement overhead and improve inference performance. This PR adds a fused `linear-tanh` op for the `onednn` backend, to be used for int8 inference with the `onednn` backend. Linear-tanh is found in models like CGAN. Calling this op with any other quantization backend throws an error.
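A minimal sketch of how the fused op might be invoked, assuming it is registered as `torch.ops.quantized.linear_tanh` with the same signature as `quantized::linear` (the op name and signature are inferred from this PR's title, not confirmed here):

```python
import torch

# The fused op is onednn-only per the PR description; selecting any
# other quantized engine should raise an error when the op is called.
torch.backends.quantized.engine = 'onednn'

x = torch.randn(4, 128)
w = torch.randn(64, 128)
b = torch.randn(64)

qx = torch.quantize_per_tensor(x, scale=0.05, zero_point=64, dtype=torch.quint8)
qw = torch.quantize_per_tensor(w, scale=0.02, zero_point=0, dtype=torch.qint8)
packed = torch.ops.quantized.linear_prepack(qw, b)

# One call computes linear + tanh; tanh outputs lie in [-1, 1], so an
# output scale of 2/256 with zero_point 128 covers the full range.
qy = torch.ops.quantized.linear_tanh(qx, packed, 2.0 / 256, 128)
```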
**Test Plan**
`python test_quantization.py TestQuantizedLinear`
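For intuition, a rough sketch of the correctness property such a test would check: the fused kernel should agree with an unfused reference (dequantize → linear → tanh → requantize) to within quantization error. The op name is again assumed from the PR title:

```python
import torch
import torch.nn.functional as F

torch.backends.quantized.engine = 'onednn'

x = torch.randn(8, 32)
w, b = torch.randn(16, 32), torch.randn(16)
qx = torch.quantize_per_tensor(x, 0.1, 64, torch.quint8)
qw = torch.quantize_per_tensor(w, 0.05, 0, torch.qint8)
packed = torch.ops.quantized.linear_prepack(qw, b)

out_scale, out_zp = 2.0 / 256, 128  # tanh output lies in [-1, 1]
fused = torch.ops.quantized.linear_tanh(qx, packed, out_scale, out_zp)

# Unfused reference in fp32, requantized with the same output params.
ref = torch.tanh(F.linear(qx.dequantize(), qw.dequantize(), b))
qref = torch.quantize_per_tensor(ref, out_scale, out_zp, torch.quint8)

# Allow a tolerance of one quantization step.
torch.testing.assert_close(fused.dequantize(), qref.dequantize(),
                           atol=out_scale, rtol=0)
```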
cc @jerryzh168 @jianyuh @raghuramank100 @jamesr66a @vkuzo @jgong5 @leslie-fang-intel @VitalyFedyunin @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10