[Quant] lower fused LinearTanh for onednn backend #89188

Xia-Weiwen · 2022-11-17T05:44:26Z

Stack from ghstack (oldest at bottom):

Summary
Add fuser method and quantization mappings for QLinearLeakyReLU for int8 inference for onednn backend. The fusion and lowering are supported only in FX mode.

Test plan
python test_quantization.py TestFuseFx TestQuantizeFx

cc @jerryzh168 @jianyuh @raghuramank100 @jamesr66a @vkuzo @jgong5 @leslie-fang-intel @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

[ghstack-poisoned]

pytorch-bot · 2022-11-17T05:44:30Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89188

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f725426:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel [ghstack-poisoned]

ghstack-source-id: 9c9b3c0 Pull Request resolved: #89188

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel [ghstack-poisoned]

ghstack-source-id: 8128287 Pull Request resolved: #89188

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel [ghstack-poisoned]

ghstack-source-id: bea4703 Pull Request resolved: #89188

**Summary** Add fuser method and quantization mappings for `QLinearLeakyReLU` for int8 inference for onednn backend. The fusion and lowering are supported only in FX mode. **Test plan** python test_quantization.py TestFuseFx TestQuantizeFx cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel [ghstack-poisoned]

ghstack-source-id: 9219f9b Pull Request resolved: #89188

**Summary** Add fuser method and quantization mappings for `QLinearLeakyReLU` for int8 inference for onednn backend. The fusion and lowering are supported only in FX mode. **Test plan** python test_quantization.py TestFuseFx TestQuantizeFx cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel mingfeima XiaobingSuper sanchitintel ashokei jingxu10 [ghstack-poisoned]

ghstack-source-id: 973e908 Pull Request resolved: #89188

**Summary** Add fuser method and quantization mappings for `QLinearLeakyReLU` for int8 inference for onednn backend. The fusion and lowering are supported only in FX mode. **Test plan** python test_quantization.py TestFuseFx TestQuantizeFx cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel mingfeima XiaobingSuper sanchitintel ashokei jingxu10 [ghstack-poisoned]

ghstack-source-id: a460fd0 Pull Request resolved: #89188

**Summary** Add fuser method and quantization mappings for `QLinearLeakyReLU` for int8 inference for onednn backend. The fusion and lowering are supported only in FX mode. **Test plan** python test_quantization.py TestFuseFx TestQuantizeFx cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel mingfeima XiaobingSuper sanchitintel ashokei jingxu10 [ghstack-poisoned]

ghstack-source-id: 8c9438a Pull Request resolved: #89188

**Summary** Add fuser method and quantization mappings for `QLinearLeakyReLU` for int8 inference for onednn backend. The fusion and lowering are supported only in FX mode. **Test plan** python test_quantization.py TestFuseFx TestQuantizeFx cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel mingfeima XiaobingSuper sanchitintel ashokei jingxu10 [ghstack-poisoned]

ghstack-source-id: 2b31e86 Pull Request resolved: #89188

**Summary** Add fuser method and quantization mappings for `QLinearLeakyReLU` for int8 inference for onednn backend. The fusion and lowering are supported only in FX mode. **Test plan** python test_quantization.py TestFuseFx TestQuantizeFx cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel mingfeima XiaobingSuper sanchitintel ashokei jingxu10 [ghstack-poisoned]

ghstack-source-id: fcaf152 Pull Request resolved: #89188

Xia-Weiwen · 2022-12-20T00:32:13Z

Hi @jerryzh168 I have made changes per your comments. Please take a look again. Thanks!

Xia-Weiwen · 2022-12-20T01:28:21Z

@pytorchbot merge

pytorchmergebot · 2022-12-20T01:30:15Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

[Quant] lower fused LinearTanh for onednn backend

ea3d2e3

[ghstack-poisoned]

Xia-Weiwen requested review from jerryzh168 and z-a-f as code owners November 17, 2022 05:44

pytorch-bot bot added the release notes: quantization release notes category label Nov 17, 2022

Xia-Weiwen mentioned this pull request Nov 17, 2022

[Quant] Add fused linear-leaky_relu op for onednn backend #88478

Closed

github-actions bot added the oncall: quantization Quantization support in PyTorch label Nov 17, 2022

Xia-Weiwen marked this pull request as draft November 17, 2022 05:45

pytorchbot added the open source label Nov 17, 2022

Update on "[Quant] lower fused LinearTanh for onednn backend"

0e9524c

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel [ghstack-poisoned]

Update on "[Quant] lower fused LinearTanh for onednn backend"

a585cd3

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel [ghstack-poisoned]

Xia-Weiwen added a commit that referenced this pull request Nov 17, 2022

[Quant] lower fused LinearTanh for onednn backend

4b186a9

ghstack-source-id: 9c9b3c0 Pull Request resolved: #89188

Update on "[Quant] lower fused LinearTanh for onednn backend"

559e440

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel [ghstack-poisoned]

Xia-Weiwen added a commit that referenced this pull request Nov 20, 2022

[Quant] lower fused LinearTanh for onednn backend

c14e02e

ghstack-source-id: 8128287 Pull Request resolved: #89188

Update on "[Quant] lower fused LinearTanh for onednn backend"

0d69288

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel [ghstack-poisoned]

Xia-Weiwen added a commit that referenced this pull request Nov 21, 2022

[Quant] lower fused LinearTanh for onednn backend

1aaef77

ghstack-source-id: bea4703 Pull Request resolved: #89188

Xia-Weiwen added a commit that referenced this pull request Nov 22, 2022

[Quant] lower fused LinearTanh for onednn backend

9d28b12

ghstack-source-id: 9219f9b Pull Request resolved: #89188

Xia-Weiwen marked this pull request as ready for review November 22, 2022 07:00

Xia-Weiwen added the intel This tag is for PR from Intel label Nov 22, 2022

Xia-Weiwen requested a review from jgong5 November 22, 2022 12:09

Xia-Weiwen marked this pull request as draft November 22, 2022 12:09

Xia-Weiwen added a commit that referenced this pull request Dec 15, 2022

[Quant] lower fused LinearTanh for onednn backend

7b96eb2

ghstack-source-id: 973e908 Pull Request resolved: #89188

Xia-Weiwen added a commit that referenced this pull request Dec 15, 2022

[Quant] lower fused LinearTanh for onednn backend

51d273e

ghstack-source-id: a460fd0 Pull Request resolved: #89188

Xia-Weiwen added a commit that referenced this pull request Dec 16, 2022

[Quant] lower fused LinearTanh for onednn backend

1484177

ghstack-source-id: 8c9438a Pull Request resolved: #89188

Xia-Weiwen added a commit that referenced this pull request Dec 16, 2022

[Quant] lower fused LinearTanh for onednn backend

318e442

ghstack-source-id: 2b31e86 Pull Request resolved: #89188

Xia-Weiwen added a commit that referenced this pull request Dec 18, 2022

[Quant] lower fused LinearTanh for onednn backend

bda9d61

ghstack-source-id: fcaf152 Pull Request resolved: #89188

Xia-Weiwen requested a review from jerryzh168 December 20, 2022 00:31

jerryzh168 approved these changes Dec 20, 2022

View reviewed changes

pytorchmergebot added the Merged label Dec 20, 2022

pytorchmergebot closed this in a5eb564 Dec 20, 2022

facebook-github-bot deleted the gh/Xia-Weiwen/7/head branch June 8, 2023 14:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Quant] lower fused LinearTanh for onednn backend #89188

[Quant] lower fused LinearTanh for onednn backend #89188

Uh oh!

Xia-Weiwen commented Nov 17, 2022 •

edited

Loading

Uh oh!

pytorch-bot bot commented Nov 17, 2022 •

edited

Loading

Uh oh!

Xia-Weiwen commented Dec 20, 2022

Uh oh!

Xia-Weiwen commented Dec 20, 2022

Uh oh!

pytorchmergebot commented Dec 20, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[Quant] lower fused LinearTanh for onednn backend #89188

[Quant] lower fused LinearTanh for onednn backend #89188

Uh oh!

Conversation

Xia-Weiwen commented Nov 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89188

✅ No Failures

Uh oh!

Xia-Weiwen commented Dec 20, 2022

Uh oh!

Xia-Weiwen commented Dec 20, 2022

Uh oh!

pytorchmergebot commented Dec 20, 2022

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Xia-Weiwen commented Nov 17, 2022 •

edited

Loading

pytorch-bot bot commented Nov 17, 2022 •

edited

Loading