[Quant] Add fused conv2d_add_relu op for onednn backend #90364
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90364
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit e4877f1.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
**Summary**
Post-op fusion can reduce data movement overhead and improve inference performance. This PR adds a fused conv2d_add_relu op for the onednn backend, to be used for int8 inference with the onednn backend. Calling this op with any other quantization backend throws an error.

**Test Plan**
```
python -m pytest test_quantization.py::TestQuantizedConv
```

**TODO**
There is a oneDNN issue that may cause a kernel core dump with some input shapes. This PR should be merged after that issue is resolved.

cc @VitalyFedyunin @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10
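For context, selecting the quantization engine is the prerequisite for onednn-specific ops; a minimal sketch (which engines are listed depends on how PyTorch was built):

```python
import torch

# The fused op is only supported under the onednn quantization backend;
# calling it under any other engine raises an error. The contents of
# supported_engines depend on the build.
print(torch.backends.quantized.supported_engines)
torch.backends.quantized.engine = 'onednn'  # assumes a oneDNN-enabled build
```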
@jerryzh168 Thanks for the comments; I have addressed them. Could you take a look at this PR again?
ghstack-source-id: 7c95466
Pull Request resolved: pytorch#90364
Hi @jerryzh168, are there any other comments on this PR? Could you take another look?
Review thread on this diff hunk:

```python
Y_scale, Y_zero_point, use_bias, "add", use_channelwise, False,
input_dtype=X_qdtype, output_dtype=X_qdtype, X2_scale=X2_scale, X2_zero_point=X2_zero_point)

@given(batch_size=st.integers(1, 3),
```
Please don't add new calls to hypothesis; it has caused a lot of flaky test errors in CI before. Can you change them to loops instead?
Thanks for the comments. I have replaced the hypothesis calls with for loops over itertools.product. Please take another look, @jerryzh168.
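For illustration, the conversion pattern looks roughly like this (parameter names and ranges are illustrative, not the exact ones in the test suite):

```python
import itertools

# Explicit parameter grid replacing hypothesis' @given sampling.
batch_size_opts = [1, 2, 3]
use_bias_opts = [True, False]
use_channelwise_opts = [True, False]

for batch_size, use_bias, use_channelwise in itertools.product(
        batch_size_opts, use_bias_opts, use_channelwise_opts):
    # Every combination is exercised deterministically on each CI run,
    # instead of a random sample drawn by hypothesis.
    print(batch_size, use_bias, use_channelwise)
```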
Hi @jerryzh168, thanks for your review and comments on this ghstack. The previous comments have all been addressed. Could you kindly take another look at this ghstack? There are still some PRs in this ghstack that may need your approval.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
I have a build error blamed on this PR when trying to build PyTorch on my machine: https://gist.github.com/vkuzo/43fa00f625fa099eb50b1e41bf8a9b2e, specifically the …
Is this related to the version of the ideep dependency?
@vkuzo Can you try updating your ideep version to match PyTorch master?
thanks, my …
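For anyone hitting the same build error, the usual way to bring a stale ideep checkout in line with what the current PyTorch revision pins is the standard git submodule commands, run from the root of the pytorch checkout:

```
git submodule sync
git submodule update --init --recursive  # includes third_party/ideep
```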
Stack from ghstack (oldest at bottom):
**Summary**
Post-op fusion can reduce data movement overhead and improve inference performance. This PR adds a fused conv2d_add_relu op for the onednn backend, to be used for int8 inference with the onednn backend. Calling this op with any other quantization backend throws an error.
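For reference, the unfused eager pattern this op targets looks like the sketch below (float tensors and shapes are illustrative; the real op takes quantized int8 tensors with scales and zero points):

```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 3, 8, 8)          # conv input
residual = torch.randn(1, 4, 8, 8)   # skip-connection tensor added after the conv
weight = torch.randn(4, 3, 3, 3)

# Unfused: three kernels, each writing its intermediate result to memory.
y = F.conv2d(x, weight, padding=1)
y = y + residual
y = F.relu(y)
# conv2d_add_relu computes the same composition in a single oneDNN
# kernel, avoiding the intermediate reads and writes.
```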
**Test Plan**
```
python -m pytest test_quantization.py::TestQuantizedConv
```
cc @VitalyFedyunin @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10