[Quant] Remove all the dequant nodes when the ref module has multi input args #90157
Conversation
…input [ghstack-poisoned]
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90157
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit 0ba4c89. This comment was automatically generated by Dr. CI and updates every 15 minutes.
…as multi input args"

**Summary**: When converting a ref module into a quant module, the `_lower_static_weighted_ref_module` pass assumes the `ref_node` only has 1 input node, and only removes the first `dequant` node. However, when we enable the `conv add` fusion, there will be an extra input node from the `add` node besides the original input node from `conv`. Similar to what is done in the `_lower_quantized_binary_op` pass (https://github.com/pytorch/pytorch/blob/41c3b41b92f5019f8d5e2f2846a06b87db01ca4e/torch/ao/quantization/fx/_lower_to_native_backend.py#L766-L775), we should remove all the `dequant` nodes in the `_lower_static_weighted_ref_module` pass.

**Test Plan**: This is a bug fix rather than a new feature. When we enable the `conv add` fusion PR later, we will add test cases accordingly.

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 Xia-Weiwen mingfeima XiaobingSuper sanchitintel ashokei jingxu10

[ghstack-poisoned]
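In other words, the originally proposed fix loops over all of `ref_node`'s inputs instead of unhooking only the first one. A minimal sketch of that idea; the helper and function names here are illustrative, not the actual code in `_lower_to_native_backend.py`:

```python
import torch
from torch.fx import Node

def _is_dequantize_node(node: Node) -> bool:
    # Simplified check for illustration: matches Tensor.dequantize() calls
    # and torch.dequantize(...) calls produced by the reference pattern.
    return (node.op == "call_method" and node.target == "dequantize") or (
        node.op == "call_function" and node.target == torch.dequantize
    )

def _remove_all_dequant_inputs(ref_node: Node) -> None:
    # A fused reference module (e.g. conv + add) can have more than one input,
    # and each input may be fed by its own dequantize node.
    for arg in ref_node.args:
        if isinstance(arg, Node) and _is_dequantize_node(arg):
            # Rewire ref_node to consume the quantized tensor directly,
            # bypassing the dequantize node.
            ref_node.replace_input_with(arg, arg.args[0])
```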
@jerryzh168 I think this PR is ready for review, could you help to take a look?
…input ghstack-source-id: 63fcac0 Pull Request resolved: pytorch#90157
Hi @jerryzh168, could you help to take a look at this fix?
```python
# Excerpt from _lower_static_weighted_ref_module that the review comment below refers to:
dq_node = ref_node.args[0]
assert(isinstance(dq_node, Node))
ref_node.replace_input_with(dq_node, dq_node.args[0])
for arg in ref_node.args:
```
I feel this might be a bit hacky; generally we want to identify a specific pattern and just lower that pattern. This function (`_lower_static_weighted_ref_module`) is assuming that we have the "dq -> ref_fp32_module -> q" pattern, I think. Could you
1) add some checks to this function to make sure this is the case, e.g. check `len(ref_node.args) == 1` or something, and
2) add another lowering function for conv -> add that uses this code?
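For illustration, option 1) could look roughly like this; a sketch assuming the guard sits at the top of the pass, not the merged code:

```python
from torch.fx import Node

def _check_single_dq_input(ref_node: Node) -> Node:
    # Guard for the "dq -> ref_fp32_module -> q" pattern this pass assumes:
    # exactly one input, and that input must be a node (the dequantize node).
    assert len(ref_node.args) == 1, (
        f"expected a single input for {ref_node}, got {len(ref_node.args)}"
    )
    dq_node = ref_node.args[0]
    assert isinstance(dq_node, Node)
    return dq_node
```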
Thanks for the suggestions, @jerryzh168. I followed up on these two steps:
- In this PR, I only add the check of `len(ref_node.args) == 1` for this pass (`_lower_static_weighted_ref_module`).
- I have added another lowering pass for `conv -> add`, named `_lower_static_weighted_ref_module_with_two_dq_inputs`, in [Quant][FX] Lower QConvAdd2d for onednn backend #91153 (a rough sketch of the idea follows below). Could you take a look at that lowering pass as we discussed here? Does it look good to you?
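A hypothetical sketch of what that two-dq-input lowering step does, based only on the description above and not taken from #91153:

```python
from torch.fx import Node

def _remove_two_dq_inputs(ref_node: Node) -> None:
    # The fused conv + add reference module takes two inputs: the conv input
    # and the extra tensor being added. Each is expected to come from its own
    # dequantize node, and both are bypassed here.
    assert len(ref_node.args) == 2
    for dq_node in ref_node.args:
        assert isinstance(dq_node, Node)
        ref_node.replace_input_with(dq_node, dq_node.args[0])
```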
Hi @jerryzh168, could you help to take a look at this fix again?
Sounds great, could you update the summary for this PR as well?
…input ghstack-source-id: 08a8267 Pull Request resolved: pytorch#90157
jerryzh168 left a comment:
Thanks, please update the summary as well.
Updated the summary for this PR.

@pytorchbot rebase

@pytorchbot successfully started a rebase job. Check the current status here.
…as multi input args"

**Summary**: When converting a ref module into a quant module, the `_lower_static_weighted_ref_module` pass assumes the `ref_node` only has 1 input node, and only removes the first `dequant` node. We add a check in this PR to ensure this is the case for the `_lower_static_weighted_ref_module` pass.

**Test Plan**: We only add a check in this PR; no new test case is added.

cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 Xia-Weiwen mingfeima XiaobingSuper sanchitintel ashokei jingxu10

[ghstack-poisoned]
Successfully rebased

@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.

The merge job was canceled. If you believe this is a mistake, then you can re-trigger it through pytorch-bot.

@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
Summary:
When converting a ref module into a quant module, the `_lower_static_weighted_ref_module` pass assumes the `ref_node` only has 1 input node, and only removes the first `dequant` node. We add a check in this PR to ensure this is the case for the `_lower_static_weighted_ref_module` pass.

Test Plan:
We only add a check in this PR; no new test case is added.
cc @jerryzh168 @jianyuh @raghuramank100 @jamesr66a @vkuzo @jgong5 @Xia-Weiwen @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10