Conversation

@Xia-Weiwen
Collaborator

@Xia-Weiwen Xia-Weiwen commented Dec 1, 2022

Summary
This work continues #83784 by @vkuzo and includes all the changes from that PR.
Quote from #83784:

Issue #83658 reports that ops followed by a certain pattern of view and size ops were not quantized correctly by FX graph mode quantization.
Before this PR, the "size" op was in the "op shares qparams with input" category, and the code assumed that the input of this op has the same dtype as its output. This led to incorrectly propagating the int dtype as the output dtype of whichever op preceded the view op, which in turn blocklisted that op from quantization.

The fix is to create a new category of ops that operate on tensors of various dtypes but are not observed themselves. This PR does so for size, and also for shape, which works the same way.
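
A minimal sketch of the pattern in question (the module, shapes and qconfig below are illustrative, not taken from this PR's tests):

import torch
from torch.ao.quantization import get_default_qconfig_mapping
from torch.ao.quantization.quantize_fx import convert_fx, prepare_fx

class M(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(8, 16)

    def forward(self, x):
        x = self.linear(x)
        # x.size(0) returns an int, not a tensor; before this fix the int dtype
        # was propagated backwards through "size", blocking quantization of linear.
        return x.view(x.size(0), -1)

example_inputs = (torch.randn(4, 8),)
m = prepare_fx(M().eval(), get_default_qconfig_mapping("fbgemm"), example_inputs)
m(*example_inputs)  # calibration
m = convert_fx(m)   # with this change, linear should be converted to a quantized linear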

Note: this PR needs #91297 to land first; otherwise there is a unit test (UT) failure.

Test plan

python test/test_quantization.py -k test_linear_size_view
python test/test_quantization.py -k test_linear_shape_view

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

@pytorch-bot

pytorch-bot bot commented Dec 1, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90001

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 5adc1b8:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Collaborator

@jgong5 jgong5 left a comment


LGTM. A separate question on the backend configuration. It seems awkward to me that we have to add such a common configuration to all the backends. Maybe we can have a group of common configurations that can be applied directly to all the backends?

@Xia-Weiwen
Collaborator Author

LGTM. A separate question on the backend configuration. It seems awkward to me that we have to add such a common configuration to all the backends. Maybe we can have a group of common configurations that can be applied directly to all the backends?

Yes. Maybe we can do that in another PR.
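
For illustration only, one possible shape of such a shared group; the helper name and the way the patterns are registered below are hypothetical, not the actual torch.ao.quantization API:

from torch.ao.quantization.backend_config import BackendPatternConfig

def _get_tensor_info_op_configs():
    # Hypothetical shared helper: "size" and "shape" only report tensor
    # metadata, need no observers, and behave the same on every backend,
    # so each backend's BackendConfig could simply extend this common list
    # instead of duplicating the entries.
    return [BackendPatternConfig(op) for op in ("shape", "size")]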

@Xia-Weiwen Xia-Weiwen marked this pull request as ready for review December 2, 2022 00:42
@Xia-Weiwen Xia-Weiwen changed the title [FX][Quant] Enable FX quant for pattern like x.view(x.size(...), ...) [FX][Quant] Enable FX quant for patterns like x.view(x.size(...), ...) Dec 2, 2022
@Xia-Weiwen Xia-Weiwen added the intel label Dec 2, 2022
@Xia-Weiwen
Collaborator Author

Hi @vkuzo @jerryzh168 @z-a-f Could you help review this? Thanks!

Contributor

@jerryzh168 jerryzh168 Dec 14, 2022


nit: this can also be:

if not is_get_tensor_info_node(n):
    continue

to reduce indentation
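
(For readers without the surrounding diff: this is the usual guard-clause refactor; the loop and helper below are placeholders, not the actual code.)

# nested form
for n in nodes:
    if is_get_tensor_info_node(n):
        handle(n)

# early-continue form suggested above, one level less indentation
for n in nodes:
    if not is_get_tensor_info_node(n):
        continue
    handle(n)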

Collaborator Author


OK. It's fixed now.

Contributor

@jerryzh168 jerryzh168 left a comment


LGTM. Maybe ask @vkuzo to take a look as well since he authored the original PR.

@Xia-Weiwen Xia-Weiwen force-pushed the fx_quant_size_op branch 2 times, most recently from edb212b to 9d6abd7, on December 21, 2022 07:37
@jerryzh168
Contributor

Hi @Xia-Weiwen, any plans to land this?

@Xia-Weiwen
Collaborator Author

Hi @Xia-Weiwen, any plans to land this?

Hi @jerryzh168. This PR was waiting for #91297 to land first, which is why it has not landed yet. I was trying to land #91297, but some unrelated CI checks failed. I will land both PRs today if there are no further CI issues.

@Xia-Weiwen
Collaborator Author

The latest master branch is causing a lot of CI failures right now, probably due to commit 46f16b9. I will merge this tomorrow if CI checks pass.

@Xia-Weiwen
Collaborator Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk label Jan 27, 2023
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

@Xia-Weiwen Xia-Weiwen deleted the fx_quant_size_op branch November 13, 2024 06:07

Labels

ciflow/trunk (Trigger trunk jobs on your pull request), intel (This tag is for PR from Intel), Merged, open source, release notes: quantization (release notes category)

Projects

Status: Done

5 participants