[quant][graph] Add support for FP16 dynamic quant #42222
Conversation
```diff
   }
   if (quant_type == QuantType::DYNAMIC) {
-    if (isFP16NoopObserver(module, observer)) {
+    if (isFp16Observer(observer->input(0))) {
```
Reviewer: can we just check dtype here?
Author: I think so; I was just being more explicit by checking for the observer type as well.
```diff
-  auto observer_module = module.attr(findObserverName(v).value()).toModule();
-  return (observer_module.attr("dtype") == at::ScalarType::Half) &&
-      isNoopObserver(observer);
+bool isFp16Observer(Value* observer) {
```
Reviewer: do we need this check? I think checking dtype is enough for our purposes.
```python
observer_name = 'Fp16Observer = prim::GetAttr[name="_observer_'
FileCheck().check(observer_name) \
           .run(m.fc.graph)
```
Reviewer: Looks like this check is not very useful; what do we want to check here?
Author: It is just an additional check that the observer name matches.
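For context, a FileCheck assertion like the one above usually runs inside a larger graph-mode test. The following is a minimal, hypothetical sketch of that flow; the `LinearModel` module, the qconfig-dict layout, and the use of `prepare_dynamic_jit` with `float16_dynamic_qconfig` are illustrative assumptions, not code taken from this PR or from test_quantization.py.

```python
import torch
from torch.testing import FileCheck
from torch.quantization import float16_dynamic_qconfig, prepare_dynamic_jit


class LinearModel(torch.nn.Module):  # illustrative module, not from the PR
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(5, 5)

    def forward(self, x):
        return self.fc(x)


m = torch.jit.script(LinearModel()).eval()
# With a float16 qconfig, activation observers are skipped and only the
# weight gets an observer inserted into the submodule's graph.
m = prepare_dynamic_jit(m, {'': float16_dynamic_qconfig})

# Generic form of the assertion discussed above: an observer attribute
# was attached to the fc submodule's forward graph.
FileCheck().check('prim::GetAttr[name="_observer_') \
           .run(m.fc.graph)
```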
jerryzh168 left a comment: LG, had a few inline comments.
This pull request has been merged in 6bd46b5.
Stack from ghstack:
Summary:
This change adds the necessary passes to perform FP16 dynamic quantization.
We skip inserting observers for activations based on the dtype (torch.float16) and only insert the Fp16Observer for weights.
Test Plan:
python test/test_quantization.py TestQuantizeJitOps
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D22849220
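As a usage-level illustration of what this change enables, here is a hedged sketch of invoking graph-mode FP16 dynamic quantization. The API names (`quantize_dynamic_jit`, `float16_dynamic_qconfig`) and the module `M` are assumptions based on the torch.quantization surface of this era and may differ in later releases (e.g. under torch.ao.quantization).

```python
import torch
from torch.quantization import float16_dynamic_qconfig, quantize_dynamic_jit


class M(torch.nn.Module):  # illustrative model, not from the PR
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(8, 8)

    def forward(self, x):
        return self.fc(x)


model = torch.jit.script(M()).eval()

# With a float16 qconfig, no activation observers are inserted; only the
# linear weight goes through the Fp16 observer path.
quantized = quantize_dynamic_jit(model, {'': float16_dynamic_qconfig})

out = quantized(torch.randn(2, 8))
```

This matches the behavior described in the summary: activations are left in fp32 and only the weights receive fp16 handling.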