quant bench: update observer configs #42956

vkuzo · 2020-08-13T00:52:20Z

Stack from ghstack:

observers: use torch.all to check for valid min and max values #43151 observers: use torch.all to check for valid min and max values
observers: use clamp instead of min/max in calculate_qparams #43150 observers: use clamp instead of min/max in calculate_qparams
observers: make eps a buffer #43149 observers: make eps a buffer
quant bench: update observer configs #42956 quant bench: update observer configs

Summary:

In preparation for observer perf improvement, cleans up the
micro benchmarks:

disable CUDA for histogram observers (it's too slow)
add larger shapes for better representation of real workloads

Test Plan:

cd benchmarks/operator_benchmark
python -m pt.qobserver_test

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D23093996

Summary: In preparation for observer perf improvement, cleans up the micro benchmarks: * disable CUDA for histogram observers (it's too slow) * add larger shapes for better representation of real workloads Test Plan: ``` cd benchmarks/operator_benchmark python -m pt.qobserver_test ``` Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

dr-ci · 2020-08-13T06:57:58Z

💊 CI failures summary and remediations

As of commit d96e549 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions on the GitHub issue tracker or post in the (internal) Dr. CI Users group.

See how this bot performed.

This comment has been revised 3 times.

Summary: In preparation for observer perf improvement, cleans up the micro benchmarks: * disable CUDA for histogram observers (it's too slow) * add larger shapes for better representation of real workloads Test Plan: ``` cd benchmarks/operator_benchmark python -m pt.qobserver_test ``` Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23093996](https://our.internmc.facebook.com/intern/diff/D23093996) [ghstack-poisoned]

Summary: In preparation for observer perf improvement, cleans up the micro benchmarks: * disable CUDA for histogram observers (it's too slow) * add larger shapes for better representation of real workloads Test Plan: ``` cd benchmarks/operator_benchmark python -m pt.qobserver_test ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 6047570 Pull Request resolved: #42956

raghuramank100 · 2020-08-17T19:24:15Z

benchmarks/operator_benchmark/pt/qobserver_test.py


    def forward(self):
-        return self.op_func(self.f_input)
+        self.op_func(self.f_input)


Previously we had a forward and qparam benchmark separately which might be more useful in practice. We call forward for multiple iterations and calcqparams once at convert. With the separate ones, we can also synthesize the time taken for the combined forward+calcqparam call. Is there a reason to prefer this way of doing profiling?

this is making the benchmark represent what happens inside the observer during QAT, not keeping the old code around because I'm not aware of a need for it in the near future. We have separate benchmarks for histogram observers, and I'm not aware of any requests to optimize observers outside of QAT + histogram observers.

calculate_qparams is called at every pass through the observer during QAT, when observers are enabled

facebook-github-bot · 2020-08-18T00:16:34Z

This pull request has been merged in 5aa61af.

This was referenced Aug 13, 2020

min_max kernel: add CUDA #42868

Closed

_min_max_val.dim: CPU implementation #42894

Closed

_min_max.dim: CUDA implementation #42943

Closed

quant: switch observers to use min_max #42957

Closed

vkuzo requested review from jerryzh168, raghuramank100, supriyar and z-a-f August 13, 2020 00:55

supriyar approved these changes Aug 13, 2020

View reviewed changes

This was referenced Aug 17, 2020

observers: make eps a buffer #43149

Closed

observers: use clamp instead of min/max in calculate_qparams #43150

Closed

observers: use torch.all to check for valid min and max values #43151

Closed

raghuramank100 reviewed Aug 17, 2020

View reviewed changes

facebook-github-bot closed this in 5aa61af Aug 18, 2020

facebook-github-bot added the merged label Aug 18, 2020

facebook-github-bot deleted the gh/vkuzo/122/head branch August 21, 2020 14:16

mruberry added the Merged label Oct 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

quant bench: update observer configs #42956

quant bench: update observer configs #42956

Uh oh!

vkuzo commented Aug 13, 2020 •

edited

Loading

Uh oh!

dr-ci bot commented Aug 13, 2020 •

edited

Loading

Uh oh!

raghuramank100 Aug 17, 2020 •

edited

Loading

Uh oh!

vkuzo Aug 17, 2020

Uh oh!

vkuzo Aug 17, 2020

Uh oh!

facebook-github-bot commented Aug 18, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

quant bench: update observer configs #42956

quant bench: update observer configs #42956

Uh oh!

Conversation

vkuzo commented Aug 13, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dr-ci bot commented Aug 13, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💊 CI failures summary and remediations

Uh oh!

raghuramank100 Aug 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vkuzo Aug 17, 2020

Choose a reason for hiding this comment

Uh oh!

vkuzo Aug 17, 2020

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Aug 18, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

vkuzo commented Aug 13, 2020 •

edited

Loading

dr-ci bot commented Aug 13, 2020 •

edited

Loading

raghuramank100 Aug 17, 2020 •

edited

Loading