quant: add q_batchnorm_1d op #42491
Conversation
Summary:

Hooks up quantized batchnorm_1d to the quantized_bn kernel. Eager mode hookup will be in a future PR, and graph mode should work after this PR.

Note: the implementation is currently ~2x slower on the benchmark than q_batch_norm2d because we convert back to contiguous memory format at the end, since channels_last is only defined for rank >= 4. If further optimization is needed, that can be a separate PR (it will need the NHWC folks to see if there is a workaround). Meanwhile, having this is better than not having anything.

Context: there have been both internal and external requests for various quantized BN1d use cases.

Test Plan:

```
python test/test_quantization.py TestQuantizedOps.test_batch_norm_1d_2d_3d
python test/test_quantization.py TestQuantizedOps.test_batch_norm_1d_2d_3d_relu
python test/test_quantization.py TestQuantizeJitOps.test_qbatch_norm
```

Performance: https://gist.github.com/vkuzo/73a07c0f24c05f5804990d9ebfaecf5e
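For readers unfamiliar with the rank constraint mentioned in the note above, the sketch below illustrates the general idea of routing a rank-3 quantized input through a rank-4 kernel. This is a simplified Python sketch, not the actual ATen implementation; the op signature of `torch.ops.quantized.batch_norm2d` is assumed to match the existing 2d path.

```python
# Simplified sketch (assumption: this mirrors the idea in the summary, not the
# actual ATen kernel). A rank-3 quantized input (N, C, L) gets a dummy spatial
# dimension so the rank-4 kernel can use its channels_last path; at the end the
# dummy dimension is dropped and the tensor is made contiguous again -- that
# extra copy is the source of the ~2x slowdown noted above.
import torch

def qbatch_norm1d_via_2d(qx, weight, bias, mean, var, eps, out_scale, out_zero_point):
    qx_4d = qx.unsqueeze(2)  # (N, C, L) -> (N, C, 1, L)
    qy_4d = torch.ops.quantized.batch_norm2d(
        qx_4d, weight, bias, mean, var, eps, out_scale, out_zero_point)
    # channels_last is only defined for rank >= 4, so after squeezing back to
    # rank 3 we convert to contiguous memory format.
    return qy_4d.squeeze(2).contiguous()
```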
💊 Dr. CI failures summary as of commit 322626b: ci.pytorch.org — 1 failed.
 @skipIfNoFBGEMM
-def test_batch_norm2d_relu(self):
+def test_batch_norm_1d_2d_3d_relu(self):
nit: maybe just test_batch_norm_relu and add dimensions in the docstring if needed.
sure, sounds good
 @skipIfNoFBGEMM
-def test_batch_norm3d(self):
+def test_batch_norm_1d_2d_3d(self):
same here
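To make the reviewer's suggestion concrete, a single combined test might look roughly like the sketch below. This is hypothetical and not the code in test/test_quantization.py; the op names follow the quantized::batch_norm{1d,2d,3d} pattern, and the shapes and tolerances are made up for illustration.

```python
# Hypothetical sketch of one test covering the 1d/2d/3d cases by looping over
# input ranks, quantizing, running the quantized op, and comparing against a
# float reference computed on the dequantized input.
import torch
import torch.nn.functional as F

def _check_quantized_batch_norm(shape, q_op):
    c = shape[1]
    x = torch.randn(shape)
    weight, bias = torch.rand(c), torch.rand(c)
    mean, var = torch.rand(c), torch.rand(c) + 0.1
    scale, zero_point = 0.1, 128

    qx = torch.quantize_per_tensor(x, scale, zero_point, torch.quint8)
    y_ref = F.batch_norm(qx.dequantize(), mean, var, weight, bias, eps=1e-5)

    qy = q_op(qx, weight, bias, mean, var, 1e-5, scale, zero_point)
    # Loose tolerance to absorb output quantization and rounding differences.
    assert torch.allclose(qy.dequantize(), y_ref, atol=2 * scale)

for shape, q_op in [((2, 4, 8), torch.ops.quantized.batch_norm1d),
                    ((2, 4, 8, 8), torch.ops.quantized.batch_norm2d),
                    ((2, 4, 4, 8, 8), torch.ops.quantized.batch_norm3d)]:
    _check_quantized_batch_norm(shape, q_op)
```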
jerryzh168 left a comment:
Looks good
Summary: Hooks up quantized batchnorm_1d to the quantized_bn kernel. Eager mode hookup will be in a future PR, and graph mode should work after this PR. Note: currently the implementation is ~2x slower on the benchmark than q_batch_norm2d because we convert back to contiguous memory format at the end, since channels_last is only defined for rank >= 4. If further optimization is needed, that can be a separate PR (will need the NHWC folks to see if there is a workaround). Meanwhile, having this is better than not having anything. Context: There have been both internal and external requests for various quantized BN1d use cases. Test Plan: ``` python test/test_quantization.py TestQuantizedOps.test_batch_norm_1d_2d_3d python test/test_quantization.py TestQuantizedOps.test_batch_norm_1d_2d_3d_relu python test/test_quantization.py TestQuantizeJitOps.test_qbatch_norm // performance: // https://gist.github.com/vkuzo/73a07c0f24c05f5804990d9ebfaecf5e ``` Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D22926254](https://our.internmc.facebook.com/intern/diff/D22926254) [ghstack-poisoned]
This pull request has been merged in 50f0d2b.
Differential Revision: [D22926254](https://our.internmc.facebook.com/intern/diff/D22926254)