[PyTorch Edge] Add Quantized Softmax Op (Naive Implementation) #75017

salilsdesai · 2022-03-31T13:48:59Z

Summary:
This version just does dequantize, fp32 softmax, quantize.
Another version of actual quantized softmax using qnnpack will be added next

Test Plan:
From fbcode:
buck test caffe2/test:quantization -- test_qsoftmax

Benchmarking: See summary of D34996486

Reviewed By: kimishpatel

Differential Revision: D34943147

Summary: This version just does dequantize, fp32 softmax, quantize. Another version of actual quantized softmax using qnnpack will be added next Test Plan: From fbcode: ```buck test caffe2/test:quantization -- test_qsoftmax``` Benchmarking: See summary of D34996486 Reviewed By: kimishpatel Differential Revision: D34943147 fbshipit-source-id: dcebecd55e8da989ab4937c2b8db38baaf32b700

facebook-github-bot · 2022-03-31T13:49:05Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/75017
Need help or want to give feedback on the CI? Visit our office hours

💊 CI failures summary and remediations

As of commit 1e34275 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

facebook-github-bot · 2022-03-31T13:49:39Z

This pull request was exported from Phabricator. Differential Revision: D34943147

Summary: Pull Request resolved: #75017 This version just does dequantize, fp32 softmax, quantize. Another version of actual quantized softmax using qnnpack will be added next Test Plan: From fbcode: ```buck test caffe2/test:quantization -- test_qsoftmax``` Benchmarking: See summary of D34996486 Reviewed By: kimishpatel Differential Revision: D34943147 fbshipit-source-id: 426a0780803597a21460139c67960891d6e9cc81

github-actions · 2022-03-31T19:32:41Z

Hey @salilsdesai.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

Summary: In #75017 a quantized softmax kernel was added. This PR adds the eager mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. A future PR will add FX graph mode workflow integration. Test plan: ``` python test/test_quantization.py TestQuantizeEagerPTQStatic.test_softmax ``` [ghstack-poisoned]

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` [ghstack-poisoned]

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Differential Revision: [D35324817](https://our.internmc.facebook.com/intern/diff/D35324817) [ghstack-poisoned]

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: 097b951 Pull Request resolved: #75106

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Differential Revision: [D35324817](https://our.internmc.facebook.com/intern/diff/D35324817) [ghstack-poisoned]

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: 0af2cec Pull Request resolved: #75106

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Differential Revision: [D35324817](https://our.internmc.facebook.com/intern/diff/D35324817) [ghstack-poisoned]

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: 9eb7056 Pull Request resolved: #75106

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Differential Revision: [D35324817](https://our.internmc.facebook.com/intern/diff/D35324817) [ghstack-poisoned]

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: e555ffe Pull Request resolved: #75106

facebook-github-bot · 2022-04-05T07:51:44Z

This pull request has been reverted by 1bcae0d. To re-land this change, please open another pull request, assignthe same reviewers, fix the CI failures that caused the revert and make sure that the failing CI runs on the PR by applying the proper ciflow label (e.g., ciflow/trunk).

facebook-github-bot · 2022-04-05T16:57:59Z

This pull request has been reverted by b99b7182aeaf77d07978708b3e1630260e342974. To re-land this change, please open another pull request, assignthe same reviewers, fix the CI failures that caused the revert and make sure that the failing CI runs on the PR by applying the proper ciflow label (e.g., ciflow/trunk).

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: 9ec762e Pull Request resolved: #75106

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Differential Revision: [D35324817](https://our.internmc.facebook.com/intern/diff/D35324817) [ghstack-poisoned]

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: 8dec27f Pull Request resolved: #75106

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: 535666a Pull Request resolved: #75106

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Differential Revision: [D35324817](https://our.internmc.facebook.com/intern/diff/D35324817) [ghstack-poisoned]

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: 7d83ed1 Pull Request resolved: #75106

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` ghstack-source-id: f45b660 Pull Request resolved: #75106

Summary: In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Differential Revision: [D35324817](https://our.internmc.facebook.com/intern/diff/D35324817) [ghstack-poisoned]

Summary: Pull Request resolved: #75106 In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test Plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Reviewed By: kimishpatel, andrewor14 Differential Revision: D35324817 Pulled By: vkuzo fbshipit-source-id: 710ae3bedf8a6ad1dc411cd9808fdd0ce743e757

Summary: Pull Request resolved: #75106 In #75017 a quantized softmax kernel was added. This PR adds the FX graph mode quantization workflow integration to swap `nn.Softmax` to `nnq.Softmax`. Test Plan: ``` python test/test_quantization.py TestQuantizeFxOps.test_fixed_qparams_ops ``` Reviewed By: kimishpatel, andrewor14 Differential Revision: D35324817 Pulled By: vkuzo fbshipit-source-id: 710ae3bedf8a6ad1dc411cd9808fdd0ce743e757 (cherry picked from commit d67603c)

facebook-github-bot added the cla signed label Mar 31, 2022

facebook-github-bot added the fb-exported label Mar 31, 2022

pytorchmergebot closed this in 8d7242a Mar 31, 2022

vkuzo mentioned this pull request Apr 1, 2022

fx quant: add quantized Softmax workflow integration #75106

Closed

facebook-github-bot added the Reverted label Apr 5, 2022

WBobby mentioned this pull request Aug 17, 2022

Add ROCm5.2.3/AMDGPU support for PyTorch WBobby/pytorch#2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[PyTorch Edge] Add Quantized Softmax Op (Naive Implementation) #75017

[PyTorch Edge] Add Quantized Softmax Op (Naive Implementation) #75017

Uh oh!

salilsdesai commented Mar 31, 2022

Uh oh!

facebook-github-bot commented Mar 31, 2022 •

edited

Loading

Uh oh!

facebook-github-bot commented Mar 31, 2022

Uh oh!

github-actions bot commented Mar 31, 2022

Uh oh!

facebook-github-bot commented Apr 5, 2022

Uh oh!

facebook-github-bot commented Apr 5, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[PyTorch Edge] Add Quantized Softmax Op (Naive Implementation) #75017

[PyTorch Edge] Add Quantized Softmax Op (Naive Implementation) #75017

Uh oh!

Conversation

salilsdesai commented Mar 31, 2022

Uh oh!

facebook-github-bot commented Mar 31, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful links

💊 CI failures summary and remediations

Uh oh!

facebook-github-bot commented Mar 31, 2022

Uh oh!

github-actions bot commented Mar 31, 2022

Uh oh!

facebook-github-bot commented Apr 5, 2022

Uh oh!

facebook-github-bot commented Apr 5, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

facebook-github-bot commented Mar 31, 2022 •

edited

Loading