-
Notifications
You must be signed in to change notification settings - Fork 26.3k
[quant][gpu][core] Implemented quantized add operator using cudnn [reland PR74463] #74463
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Summary: This PR implements the quantized add operator using cudnn operations. A test case was added .... Test Plan: TBA [ghstack-poisoned]
CI Flow Status⚛️ CI FlowRuleset - Version:
|
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit d378f4c (more details on the Dr. CI page): 💚 💚 Looks good so far! There are no failures yet. 💚 💚 This comment was automatically generated by Dr. CI (expand for details).Please report bugs/suggestions to the (internal) Dr. CI Users group. |
|
@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
… cudnn" Summary: This PR implements the quantized add operator using cudnn operations. A test case was added .... Test Plan: TBA Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
… cudnn" Summary: This PR implements the quantized add operator using cudnn operations. A test case was added .... Test Plan: TBA Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
… cudnn" Summary: This PR implements the quantized add operator using cudnn operations. A test case was added .... Test Plan: TBA Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
… cudnn" Summary: This PR implements the quantized add operator using cudnn operations. Added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
… cudnn" Summary: This PR implements the quantized add operator using cudnn operations. Added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
Summary: This PR implements the quantized add operator using cudnn operations. Added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` ghstack-source-id: 908412b Pull Request resolved: #74463
…udnn" Summary: This PR implements the quantized add operator using cudnn operations. Added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
Summary: This PR implements the quantized add operator using cudnn operations. Added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` ghstack-source-id: ef6fbf9 Pull Request resolved: #74463
|
We have recently simplified the CIFlow labels and
|
…udnn" Summary: This PR implements the quantized add operator using cudnn operations. Added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
…g cudnn" Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
…g cudnn" Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` ghstack-source-id: 4de9f48 Pull Request resolved: #74463
|
@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
1 similar comment
|
@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
…4463) Summary: Pull Request resolved: #74463 This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` TBA Differential Revision: D35009111 D35009111 Reviewed By: jerryzh168 Pulled By: dzdang fbshipit-source-id: 13afa7f0192ffaf1f36334b1af827202c7dd0f74
| std::vector<int64_t> new_sizes(3, 1); | ||
| // cudnn expects leading dimensions to be the dummy dimensions | ||
| new_sizes.back() = qa.sizes().back(); | ||
| if (qa.ndim() == 2) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This line should be qa.dim() instead, similar to previous line 67.
|
This pull request has been reverted by 7bb0133. To re-land this change, please open another pull request, assignthe same reviewers, fix the CI failures that caused the revert and make sure that the failing CI runs on the PR by applying the proper ciflow label (e.g., ciflow/trunk). |
…g cudnn" Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` ghstack-source-id: eb4c059 Pull Request resolved: #74463
…g cudnn" Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
…g cudnn [reland PR74463]" Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
…land PR74463] Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` ghstack-source-id: 4713a57 Pull Request resolved: #74463
…g cudnn [reland PR74463]" Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` a Differential Revision: [D35009111](https://our.internmc.facebook.com/intern/diff/D35009111) [ghstack-poisoned]
…g cudnn [reland PR74463]" Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` [ghstack-poisoned]
…land PR74463] Summary: This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` ghstack-source-id: 4713a57 Pull Request resolved: #74463
|
@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
…land PR74463] (#74463) Summary: Pull Request resolved: #74463 This PR implements the quantized add operator using cudnn operations. Also added a corresponding test function in test_quantized_op.py. Ideally, we should merge this function with the cpu variant, but for now, we will keep it separate until cudnn v8 is in the default build. Other factors also complicate the merge as cudnn quantized add is currently only supported for int8 symmetrically quantized tensors. Test Plan: In pytorch main dir, execute ``` python test/test_quantization.py TestQuantizedOps.test_qadd_relu_cudnn ``` Reviewed By: ngimel Differential Revision: D35218224 Pulled By: dzdang fbshipit-source-id: a2e57e0b46cff655f2fb77000ea4db3a558a0851
Stack from ghstack (oldest at bottom):
Summary:
This PR implements the quantized add operator using cudnn operations.
Also added a corresponding test function in test_quantized_op.py. Ideally,
we should merge this function with the cpu variant, but for now, we will
keep it separate until cudnn v8 is in the default build. Other factors also
complicate the merge as cudnn quantized add is currently only supported for
int8 symmetrically quantized tensors.
Test Plan:
In pytorch main dir, execute
Differential Revision: D35218224