[ao][sparsity] composability for sparsity and QAT convert #74848
Conversation
Summary: WIP Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
💊 CI failures summary and remediations: as of commit 0cc5bdc, 💚 looks good so far! There are no failures yet. (This comment was automatically generated by Dr. CI.)
Summary: The primary issue for enabling sparsity to work with QAT convert (unlike normal quantization convert) is that when the parametrized module undergoes the QAT convert, the parametrizations need to be maintained. If the parametrizations don't get transferred during the convert, the sparsifier loses its connection to the model. In practice this is handled using the transfer_parametrizations_and_params function to identify all parametrizations on the original module and then move them (and their associated parameters) to the new module.

Test Plan: python test/test_ao_sparsity.py TestComposability

Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
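The mechanism described above can be sketched with a toy model. This is a plain-Python illustration of the idea, not the actual torch.nn.utils.parametrize implementation; all class and attribute names here are illustrative:

```python
class FakeSparsity:
    """Toy parametrization: applies a fixed binary mask to the raw weight."""
    def __init__(self, mask):
        self.mask = mask

    def __call__(self, original):
        return [w * m for w, m in zip(original, self.mask)]


class ParametrizedModule:
    """Toy module: keeps the raw weight alongside its parametrizations."""
    def __init__(self, weight):
        self.original_weight = weight   # raw parameter
        self.parametrizations = {}      # name -> parametrization object

    @property
    def weight(self):
        # The visible weight is computed by applying every parametrization.
        w = self.original_weight
        for p in self.parametrizations.values():
            w = p(w)
        return w


def transfer_parametrizations_and_params(from_module, to_module):
    # Move the raw parameter AND its parametrizations, so anything holding a
    # reference to the parametrizations (e.g. a sparsifier) stays connected
    # to the new module after a convert-style module swap.
    to_module.original_weight = from_module.original_weight
    to_module.parametrizations = from_module.parametrizations


old = ParametrizedModule([1.0, 2.0, 3.0, 4.0])
old.parametrizations["sparsity"] = FakeSparsity([1, 0, 1, 0])

new = ParametrizedModule([0.0, 0.0, 0.0, 0.0])  # stand-in for the converted module
transfer_parametrizations_and_params(old, new)

print(new.weight)  # → [1.0, 0.0, 3.0, 0.0]: the mask still applies after the swap
```

If only the computed weight were copied and the parametrizations dropped, the sparsifier's mask object would no longer be attached to the new module, which is exactly the failure mode the PR addresses.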
Summary: The primary issue for enabling sparsity to work with QAT convert (unlike normal quantization convert) is that when the parametrized module undergoes the QAT convert, the parametrizations need to be maintained. If the parametrizations don't get transferred during the convert, the sparsifier loses its connection to the model. In practice this is handled using the transfer_parametrizations_and_params function to move the weight and bias, and any associated parametrizations, to the new module. This PR also adds tests for transfer_parametrizations_and_params and type_before_parametrizations, and adds comments to the test code.

Test Plan: python test/test_ao_sparsity.py TestComposability

ghstack-source-id: 57f9e42
Pull Request resolved: #74848
@HDCharles has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: The primary issue for enabling sparsity to work with QAT convert (unlike normal quantization convert) is that when the parametrized module undergoes the QAT convert, the parametrizations need to be maintained. If the parametrizations don't get transferred during the convert, the sparsifier loses its connection to the model. In practice this is handled using the transfer_parametrizations_and_params function to move the weight and bias, and any associated parametrizations, to the new module. This PR also adds tests for transfer_parametrizations_and_params and type_before_parametrizations to test_nn.py, and adds comments to the composability test code.

Test Plan: python test/test_ao_sparsity.py TestComposability
python test/test_nn.py TestNN

Differential Revision: [D35240272](https://our.internmc.facebook.com/intern/diff/D35240272) [ghstack-poisoned]
lezcano left a comment
Just one more thing I forgot! Parametrisations may be many-to-one if the right-inverse returns more than one tensor. You can find examples of these in test_multiple_inputs_parametrization. Could you add a test for this case?
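The many-to-one case lezcano describes can be sketched in plain Python, mirroring the shape of the torch.nn.utils.parametrize protocol (forward/right_inverse); the class and its behavior are illustrative only, not an actual PyTorch parametrization:

```python
class SplitInHalf:
    """Toy many-to-one parametrization: the parameter is rebuilt from TWO
    stored "tensors" (here, lists), so right_inverse returns a tuple."""

    def forward(self, a, b):
        # Many inputs -> one parameter (here, simple concatenation).
        return a + b

    def right_inverse(self, value):
        # One value -> many stored originals: split the value in half.
        mid = len(value) // 2
        return value[:mid], value[mid:]


p = SplitInHalf()
originals = p.right_inverse([1, 2, 3, 4])  # what gets stored: a tuple
print(originals)                           # → ([1, 2], [3, 4])
print(p.forward(*originals))               # → [1, 2, 3, 4]: round-trips
```

Because the stored originals form a tuple rather than a single tensor, any transfer logic that blindly copies the stored state as if it were one parameter will break for parametrizations like this, which is why a dedicated test for the many-to-one case is worth having.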
torch/nn/utils/parametrize.py (outdated)

    # need to initialize the param in to_module if it doesn't exist already
    if not hasattr(to_module, parameter_name):
        setattr(to_module, parameter_name, deepcopy(from_module.parametrizations[parameter_name].original))
I believe this branch is not tested, is it? Furthermore, I believe it is incorrect. I think it should be

    - setattr(to_module, parameter_name, deepcopy(from_module.parametrizations[parameter_name].original))
    + setattr(to_module, parameter_name, getattr(from_module, parameter_name))

Note that the original parameter may be a tuple if the parametrization is many-to-one, so setting it as an attribute would not be of much use.
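The distinction behind this suggestion can be illustrated with a plain-Python sketch (not the actual torch internals; the Product class and attribute names are hypothetical): for a many-to-one parametrization, the stored raw state is a tuple, while attribute access on the parametrized module yields the single computed value, which is the only thing safe to set on the new module:

```python
class Product:
    """Toy many-to-one parametrization: weight = u * v elementwise."""
    def forward(self, u, v):
        return [a * b for a, b in zip(u, v)]

    def right_inverse(self, value):
        # One value -> two stored factors (the trivial factorization).
        return list(value), [1.0] * len(value)


class ToyParametrizedModule:
    def __init__(self, weight, parametrization):
        self.parametrization = parametrization
        # What `.original` holds after registration: a TUPLE of raw tensors.
        self.original = parametrization.right_inverse(weight)

    @property
    def weight(self):
        # What getattr(module, "weight") yields: the single computed value.
        return self.parametrization.forward(*self.original)


m = ToyParametrizedModule([2.0, 3.0], Product())
print(type(m.original).__name__)  # → tuple: unusable as a plain parameter
print(m.weight)                   # → [2.0, 3.0]: a single value, safe to setattr
```

Copying `m.original` onto an unparametrized module would set a tuple where a parameter is expected; copying the computed value (what `getattr` returns) works for both the one-to-one and many-to-one cases.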
Yeah, it's tested on line 3282 of test_nn.py, within test_transfer_parametrizations_and_params. I can split it into another test if you like; some of the other parametrization tests were grouped in similar cases, so I wasn't sure.

I added your suggested change, and added a test for the many-to-one case, which works after some changes.
lezcano left a comment
LGTM CI abiding. Thank you for the many improvements and the testing on the parametrisations end!
@pytorchbot merge this (Initiating merge automatically since Phabricator Diff has merged)
Hey @HDCharles.
Summary: Pull Request resolved: #74848

The primary issue for enabling sparsity to work with QAT convert (unlike normal quantization convert) is that when the parametrized module undergoes the QAT convert, the parametrizations need to be maintained. If the parametrizations don't get transferred during the convert, the sparsifier loses its connection to the model. In practice this is handled using the transfer_parametrizations_and_params function to identify all parametrizations on the original module and then move them (and their associated parameters) to the new module.

Test Plan: python test/test_ao_sparsity.py TestComposability

Imported from OSS

Reviewed By: malfet

Differential Revision: D35240272

fbshipit-source-id: 08d6a938d5919ba2dfd8490b1c768fafc5b179dd
Stack from ghstack (oldest at bottom):
Summary: The primary issue for enabling sparsity to work with QAT convert (unlike normal quantization convert) is that when the parametrized module undergoes the QAT convert, the parametrizations need to be maintained. If the parametrizations don't get transferred during the convert, the sparsifier loses its connection to the model. In practice this is handled using the transfer_parametrizations_and_params function to move the weight and bias, and any associated parametrizations, to the new module. This PR also adds tests for transfer_parametrizations_and_params and type_before_parametrizations to test_nn.py, and adds comments to the composability test code.
Test Plan: python test/test_ao_sparsity.py TestComposability
python test/test_nn.py TestNN
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D35240272