[quant][pyper] Support quantization of ops in fork-wait subgraph #44048
Conversation
Summary:
Inline the fork-wait calls so that the ops to be quantized are visible in the main graph.
Also fix the InlineForkWait JIT pass to handle the case where the aten::wait call isn't present in the main graph and the subgraph returns a future tensor.
Example:
```
graph(%self.1 : __torch__.dper3.core.interop.___torch_mangle_6325.DperModuleWrapper,
%argument_1.1 : Tensor,
%argument_2.1 : Tensor):
%3 : Future[Tensor[]] = prim::fork_0(%self.1, %argument_1.1, %argument_2.1) # :0:0
return (%3)
with prim::fork_0 = graph(%self.1 : __torch__.dper3.core.interop.___torch_mangle_5396.DperModuleWrapper,
%argument_1.1 : Tensor,
%argument_2.1 : Tensor):
%3 : __torch__.dper3.core.interop.___torch_mangle_6330.DperModuleWrapper = prim::GetAttr[name="x"](%self.1)
%4 : __torch__.dper3.core.interop.___torch_mangle_5397.DperModuleWrapper = prim::GetAttr[name="y"](%self.1)
%5 : __torch__.dper3.core.interop.___torch_mangle_6327.DperModuleWrapper = prim::GetAttr[name="z"](%4)
%6 : Tensor = prim::CallMethod[name="forward"](%5, %argument_1.1, %argument_2.1) # :0:0
%7 : None = prim::CallMethod[name="forward"](%3, %6) # :0:0
%8 : Tensor[] = prim::ListConstruct(%6)
return (%8)
```
Test Plan:
python test/test_quantization.py test_interface_with_fork
Reviewers:
Subscribers:
Tasks:
Tags:
[ghstack-poisoned]
💊 CI failures summary and remediations. As of commit b4423cd (more details on the Dr. CI page):
🕵️ 2 new failures recognized by patterns. The following CI failures do not appear to be due to upstream breakages:
vkuzo
left a comment
lg, although feel free to wait for @jerryzh168 if this needs a deeper review
This pull request has been merged in 199c73b.
Stack from ghstack:
Differential Revision: D23481003