
Conversation

heitorschueroff (Contributor) commented Aug 5, 2020

Stack from ghstack:

Differential Revision: D23030317

heitorschueroff added a commit that referenced this pull request Aug 5, 2020
dr-ci bot commented Aug 5, 2020

💊 CI failures summary and remediations

As of commit edfd435 (more details on the Dr. CI page):


  • 2/2 failures possibly* introduced in this PR
    • 1/2 non-CircleCI failure(s)

🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build pytorch_macos_10_13_py3_test (1/1)

Step: "Test"

Aug 10 15:16:50 [E request_callback_no_python.cpp:618] Received error while processing request type 2: RuntimeError: Can not pickle torch.futures.Future
Aug 10 15:16:50 At: 
Aug 10 15:16:50   /Users/distiller/workspace/miniconda3/lib/python3.7/site-packages/torch/distributed/rpc/internal.py(93): serialize 
Aug 10 15:16:50   /Users/distiller/workspace/miniconda3/lib/python3.7/site-packages/torch/distributed/rpc/internal.py(145): serialize 
Aug 10 15:16:50  
Aug 10 15:16:50 [W tensorpipe_agent.cpp:504] RPC agent for worker0 encountered error when reading incoming request from worker1: EOF: end of file (this is expected to happen during shutdown) 
Aug 10 15:16:50 [W tensorpipe_agent.cpp:504] RPC agent for worker3 encountered error when reading incoming request from worker2: EOF: end of file (this is expected to happen during shutdown) 
Aug 10 15:16:50 [W tensorpipe_agent.cpp:504] RPC agent for worker0 encountered error when reading incoming request from worker3: EOF: end of file (this is expected to happen during shutdown) 
Aug 10 15:16:50 [W tensorpipe_agent.cpp:504] RPC agent for worker2 encountered error when reading incoming request from worker1: EOF: end of file (this is expected to happen during shutdown) 
Aug 10 15:16:50 [W tensorpipe_agent.cpp:504] RPC agent for worker0 encountered error when reading incoming request from worker2: EOF: end of file (this is expected to happen during shutdown) 

ci.pytorch.org: 1 failed



glaringlee (Contributor) commented:

@ezyang Hi Ed, would CUDAUtils.h be a good place to put this function? Can you advise?

ezyang (Contributor) commented Aug 7, 2020

This probably doesn't belong in the cuda/ folder because it doesn't actually have any direct dependencies on CUDA headers (everything is indirected through Context). Is there a bug report / issue corresponding to this? That would help me assess.

heitorschueroff (Contributor, Author) commented:

> This probably doesn't belong in the cuda/ folder because it doesn't actually have any direct dependencies on CUDA headers (everything is indirected through Context). Is there a bug report / issue corresponding to this? That would help me assess.

#40361

glaringlee (Contributor) commented Aug 7, 2020

> This probably doesn't belong in the cuda/ folder because it doesn't actually have any direct dependencies on CUDA headers (everything is indirected through Context). Is there a bug report / issue corresponding to this? That would help me assess.

@ezyang
We want a C++ API equivalent to torch.cuda.manual_seed; we already have an equivalent for torch.manual_seed, but not for torch.cuda.manual_seed.
The torch.manual_seed equivalent lives in the ATen folder here:

static inline void manual_seed(uint64_t seed) {

Please suggest where this torch.cuda.manual_seed equivalent should go, thanks.

ezyang (Contributor) commented Aug 7, 2020

Thanks, the issue helps.

There are a few things you have to be careful about when writing C++ API ports of PyTorch functionality. (This is mostly directed at @glaringlee, but hopefully this also gives you some useful context, @heitorschueroff.)

The function of directory structure in Python is different from that in C++. In Python, the torch.cuda structure is just for organizational purposes. In C++, the ATen/cuda folder structure has a very specific purpose: it demarcates what goes into the torch_cpu shared library versus the torch_cuda shared library. Because the dependency relationship between these libraries is unidirectional (torch_cuda depends on torch_cpu, but not vice versa), code in torch_cpu cannot directly call torch_cuda; it must indirect through a dynamic dispatch of some sort. (Python doesn't have this problem, because everything is indirected there!)

Furthermore, we must be careful to distinguish ATen/cuda from torch/csrc/api/include/torch/cuda.h. ATen/cuda files go into torch_cuda, but torch/cuda.h ships with the torch_cpu library and is available even if PyTorch wasn't compiled with CUDA (because it does dynamic dispatches). Finally, torch/cuda.h is what actually constitutes the public C++ API; headers in ATen are more private (they just happen to frequently get reexported in the torch namespace, and our "official" API is often spotty, so people go to ATen to find the stuff they actually need).

So what does this mean for this issue? The user is requesting torch::cuda::manual_seed in the public C++ API. To me, that indicates that it should go in torch/csrc/api/include/torch/cuda.h. The API should be available even if you didn't build with CUDA support. You should make sure that the new functions are implemented equivalently to their Python counterparts.

glaringlee (Contributor) commented:

@ezyang Thanks a lot, Ed; this is very useful info for me.

heitorschueroff added a commit that referenced this pull request Aug 7, 2020
glaringlee (Contributor) left a review comment:

LGTM now.


heitorschueroff added a commit that referenced this pull request Aug 10, 2020
facebook-github-bot (Contributor) commented:

@heitorschueroff merged this pull request in d396d13.

@facebook-github-bot facebook-github-bot deleted the gh/heitorschueroff/3/head branch August 15, 2020 14:17