
Conversation

@colesbury
Member

Previously, CUDAGenerator::CUDAGenerator would initialize the random
number generator on the current device, usually device 0. This is
undesirable because initializing the CUDA context allocates a few hundred
MB due to all the kernels in libTHC.so.

This avoids the unnecessary call to THCRandom_getGenerator() in the
CUDAGenerator constructor.

Fixes #7320
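For illustration, a minimal sketch of the lazy-initialization shape this describes, using stand-in names (`LazyCudaGenerator`, `CudaRngState`, `create_cuda_rng_state`) rather than the actual ATen/THC code: construction stays cheap and context-free, and the expensive call is deferred to first use.

```cpp
#include <memory>

struct CudaRngState {};  // stand-in for the per-device THC generator state

// Stand-in for THCRandom_getGenerator(): creating the real state is what
// forces a CUDA context (and the few hundred MB of libTHC.so kernels).
std::unique_ptr<CudaRngState> create_cuda_rng_state() {
  return std::unique_ptr<CudaRngState>(new CudaRngState());
}

class LazyCudaGenerator {
 public:
  LazyCudaGenerator() = default;  // no CUDA work in the constructor

  CudaRngState& state() {
    if (!state_) {
      state_ = create_cuda_rng_state();  // context created only on first use
    }
    return *state_;
  }

 private:
  std::unique_ptr<CudaRngState> state_;
};
```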

@apaszke
Contributor

apaszke commented May 9, 2018

Can we add a test that checks for this unwanted init? It's not the first time we've hit this regression.

@colesbury
Member Author

@apaszke, that's a good idea, but I'm not sure of a good way to do it. The CUDA driver API provides cuDevicePrimaryCtxGetState(CUdevice dev, unsigned int* flags, int* active). @ngimel is there an equivalent way to check whether the primary context is active using the CUDA runtime API?
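For reference, a minimal sketch of that driver-API check (compile with -lcuda): cuInit() itself does not create a context, so a test could call cuDevicePrimaryCtxGetState() and fail if the primary context on device 0 has unexpectedly become active.

```cpp
#include <cstdio>
#include <cstdlib>
#include <cuda.h>

// Abort with a message if a driver API call fails; error handling is
// deliberately reduced to a bare check for this sketch.
static void check(CUresult rc, const char* what) {
  if (rc != CUDA_SUCCESS) {
    std::fprintf(stderr, "%s failed with CUresult %d\n", what, static_cast<int>(rc));
    std::exit(1);
  }
}

// Returns true if the primary context on the given device is active.
// cuInit() initializes the driver API but does not create a context itself.
bool primary_context_is_active(int ordinal) {
  check(cuInit(0), "cuInit");
  CUdevice dev;
  check(cuDeviceGet(&dev, ordinal), "cuDeviceGet");
  unsigned int flags = 0;
  int active = 0;
  check(cuDevicePrimaryCtxGetState(dev, &flags, &active), "cuDevicePrimaryCtxGetState");
  return active != 0;
}

int main() {
  std::printf("device 0 primary context active: %d\n", primary_context_is_active(0));
  return 0;
}
```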

@ezyang ezyang merged commit 976b1d5 into pytorch:master May 13, 2018
onnxbot added a commit to onnxbot/onnx-fb-universe that referenced this pull request May 13, 2018
@ngimel
Collaborator

ngimel commented May 13, 2018

@colesbury unfortunately the CUDA runtime API ignores the existence of contexts altogether, so there's no way to query the context through it.

weiyangfb pushed a commit to weiyangfb/pytorch that referenced this pull request Jun 11, 2018
…ytorch#7392)

Previously, CUDAGenerator::CUDAGenerator would initialize the random
number generator on the current device, usually device 0. This is
undesirable because initializing the CUDA context allocates a few hundred
MB due to all the kernels in libTHC.so.

This avoids the unnecessary call to THCRandom_getGenerator() in the
CUDAGenerator constructor.

Fixes pytorch#7320

* Fix call to get THCState