Fix broadcast copying device[0] tensor when not using NCCL #8222
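A minimal sketch of the behavior this PR targets, assuming torch.cuda.comm.broadcast and at least two visible GPUs (the checks below are an illustration, not the PR's own code): when the source tensor already lives on devices[0] and the non-NCCL path is taken, the first returned tensor should reuse the source's storage instead of being an extra copy.

```python
import torch
from torch.cuda import comm

# Illustration only: requires >= 2 CUDA devices.
src = torch.randn(8, device="cuda:0")
out = comm.broadcast(src, devices=[0, 1])

# With the fix, the entry for devices[0] should alias the source tensor
# rather than being a fresh copy on the same device.
assert out[0].data_ptr() == src.data_ptr()
# The other device still receives a genuine copy with equal values.
assert out[1].device == torch.device("cuda:1")
assert torch.equal(out[1].cpu(), src.cpu())
```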
Conversation
Review comments on torch/csrc/utils/tensor_flatten.h (outdated; marked as off-topic).
@pytorchbot retest this please
Can I get a review on this, please?
@apaszke can you take a look at this when you have time?
Review comments on torch/csrc/cuda/comm.cpp (outdated; marked as off-topic).
Commit: …tential extra copy in flatten_dense_tensors
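A hedged Python sketch of the idea behind that commit (the actual change is in the C++ helper in torch/csrc/utils/tensor_flatten.h; the function name and fast path here are illustrative): flattening a single already-contiguous tensor can return a view that shares storage, avoiding the extra copy a concatenation would make.

```python
import torch

def flatten_dense_tensors_sketch(tensors):
    # Fast path: one tensor can be flattened as a view; contiguous()
    # returns the tensor itself when it is already contiguous, so no
    # data is copied.
    if len(tensors) == 1:
        return tensors[0].contiguous().view(-1)
    # General case: torch.cat materializes one new flat buffer.
    return torch.cat([t.contiguous().view(-1) for t in tensors])

t = torch.randn(3, 4)
flat = flatten_dense_tensors_sketch([t])
assert flat.data_ptr() == t.data_ptr()  # a view, not a copy
```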
Let me know how this looks to you. @apaszke
I notice there was no test added for this.
Adding one soon, @ezyang.
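A sketch of the kind of regression test that follow-up could add (the test name and structure are hypothetical; the core check mirrors the data_ptr comparison above):

```python
import unittest
import torch
from torch.cuda import comm

@unittest.skipIf(torch.cuda.device_count() < 2, "needs at least 2 GPUs")
class TestBroadcastNoCopy(unittest.TestCase):
    def test_broadcast_reuses_device0_tensor(self):
        src = torch.randn(10, device="cuda:0")
        results = comm.broadcast(src, devices=[0, 1])
        # The tensor already on devices[0] should be reused, not copied.
        self.assertEqual(results[0].data_ptr(), src.data_ptr())
        # Other devices still get real copies with the same values.
        self.assertTrue(torch.equal(results[1].cpu(), src.cpu()))

if __name__ == "__main__":
    unittest.main()
```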