Commit af4b406

Update on "[15/N] Add allreduce_coalesced custom op with CPU/CUDA implementations"
[ghstack-poisoned]
1 parent 34906c8 commit af4b406

1 file changed: +2 -3 lines changed

torch/csrc/distributed/c10d/OpsImpl.cpp

Lines changed: 2 additions & 3 deletions
@@ -394,14 +394,13 @@ TORCH_LIBRARY_IMPL(c10d, CUDA, m) {
 }
 
 TORCH_LIBRARY_IMPL(c10d, CPU, m) {
-  m.impl("allreduce_coalesced", allreduce_coalesced_cpu_);
+  m.impl("allreduce_coalesced_", allreduce_coalesced_cpu_);
 }
 
 TORCH_LIBRARY_IMPL(c10d, CUDA, m) {
-  m.impl("allreduce_coalesced", allreduce_coalesced_cuda_);
+  m.impl("allreduce_coalesced_", allreduce_coalesced_cuda_);
 }
 
-
 TORCH_LIBRARY_IMPL(c10d, CPU, m) {
   m.impl("allgather_", allgather_cpu_);
 }
