Commit f5df685
Enable channels_last_3d on SyncBatchNorm (#88401)
This PR enabled the use of fast channels_last kernels on SyncBatchNorm with channels_last_3d memory format.
With a small benchmark script here #88021 (comment), on V100, I got
master:
```
DDP channels_last=False, run_forward_backward, time: 0.8945400714874268 sec
DDP channels_last=True, run_forward_backward, time: 1.4736433029174805 sec
```
This PR:
```
DDP channels_last=False, run_forward_backward, time: 0.8927242755889893 sec
DDP channels_last=True, run_forward_backward, time: 0.48697471618652344 sec
```
This PR is a follow-up of #46906
Close #88021
Pull Request resolved: #88401
Approved by: https://github.com/ngimel1 parent 8023c9d commit f5df685
File tree
4 files changed
+26
-12
lines changed- aten/src/ATen/native/cuda
- test
- torch
- nn/modules
- testing/_internal/distributed
4 files changed
+26
-12
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
48 | 48 | | |
49 | 49 | | |
50 | 50 | | |
51 | | - | |
52 | | - | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
53 | 56 | | |
54 | 57 | | |
55 | 58 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10283 | 10283 | | |
10284 | 10284 | | |
10285 | 10285 | | |
10286 | | - | |
| 10286 | + | |
10287 | 10287 | | |
10288 | | - | |
10289 | | - | |
| 10288 | + | |
| 10289 | + | |
10290 | 10290 | | |
10291 | 10291 | | |
10292 | 10292 | | |
10293 | 10293 | | |
10294 | | - | |
10295 | | - | |
| 10294 | + | |
| 10295 | + | |
10296 | 10296 | | |
10297 | 10297 | | |
10298 | 10298 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
7 | 7 | | |
8 | 8 | | |
9 | 9 | | |
10 | | - | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
11 | 14 | | |
12 | 15 | | |
13 | 16 | | |
| |||
104 | 107 | | |
105 | 108 | | |
106 | 109 | | |
107 | | - | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
108 | 114 | | |
109 | 115 | | |
110 | 116 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
5324 | 5324 | | |
5325 | 5325 | | |
5326 | 5326 | | |
| 5327 | + | |
| 5328 | + | |
| 5329 | + | |
| 5330 | + | |
5327 | 5331 | | |
5328 | 5332 | | |
5329 | 5333 | | |
| |||
5336 | 5340 | | |
5337 | 5341 | | |
5338 | 5342 | | |
5339 | | - | |
| 5343 | + | |
| 5344 | + | |
5340 | 5345 | | |
5341 | | - | |
| 5346 | + | |
5342 | 5347 | | |
5343 | 5348 | | |
5344 | 5349 | | |
5345 | 5350 | | |
5346 | | - | |
| 5351 | + | |
5347 | 5352 | | |
5348 | 5353 | | |
5349 | 5354 | | |
| |||
0 commit comments