
Conversation

@zou3519 (Contributor) commented Feb 13, 2018

This is a tool intended for initial exploratory debugging of bottlenecks in user scripts. Run it with

python -m torch.utils.bottleneck /path/to/source/script.py

Internally it runs the script once under the Python profiler and once under the autograd profiler, and prints the top 15 entries sorted by CPU time (for both profilers).
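The Python-profiler half of this can be sketched with the standard library alone. This is an illustrative sketch of the general technique (profile, then report the top entries by time), not the tool's actual internals; `hot_loop` is a hypothetical stand-in for a user script's bottleneck:

```python
import cProfile
import io
import pstats

def hot_loop():
    # stand-in for the expensive part of a user script
    return sum(i * i for i in range(200_000))

prof = cProfile.Profile()
prof.enable()
result = hot_loop()
prof.disable()

stream = io.StringIO()
stats = pstats.Stats(prof, stream=stream)
stats.sort_stats("cumulative").print_stats(15)  # top 15 entries, as the tool reports
report = stream.getvalue()
```

The real tool additionally wraps the script run in `torch.autograd.profiler.profile` to get per-operator timings, which cProfile alone cannot see.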

Sample Output

cc @soumith

Test Plan

Really basic tests to check that the output of torch.utils.bottleneck isn't completely empty. Taking suggestions on how to write better tests.

Built the new docs page:
[screenshot of the new docs page]

@ezyang (Contributor) commented Feb 13, 2018

SO COOL! :D


@vadimkantorov (Contributor) commented:

Very cool! Wish list for monitoring stats: GPU util / CPU util / analyzing CPU-bound vs RAM-bound vs GPU-compute-bound vs GPU-memory-bound workloads / analysis of committed GPU memory.

@apaszke (Contributor) commented Feb 13, 2018

Nice tool! However, presenting only the output obtained under CUDA_LAUNCH_BLOCKING can be very misleading: it will show some ops as very costly even though they are the ones that allow latency to be hidden later. I think we should at least mention this in the output, or produce two lists (an additional one without CUDA_LAUNCH_BLOCKING).


@Evpok (Contributor) commented Feb 24, 2018

Would it be possible to allow running scripts with command-line arguments?

@zou3519 (Contributor, Author) commented Feb 26, 2018

@Evpok that sounds like a good idea. I'll look into it and put it into the next iteration of this.

zou3519 and others added 5 commits March 7, 2018 13:22
This is a tool that is intended to be used as initial exploratory
debugging of bottlenecks in user scripts. Run it with

    python -m torch.utils.bottleneck /path/to/source/script.py
@apaszke (Contributor) left a review comment:


Looks great. It would be good to expand on CUDA profiling a bit (because it's really very complicated), but should be good to go after that.

Due to the asynchronous nature of CUDA kernels, when running against
CUDA code, the cProfile output and CPU-mode autograd profilers may
not show correct timings. In this case, the CUDA-mode autograd
profiler is better at assigning blame to the relevant operator(s).
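The asynchrony caveat above can be illustrated without a GPU: when work is queued asynchronously, the launching call appears nearly free, and a later synchronization point absorbs the kernel's cost, so a CPU-side profiler blames the wrong operation. A minimal stdlib sketch, with a thread pool standing in for the CUDA stream (no real CUDA involved; all names hypothetical):

```python
import time
from concurrent.futures import ThreadPoolExecutor

def fake_kernel():
    # stands in for a long-running CUDA kernel
    time.sleep(0.2)

executor = ThreadPoolExecutor(max_workers=1)  # stands in for a CUDA stream

# "Launching" the kernel is asynchronous: submit() returns almost immediately,
# so a CPU timer around it records nearly zero cost.
t0 = time.perf_counter()
future = executor.submit(fake_kernel)
launch_time = time.perf_counter() - t0

# A later synchronization point (like cudaDeviceSynchronize) blocks until the
# work finishes and absorbs all the blame for the kernel's runtime.
t1 = time.perf_counter()
future.result()
sync_time = time.perf_counter() - t1

executor.shutdown()
```

This is why the CUDA-mode autograd profiler, which uses device-side events, attributes time to the operator that actually did the work.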


@ezyang ezyang merged commit feb2785 into pytorch:master Mar 23, 2018
sighingnow added a commit to sighingnow/pytorch that referenced this pull request Mar 25, 2018
* upstream/master: (663 commits)
  Fix "command not found" error in perf test (pytorch#5982)
  add pip mkl-devel to the error message when mkl is found but mkl headers are not (pytorch#5984)
  Support batch LowerCholeskyTransform (pytorch#5980)
  Linearly interpolating upsampling fix (pytorch#5927)
  Store perf numbers in S3 (pytorch#5951)
  Modidy setup docs for Windows (pytorch#5981)
  Group Normalization (pytorch#5968)
  [distributions] Implement Power transform (pytorch#5976)
  Disable TestBottleneck test_cuda on Windows (pytorch#5977)
  Fix crash when cat-ing empty cuda tensors (pytorch#5971)
  Update no_unions flag for nanopb gen and update ONNX proto files (pytorch#5972)
  Expose gradients w.r.t. input & weight for conv1d, conv2d, conv3d in Python (pytorch#5408)
  Fixed non-determinate preprocessing on DataLoader (pytorch#4640)
  add AVX2 implementation for sigmoid function (pytorch#5010)
  Implement torch.util.bottleneck (pytorch#5216)
  Remove pragma once from cpp file (pytorch#5965)
  fix mvn docs (pytorch#5967)
  Fix incorrect rendering of Tensor.index_*_ doc examples. (pytorch#5969)
  Implement range for loop in script (pytorch#5827)
  Add windows doc (pytorch#5859)
  ...

# Conflicts:
#	aten/src/TH/generic/THTensorMath.c
#	torch/_tensor_docs.py
#	torch/csrc/generic/methods/TensorCompare.cwrap

6 participants