Add binary to benchmark model load speed #74700

SS-JIA · 2022-03-24T20:13:14Z

Stack from ghstack:

#74700 Add binary to benchmark model load speed
#74769 [vulkan] Add option to trigger Vulkan context to load on application start
#74699 [vulkan] Remove unnecessary include in vulkan_api_test

Differential Revision: D35124881

[ghstack-poisoned]

ghstack-source-id: 927e467 Pull Request resolved: #74700

facebook-github-bot · 2022-03-24T20:13:26Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/74700

💊 CI failures summary and remediations

As of commit 1e13a90 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).


SS-JIA · 2022-03-24T20:20:21Z

@SS-JIA has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

ghstack-source-id: 0c072e2 Pull Request resolved: #74700

beback4u · 2022-03-24T23:49:07Z

binaries/load_benchmark_torch.cc

+#include <vector>
+
+#include <ATen/ATen.h>
+#include "caffe2/core/timer.h"


Can we re-organize the order of includes?

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA · 2022-03-25T20:10:54Z

@SS-JIA has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA · 2022-03-25T21:07:48Z

@SS-JIA has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

ghstack-source-id: 29acc17 Pull Request resolved: #74700

SS-JIA · 2022-03-25T21:22:48Z

@SS-JIA has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA · 2022-03-28T18:46:55Z

@SS-JIA has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

ghstack-source-id: e184a83 Pull Request resolved: #74700

SS-JIA · 2022-03-28T18:50:55Z

@SS-JIA has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

ghstack-source-id: 91691b1 Pull Request resolved: #74700

SS-JIA · 2022-03-28T22:01:42Z

@SS-JIA has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary:
Pull Request resolved: #74700

Test Plan:
Imported from OSS

Some results running this benchmark for a quantized CPU xirp14b model on a Pixel 5:

```
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "46749"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19261"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19235"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19396"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19486"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19562"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19566"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19559"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19632"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19938"}
```

Some results running this benchmark for the Vulkan xirp20a model on Pixel 5, after pre-loading the Context:

```
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "38664"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19921"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20316"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20255"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20219"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20329"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20463"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "21072"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20668"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20889"}
```

Without pre-loading Context:

```
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "70850"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "19867"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20211"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20039"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20082"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20268"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20363"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "21103"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20511"}
PyTorchObserver {"type": "NET", "unit": "us", "metric": "latency", "value": "20528"}
```

Reviewed By: mrshenli

Differential Revision: D35124881

Pulled By: SS-JIA

fbshipit-source-id: 0f093e4aa45d69c538a4fe2003e0d5617d72b97a
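To make the three runs in the test plan easier to compare, here is a small sketch that averages the repeat loads and measures how much longer the first load took. It assumes each `PyTorchObserver` line is one load, listed in order, so the first value is the cold load. The names `warm_mean_us`, `first_load_overhead_us`, and `print_summary` are illustrative and not part of this PR.

```cpp
#include <array>
#include <cstdio>
#include <numeric>

// Latencies in microseconds, copied from the test plan.
// Assumption: index 0 is the first (cold) load, the other nine are repeat loads.
using Samples = std::array<long, 10>;

constexpr Samples kCpuQuantized = {46749, 19261, 19235, 19396, 19486,
                                   19562, 19566, 19559, 19632, 19938};
constexpr Samples kVulkanPreloaded = {38664, 19921, 20316, 20255, 20219,
                                      20329, 20463, 21072, 20668, 20889};
constexpr Samples kVulkanNoPreload = {70850, 19867, 20211, 20039, 20082,
                                      20268, 20363, 21103, 20511, 20528};

// Mean of the repeat loads (every sample after the first).
double warm_mean_us(const Samples& s) {
  const long sum = std::accumulate(s.begin() + 1, s.end(), 0L);
  return static_cast<double>(sum) / static_cast<double>(s.size() - 1);
}

// Extra time the first load took compared with the repeat-load mean.
double first_load_overhead_us(const Samples& s) {
  return static_cast<double>(s[0]) - warm_mean_us(s);
}

// Print a one-line summary for one configuration.
void print_summary(const char* name, const Samples& s) {
  std::printf("%-18s first=%ld us  warm mean=%.0f us  overhead=%.0f us\n",
              name, s[0], warm_mean_us(s), first_load_overhead_us(s));
}
```

Calling `print_summary` once per array from a `main` gives the comparison. By this arithmetic the repeat-load means are about 19515 us (quantized CPU), 20459 us (Vulkan, Context pre-loaded), and 20330 us (Vulkan, not pre-loaded), and the first Vulkan load is 70850 - 38664 = 32186 us shorter when the Context is pre-loaded.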

(cherry picked from commit 96f9914)
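For readers unfamiliar with the output format, the following is a minimal sketch of a load-timing loop that prints lines shaped like the `PyTorchObserver` output in the test plan. It is not the code from `binaries/load_benchmark_torch.cc`: the diff shown in review includes `caffe2/core/timer.h`, whereas this sketch swaps in `std::chrono::steady_clock` so that it has no PyTorch dependency. The `load` callback is a hypothetical stand-in for whatever loads the model, and the function names are illustrative.

```cpp
#include <chrono>
#include <cstddef>
#include <cstdio>
#include <functional>
#include <string>
#include <vector>

// Format one latency sample like the lines shown in the test plan.
std::string observer_line(long long latency_us) {
  return "PyTorchObserver {\"type\": \"NET\", \"unit\": \"us\", "
         "\"metric\": \"latency\", \"value\": \"" +
         std::to_string(latency_us) + "\"}";
}

// Call `load` `iters` times and return each call's wall-clock time in
// microseconds. `load` is a hypothetical stand-in for the model load.
std::vector<long long> time_loads(const std::function<void()>& load, int iters) {
  std::vector<long long> samples;
  if (iters > 0) {
    samples.reserve(static_cast<std::size_t>(iters));
  }
  for (int i = 0; i < iters; ++i) {
    const auto start = std::chrono::steady_clock::now();
    load();
    const auto stop = std::chrono::steady_clock::now();
    samples.push_back(
        std::chrono::duration_cast<std::chrono::microseconds>(stop - start)
            .count());
  }
  return samples;
}

// Time the loads, then print one line per iteration.
void run_benchmark(const std::function<void()>& load, int iters) {
  for (long long us : time_loads(load, iters)) {
    std::puts(observer_line(us).c_str());
  }
}
```

For example, `run_benchmark([] { /* load the model here */ }, 10);` would print ten lines in that format. Timing every iteration separately, rather than reporting one average, is what lets a slow first load stand out from the later ones.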

github-actions · 2022-03-30T20:23:53Z

Hey @SS-JIA.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

ghstack-source-id: c34962a Pull Request resolved: pytorch/pytorch#74700

Add binary to benchmark model load speed

78afb6e

[ghstack-poisoned]

This was referenced Mar 24, 2022

[vulkan] Refactor Vulkan Runtime and Adapter #74698

Closed

[vulkan] Remove unnecessary include in vulkan_api_test #74699

Closed

facebook-github-bot added the cla signed label Mar 24, 2022

SS-JIA added a commit that referenced this pull request Mar 24, 2022

Add binary to benchmark model load speed

90e0e63

ghstack-source-id: 927e467 Pull Request resolved: #74700

Update on "Add binary to benchmark model load speed"

66568f3

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

Update on "Add binary to benchmark model load speed"

89935cb

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA pushed a commit that referenced this pull request Mar 24, 2022

Add binary to benchmark model load speed

616aa96

ghstack-source-id: 0c072e2 Pull Request resolved: #74700

SS-JIA requested review from beback4u and kimishpatel March 24, 2022 23:28

beback4u approved these changes Mar 24, 2022


Update on "Add binary to benchmark model load speed"

cdc5502

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA mentioned this pull request Mar 25, 2022

[vulkan] Add option to trigger Vulkan context to load on application start #74769

Closed

Sicheng Jia added 2 commits March 25, 2022 15:40

Update on "Add binary to benchmark model load speed"

5e601f4

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

Update on "Add binary to benchmark model load speed"

fb38232

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

Update on "Add binary to benchmark model load speed"

f3bbaa5

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

Update on "Add binary to benchmark model load speed"

1802372

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA pushed a commit that referenced this pull request Mar 25, 2022

Add binary to benchmark model load speed

6cc7aad

ghstack-source-id: 29acc17 Pull Request resolved: #74700

Update on "Add binary to benchmark model load speed"

0460351

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

Update on "Add binary to benchmark model load speed"

c159116

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA pushed a commit that referenced this pull request Mar 28, 2022

Add binary to benchmark model load speed

9f0d3a4

ghstack-source-id: e184a83 Pull Request resolved: #74700

Update on "Add binary to benchmark model load speed"

b1cd1e3

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA mentioned this pull request Mar 28, 2022

[test] experiment without pre-loading context #74862

Closed

Update on "Add binary to benchmark model load speed"

1e13a90

Differential Revision: [D35124881](https://our.internmc.facebook.com/intern/diff/D35124881) [ghstack-poisoned]

SS-JIA pushed a commit that referenced this pull request Mar 28, 2022

Add binary to benchmark model load speed

1eeb42a

ghstack-source-id: 91691b1 Pull Request resolved: #74700

pytorchmergebot closed this Mar 30, 2022

NesrineMHB pushed a commit to NesrineMHB/pytorch that referenced this pull request Apr 7, 2022

Add binary to benchmark model load speed

2ac038a

ghstack-source-id: c34962a Pull Request resolved: pytorch/pytorch#74700

facebook-github-bot deleted the gh/SS-JIA/46/head branch April 30, 2022 14:17

WBobby mentioned this pull request Aug 17, 2022

Add ROCm5.2.3/AMDGPU support for PyTorch WBobby/pytorch#2

Closed
