Simplify copy kernel #28352

zasdfgbnm · 2019-10-21T03:46:39Z

Stack from ghstack:

- #28352 Simplify copy kernel
- #28344 Make TensorIterator stop promoting types by copying
- #28343 Move type casting to c10/util/TypeCast.h

Using the new type promotion and dynamic casting added to
TensorIterator, the copy kernels could be greatly simplified.
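To illustrate the general idea of dynamic casting, here is a conceptual sketch in pure Python. It is not PyTorch's implementation: the `CASTS` table, the dtype names, and `copy_with_cast` are hypothetical names chosen for this example. The point is only that one generic copy loop can pick the element conversion at runtime instead of needing a specialized kernel for every (source, destination) dtype pair.

```python
from array import array

# Hypothetical lookup table: dtype name -> (array typecode, per-element cast).
# The cast is selected at runtime, so one loop serves every dtype pair.
CASTS = {
    "int": ("i", int),
    "double": ("d", float),
    "uint8": ("B", lambda v: int(v) & 0xFF),  # wrap into 0..255
}


def copy_with_cast(src, dst_dtype):
    """Copy `src` into a new array of `dst_dtype`, casting element by element."""
    typecode, cast = CASTS[dst_dtype]
    return array(typecode, (cast(v) for v in src))


src = array("d", [1.5, 2.0, 300.25])
print(copy_with_cast(src, "int"))
print(copy_with_cast(src, "uint8"))
```

The same function handles any source array, because the conversion is looked up per call rather than baked into the loop.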

Script:

import torch
import timeit
import pandas
import itertools
from tqdm import tqdm
import math
print(torch.__version__)
print()

_10M = 10 * 1024 ** 2

d = {}

for from_, to in tqdm(itertools.product(torch.testing.get_all_dtypes(), repeat=2)):
    if from_ not in d:
        d[from_] = {}
    a = torch.zeros(_10M, dtype=from_)
    min_ = math.inf
    for i in range(100):
        start = timeit.default_timer()
        a.to(to)
        end = timeit.default_timer()
        elapsed = end - start
        if elapsed < min_:
            min_ = elapsed
    # Note: this stores the last measurement, not min_ (corrected in the follow-up comment).
    d[from_][to] = int(elapsed * 1000 * 1000)

pandas.DataFrame(d)

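Before:

![image](https://user-images.githubusercontent.com/1032377/67171274-2e93d000-f36b-11e9-8fa0-91edd7dbc8ec.png)

After:

![image](https://user-images.githubusercontent.com/1032377/67171200-d361dd80-f36a-11e9-9b22-66292e395a09.png)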

zasdfgbnm · 2019-10-21T20:41:51Z

Sorry, there was a bug in my benchmark code: it reported the last measurement instead of the minimum time. Here are the fixed script and results:

script:

import torch
import timeit
import pandas
import itertools
from tqdm.notebook import tqdm
import math
print(torch.__version__)
print()

_10M = 10 * 1024 ** 2  # number of elements in each test tensor

d = {}  # d[from_][to] = fastest conversion time in microseconds

for from_, to in tqdm(itertools.product(torch.testing.get_all_dtypes(), repeat=2)):
    if from_ not in d:
        d[from_] = {}
    a = torch.zeros(_10M, dtype=from_)
    min_ = math.inf
    # Time 100 conversions and keep only the fastest one.
    for i in range(100):
        start = timeit.default_timer()
        a.to(to)
        end = timeit.default_timer()
        elapsed = end - start
        if elapsed < min_:
            min_ = elapsed
    d[from_][to] = int(min_ * 1000 * 1000)  # seconds -> microseconds

pandas.DataFrame(d)
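As a side note, the "keep the minimum of repeated timings" pattern can also be written with the standard library's `timeit.repeat`, which makes it harder to report the wrong variable by accident. This is only a sketch: `min_time_us` is a hypothetical helper name, and the list comprehension is a stand-in workload so the snippet runs without torch. In the benchmark, the callable would be something like `lambda: a.to(to)`.

```python
import timeit


def min_time_us(fn, repeat=100):
    """Return the fastest of `repeat` single-call timings of `fn`, in whole microseconds."""
    # number=1 times each call on its own; repeat controls how many samples are taken.
    timings = timeit.repeat(fn, repeat=repeat, number=1)
    return int(min(timings) * 1000 * 1000)


# Stand-in workload (assumption for this example, not part of the benchmark).
data = list(range(100_000))
print(min_time_us(lambda: [float(x) for x in data], repeat=20))
```

Taking the minimum rather than the mean or the last sample reduces the influence of one-off noise such as scheduler interruptions.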

before:

after:

Using the new type promotion and dynamic casting added to `TensorIterator`, the copy kernels could be greatly simplified. For benchmark, see #28352 (comment) [ghstack-poisoned]

This was referenced Oct 21, 2019

Move type casting to c10/util/TypeCast.h #28343

Merged

Make TensorIterator stop promoting types by copying #28344

Merged

zasdfgbnm requested a review from colesbury October 21, 2019 03:48

zasdfgbnm added the better-engineering and module: operators labels Oct 21, 2019

soumith requested review from VitalyFedyunin and ngimel October 21, 2019 21:24

zasdfgbnm merged commit d3cbbca into gh/zasdfgbnm/10/base Oct 22, 2019

zasdfgbnm mentioned this pull request Oct 22, 2019

Simplify copy kernel #28428

Closed

zasdfgbnm added a commit that referenced this pull request Oct 22, 2019

Update on "Simplify copy kernel"

0cf634c

zasdfgbnm added a commit that referenced this pull request Oct 23, 2019

Update on "Simplify copy kernel"

e84ac97

zasdfgbnm added a commit that referenced this pull request Oct 24, 2019

Update on "Simplify copy kernel"

5fc3a07

zasdfgbnm added a commit that referenced this pull request Oct 24, 2019

Update on "Simplify copy kernel"

1a1c3f8

zasdfgbnm added a commit that referenced this pull request Oct 25, 2019

Update on "Simplify copy kernel"

44b398e

zasdfgbnm added a commit that referenced this pull request Oct 25, 2019

Update on "Simplify copy kernel"

dd612d0

zasdfgbnm added a commit that referenced this pull request Oct 26, 2019

Update on "Simplify copy kernel"

25e7b33

zasdfgbnm added a commit that referenced this pull request Oct 26, 2019

Update on "Simplify copy kernel"

27ecdf9
