Conversation

@goldsborough (Contributor) commented May 26, 2018

This PR is a prototype / WIP for a broad change to the way tensors are created in ATen. The fundamental motivation is to make tensor creation from C++ more like in Python. That means changing the API of factory functions like at::ones, and teaching ATen about construction axes beyond the backend and scalar type, especially the device. This change essentially lays the groundwork for how tensor creation will happen in c10, and thus defines a new API, currently bolted on top of existing ATen.

This PR proposes the following:

Creation of a type encapsulating all construction axes of a tensor

At the moment, this type will contain:

  • A scalar type,
  • A backend,

And in the near future shall contain (and already does somewhat in this prototype):

  • A device ordinal,
  • A layout (strided, coo_sparse, etc.),
  • A requires_grad boolean.

in code:

struct TensorOptions {
  size_t device = 0;
  Layout layout = kStrided;
  ScalarType dtype = kFloat;
  Backend backend = kCPU;
  bool requires_grad = false;
};

Each of these fields has a default value, such that a default-constructed TensorOptions object fully specifies a new tensor.

Furthermore, this class shall have chaining methods:

TensorOptions().device(1).dtype(kInt)

and there shall be free functions, named after the fields, that return a new TensorOptions instance:

// dtype is a free function that returns a new TensorOptions
auto options = dtype(kFloat).requires_grad(true); 

Use of TensorOptions in all factory methods

The new API for tensor creation shall then be (e.g. for ones):

Tensor ones(IntList size, TensorOptions options = {}) { }

with usage examples such as

at::ones({2, 2}); // float cpu tensor
at::ones({2, 2}, kInt); // int cpu tensor
at::ones({2, 2}, type(kInt).requires_grad(true)); // int cpu tensor that sets requires_grad for variables

New code path

This PR also implements a new code path for factory methods. As a reminder, at the moment we create tensors via, e.g.,

at::ones(at::CPU(at::kFloat), {3, 3})

which dispatches to

at::CPU(at::kFloat).ones({3, 3});

which itself calls

at::native::ones(at::CPU(at::kFloat), {3, 3})

which then calls

auto tensor = at::CPU(at::kFloat).tensor({3, 3});
tensor.fill_(1);

This PR proposes to remove factory methods from Type and to skip dispatch via Type altogether for all factory methods except tensor, which remains the fundamental source of tensors invoked by all other tensor factories:

at::ones({3, 3}, at::type(at::kFloat).device(at::CPU))

now directly calls

at::native::ones({3, 3}, at::type(at::kFloat).device(at::CPU))

which now does

// `TensorOptions::type()` returns the type corresponding to its `backend` and `scalar_type`
auto tensor = tensor_options.type().tensor({3, 3});
tensor.fill_(1);

At the moment, I have implemented this for ones, and I think nothing broke. There should be no implications for the Python API.

@zdevito @apaszke @gchanan @colesbury @ezyang

Fixes #6285
Fixes #6286
Fixes #7735

@ezyang (Contributor) commented May 30, 2018

In PyTorch, there is a context manager which can be used to change the default tensor options, e.g., changing everything from float to double. Do you plan to eventually support this? If so, with the design as written here you are committing to these values being read at TensorOptions construction time (since these are direct values, not optionals).

@goldsborough (Contributor, Author)

That's a very good point. I think it would be easy to implement with thread local globals and a RAII mechanism to set defaults in a particular scope. I would personally think it's fine that values are fixed at the call site (where you write at::ones(...)). Do you foresee a need to defer the point in time when values are fixed?

@gchanan (Contributor) commented May 30, 2018

Why does TensorOptions have backend and integral device? Why not have device objects (like python) and get rid of backend (which can be inferred from device and layout)?

@goldsborough (Contributor, Author)

That's coming.

@goldsborough goldsborough force-pushed the tensor-options branch 7 times, most recently from 1aec84c to 1b6813b Compare June 4, 2018 00:55
@goldsborough goldsborough force-pushed the tensor-options branch 4 times, most recently from 755fa3e to 4e3cb37 Compare June 5, 2018 07:23


@ezyang (Contributor) left a comment:

ship it already yo

@zdevito (Contributor) left a comment:

There are some details about TensorOptions, like the underlying Type* field, that we might want to figure out better solutions for. However, this PR is getting too big, so assuming there are no known bugs, I think we should merge and revisit those nits afterwards.

@goldsborough goldsborough force-pushed the tensor-options branch 7 times, most recently from bb97ef5 to 87159de Compare June 16, 2018 00:31
Storing the type in TensorOptions to solve the Variable problem

Created convenience creation functions for TensorOptions and added tests

Converted zeros to TensorOptions

Converted rand to TensorOptions

Fix codegen for TensorOptions and multiple arguments

Put TensorOptions convenience functions into torch namespace too

All factory functions except *_like support TensorOptions

Integrated with recent JIT changes

Support *_like functions

Fix in place modification

Some cleanups and fixes

Support sparse_coo_tensor

Fix bug in Type.cpp

Fix .empty calls in C++ API

Fix bug in Type.cpp

Trying to fix device placement

Make AutoGPU CPU compatible

Remove some auto_gpu.h uses

Fixing some headers

Fix some remaining CUDA/AutoGPU issues

Fix some AutoGPU uses

Fixes to dispatch_tensor_conversion

Reset version of new variables to zero

Implemented parsing device strings

Random fixes to tests

Self review cleanups

flake8

Undo changes to variable.{h,cpp} because they fail on gcc7.2

Add [cuda] tag to tensor_options_cuda.cpp

Move AutoGPU::set_index_from into .cpp file because Windows is stupid and sucks

Fix linker error in AutoGPU.cpp

Fix bad merge conflict in native_functions.yaml

Fixed caffe2/contrib/aten

Fix new window functions added to TensorFactories.cpp

Added code to generate wrapper functions for factory methods

Add implicit constructor from Backend to TensorOptions

Remove Var() from C++ API and use torch:: functions

Use torch:: functions more subtly in C++ API

Make AutoGPU::set_device more exception safe

Check status directly in DynamicCUDAHooksInterface

Rename AutoGPU to DeviceGuard

Removed set_requires_grad from python_variables.h and warn appropriately in Variable::set_requires_grad

remove python_default_init: self.type()

Add back original factory functions, but with deprecation warnings

Disable DeviceGuard for a couple functions in ATen

Remove print statement

Fix DeviceGuard construction from undefined tensor

Fixing CUDA device compiler issues

Moved as many methods as possible into header files

Dont generate python functions for deprecated factories

Remove merge conflict artefact

Fix tensor_options_cuda.cpp

Fix set_requires_grad not being checked

Fix tensor_new.h

TEMPORARILY put some methods in .cpp files to see if it solves issues on windows and mac

Fix bug in DeviceGuard.h

Missing includes

TEMPORARILY moving a few more methods into .cpp to see if it fixes windows

Fixing linker errors

Undo device agnostic behavior of DeviceGuard

Use -1 instead of optional for default device index

Also move DeviceGuard methods into header

Fixes around device index after optional -> int32_t switch

Fix use of DeviceGuard in new_with_tensor_copy

Fix tensor_options.cpp

@goldsborough goldsborough merged commit 372d1d6 into pytorch:master Jun 16, 2018
@goldsborough goldsborough deleted the tensor-options branch June 16, 2018 07:40



8 participants