Grid sampler: nearest interpolation & reflection padding #10051

ssnl · 2018-07-31T03:31:28Z

closes #9702.

Commit structure:

Change the index calculation logic. I will explain using 1-D for simplicity.

Previously we had (in pseudocode):
```
// 1. get the float locations from grid
scalar_t x = from_grid()

// 2. find the integral surrounding indices
int x_left = floor(x)
int x_right = x_left + 1

// 3. calculate the linear interpolate weights
scalar_t w_left = x_right - x
scalar_t w_right = x - x_left

// 4. manipulate the integral surrounding indices if needed
// (e.g., clip for border padding_mode)
x_left = manipulate(x_left, padding_mode)
x_right = manipulate(x_right, padding_mode)

// 5. interpolate
output_val = interpolate(w_left, w_right, x_left, x_right)
```
This is actually incorrect (and also unintuitive) because it calculates the
weights before manipulating out-of-boundary indices. Fortunately, this
doesn't manifest in either of the currently supported padding modes, 'zeros'
and 'border':
- 'zeros': doesn't clip
- 'border': clips, but for out-of-bound x both x_left and x_right are
  clipped to the same value, so the weights don't matter

But this is a problem with reflection padding, since each time we reflect,
the values of w_left and w_right should be swapped.

So in this commit I change the algorithm to the following (numbers correspond
to the ordering in the pseudocode above):
```
1. get float location
4. manipulate (e.g., clip) the float location
2. find the integral surrounding indices
3. calculate the linear interpolate weights
```
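The reordered steps can be sketched in 1-D roughly as follows. This is only an illustration, not the code in GridSampler.cpp: `sample_linear_1d` and `clip_coordinate` are hypothetical names, `double` stands in for `scalar_t`, only 'border'-style clipping is shown, and a neighbouring index that falls outside the input is assumed to contribute zero.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical helper: clamp a float location to [0, size - 1] ('border' style).
inline double clip_coordinate(double x, int64_t size) {
  return std::min(static_cast<double>(size - 1), std::max(x, 0.0));
}

// Hypothetical 1-D linear sampler using the new ordering (1, 4, 2, 3, 5).
inline double sample_linear_1d(const std::vector<double>& data, double x) {
  const int64_t size = static_cast<int64_t>(data.size());

  // 4. manipulate the float location first
  x = clip_coordinate(x, size);

  // 2. find the integral surrounding indices
  const int64_t x_left = static_cast<int64_t>(std::floor(x));
  const int64_t x_right = x_left + 1;

  // 3. compute the weights from the already-manipulated location
  const double w_left = static_cast<double>(x_right) - x;
  const double w_right = x - static_cast<double>(x_left);

  // 5. interpolate; an index outside [0, size) contributes nothing (assumption)
  auto at = [&](int64_t i) { return (i >= 0 && i < size) ? data[i] : 0.0; };
  return w_left * at(x_left) + w_right * at(x_right);
}
```

Because the weights are derived from the manipulated location, the indices never need to be touched again after step 2.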
In the backward pass, because of this change, I need to add new variables to
track d manipulate_output / d manipulate_input, which is basically a multiplier
on the gradient calculated for grid. Benchmarking shows no obvious slowdown
from this addition.
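In 1-D the multiplier is just the chain rule through the coordinate manipulation. A rough sketch, assuming linear interpolation and that the caller already has the manipulated location and its multiplier; `grid_grad_1d` and its parameters are hypothetical names, not identifiers from this PR:

```cpp
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical 1-D backward for linear interpolation.
//   x_manip   : location after clipping/reflection (assumed in [0, size - 1])
//   grad_mult : d manipulate_output / d manipulate_input at this location
//   grad_out  : upstream gradient, d loss / d output_val
// Returns d loss / d x for the original, unmanipulated grid location.
inline double grid_grad_1d(const std::vector<double>& data, double x_manip,
                           double grad_mult, double grad_out) {
  const int64_t size = static_cast<int64_t>(data.size());
  const int64_t x_left = static_cast<int64_t>(std::floor(x_manip));
  const int64_t x_right = x_left + 1;
  // Out-of-range neighbours are assumed to contribute zero.
  auto at = [&](int64_t i) { return (i >= 0 && i < size) ? data[i] : 0.0; };

  // output_val = (x_right - x) * data[x_left] + (x - x_left) * data[x_right],
  // so d output_val / d x_manip = data[x_right] - data[x_left].
  const double d_out_d_manip = at(x_right) - at(x_left);

  // Chain rule: scale by the multiplier from the manipulation step.
  return grad_out * d_out_d_manip * grad_mult;
}
```

With a multiplier of 1 this reduces to the usual gradient; 0 kills the gradient and -1 flips its sign.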

Implement reflection padding. Indices are reflected repeatedly until they
fall within the boundary.
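Repeated reflection can be written without a loop by counting how many times the location crosses a boundary. The sketch below assumes the mirrors sit at the first and last valid indices, 0 and size - 1; that choice, and the name `reflect_coordinate`, are assumptions for illustration rather than something stated in this description.

```cpp
#include <cmath>
#include <cstdint>

// Hypothetical helper: reflect a float location into [0, size - 1],
// assuming the mirrors are at 0 and size - 1.
inline double reflect_coordinate(double x, int64_t size) {
  if (size <= 1) {
    return 0.0;  // a single element: every location maps to index 0
  }
  const double span = static_cast<double>(size - 1);
  x = std::fabs(x);                         // reflect about 0
  const double extra = std::fmod(x, span);  // offset past the last crossing
  const int64_t flips = static_cast<int64_t>(std::floor(x / span));
  // An even number of crossings keeps the direction; an odd number reverses it.
  return (flips % 2 == 0) ? extra : span - extra;
}
```

For size 4, a location of 3.5 lands on 2.5 (one reflection about 3), and 6.5 lands on 0.5 (reflected about 3, then about 0).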

Add variants of clip_coordinates and reflect_coordinates for use in the
backward pass. E.g.,

// clip_coordinates_set_grad works similarly to clip_coordinates except that
// it also returns the `d output / d input` via pointer argument `grad_in`.
// This is useful in the backward pass of grid_sampler.
scalar_t clip_coordinates_set_grad(scalar_t in, int64_t clip_limit, scalar_t *grad_in)

For example, if `in` is clipped in 'border' mode, `grad_in` is set to 0.
If `in` is reflected an odd number of times in 'reflection' mode, `grad_in`
is set to -1.
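A sketch of how such variants might behave. Only the signature of clip_coordinates_set_grad is given above; the function bodies here are assumptions, `double` stands in for `scalar_t`, the name and signature of `reflect_coordinates_set_grad` are assumed by analogy, and reflection is assumed to happen about 0 and clip_limit - 1 (with clip_limit >= 1).

```cpp
#include <cmath>
#include <cstdint>

// Sketch: clip to [0, clip_limit - 1] and report d output / d input.
inline double clip_coordinates_set_grad(double in, int64_t clip_limit,
                                        double* grad_in) {
  const double max = static_cast<double>(clip_limit - 1);
  if (in < 0.0) {
    *grad_in = 0.0;  // clipped: the output no longer depends on the input
    return 0.0;
  }
  if (in > max) {
    *grad_in = 0.0;
    return max;
  }
  *grad_in = 1.0;    // untouched: identity
  return in;
}

// Assumed name and signature, by analogy with the clip variant.
inline double reflect_coordinates_set_grad(double in, int64_t clip_limit,
                                           double* grad_in) {
  if (clip_limit <= 1) {
    *grad_in = 0.0;  // every location maps to index 0
    return 0.0;
  }
  double sign = 1.0;
  if (in < 0.0) {    // the reflection about 0 counts as one flip
    sign = -1.0;
    in = -in;
  }
  const double max = static_cast<double>(clip_limit - 1);
  const double extra = std::fmod(in, max);
  const int64_t flips = static_cast<int64_t>(std::floor(in / max));
  if (flips % 2 == 0) {
    *grad_in = sign;
    return extra;
  }
  *grad_in = -sign;  // one more flip about clip_limit - 1
  return max - extra;
}
```

In this sketch the sign of `grad_in` tracks the parity of the total number of reflections, so an even count gives 1 and an odd count gives -1.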

Implement nearest interpolation.
Add test cases
Add better input checking
As discussed with @goldsborough, move operator<< of at::Device,
at::DeviceType and at::Layout into the at namespace. (Otherwise
AT_CHECK can't find them.)
Support empty tensors. cc @gchanan
- Make empty tensors unacceptable to cudnn.
- Add AT_ASSERT(kernel block size > 0) if using GET_BLOCKS
- Cache numel in TensorGeometry
  I was going to use numel to test whether the cudnn descriptor should accept
  a tensor, but ended up not using it. I can revert this if needed.
Add more test cases, including on input checking and empty tensors
Remove an obsolete comment
Update docs. Manually tested by generating docs.

aten/src/ATen/native/GridSampler.cpp

facebook-github-bot