Use integer math to compute output size of pooling operations #14405

f0k · 2018-11-27T09:58:09Z

As reported in #13386, the pooling operations can return wrong results for large inputs. The root of the problem is that while the output shape is initially being computed with integer operations, it is converted to float32 for division by the stride and applying either a ceil or a floor depending on the ceil_mode. Since even moderately large integers (the smallest being 16,777,217) cannot be expressed exactly in float32, this leads to wrong result shapes.

This PR relies purely on integer operations to perform the shape computation, including the ceil/floor distinction. Since I could not stand all that duplicated code, I pulled it out into a pooling_shape.h header, similar to the existing linear_upsampling.h header. I hope this is acceptable, let me know if you'd like to see it solved differently. I've also added tests to test_nn.py that fail without my changes and pass with my changes. They cover {max,avg}_pool{1,2,3}d() for CPU and GPU.

Fixes #13386.

…ions

soumith · 2018-11-27T14:22:58Z

looks pretty good, thanks @f0k !

facebook-github-bot

@soumith has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: As reported in #13386, the pooling operations can return wrong results for large inputs. The root of the problem is that while the output shape is initially being computed with integer operations, it is converted to float32 for division by the stride and applying either a `ceil` or a `floor` depending on the `ceil_mode`. Since even moderately large integers (the smallest being 16,777,217) cannot be expressed exactly in float32, this leads to wrong result shapes. This PR relies purely on integer operations to perform the shape computation, including the ceil/floor distinction. Since I could not stand all that duplicated code, I pulled it out into a `pooling_shape.h` header, similar to the existing `linear_upsampling.h` header. I hope this is acceptable, let me know if you'd like to see it solved differently. I've also added tests to `test_nn.py` that fail without my changes and pass with my changes. They cover `{max,avg}_pool{1,2,3}d()` for CPU and GPU. Fixes #13386. Pull Request resolved: pytorch/pytorch#14405 Differential Revision: D13215260 Pulled By: soumith fbshipit-source-id: 802588ce6cba8db6c346448c3b3c0dac14d12b2d

f0k force-pushed the fix-large-pooling-size branch from 78109d6 to 72852f6 Compare November 27, 2018 10:12

f0k added 2 commits November 27, 2018 12:49

Use integer math to compute output size of pooling operations

f942a89

Use integer min/max instead of fminf/fmaxf in pooling window computat…

81a436e

…ions

f0k force-pushed the fix-large-pooling-size branch from 72852f6 to 81a436e Compare November 27, 2018 13:35

soumith approved these changes Nov 27, 2018

View reviewed changes

facebook-github-bot reviewed Nov 27, 2018

View reviewed changes

facebook-github-bot closed this in c19af59 Nov 27, 2018

ezyang added open source merged labels Jun 24, 2019

f0k deleted the fix-large-pooling-size branch June 27, 2019 09:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use integer math to compute output size of pooling operations #14405

Use integer math to compute output size of pooling operations #14405

Uh oh!

f0k commented Nov 27, 2018

Uh oh!

soumith commented Nov 27, 2018

Uh oh!

facebook-github-bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Use integer math to compute output size of pooling operations #14405

Use integer math to compute output size of pooling operations #14405

Uh oh!

Conversation

f0k commented Nov 27, 2018

Uh oh!

soumith commented Nov 27, 2018

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants