Conversation

@csarofeen
Contributor

Batch norm layer that keeps its parameters in 32-bit precision when using 16-bit input/output. This layer is necessary for successful pseudo-fp16 training.
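
A minimal sketch of the idea in plain PyTorch, for illustration only (the module name and the upcast/downcast approach are assumptions of mine; this PR instead wires fp32 accumulation tensors through the cuDNN backend):

```python
import torch
import torch.nn as nn

class BatchNorm2dFP32Params(nn.BatchNorm2d):
    """Hypothetical analogue: fp16 input/output, fp32 parameters and stats.

    nn.BatchNorm2d keeps weight/bias/running stats in fp32 by default; we
    only upcast the fp16 input so the normalization accumulates in fp32,
    then cast the result back to the input's dtype.
    """

    def forward(self, x):
        return super().forward(x.float()).to(x.dtype)

# fp16 activations flow through; parameters and running stats stay fp32.
bn = BatchNorm2dFP32Params(16)
out = bn(torch.randn(8, 16, 32, 32, dtype=torch.float16))
assert out.dtype == torch.float16 and bn.weight.dtype == torch.float32
```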

@soumith
Contributor

Thanks for introducing nn.contrib. A long-standing, needed change.

@ngimel
Collaborator

ngimel commented Aug 29, 2017

It generates unused-variable warnings, because now all cuDNN routines call PyObject* accumTensorClass = getAccumTensorClass(args); even though only batch norm uses a differently typed accumulation tensor. I could not figure out how to get around it; I would appreciate pointers on how to do that in cwrap.

@csarofeen
Contributor Author

Removed the BN layer in torch/nn; left the backend in place.


@apaszke
Contributor

apaszke commented Nov 10, 2017

I'm not sure what's going on in this change. It changes the BatchNorm code path in C++, but the stats will still be half on the Python side? Also, it's going to conflict with @ezyang's changes (to no longer use THVoidTensor).

@soumith
Contributor

soumith commented Nov 10, 2017

It's a rebased branch to get Zach going with Volta fp16.
After Ed's changes all go in, this can be rebased on top.

@apaszke
Contributor

apaszke commented Nov 10, 2017

Yeah, but this is lacking the Python changes. I can't see how it will not error out when you try to use BatchNorm.

@ngimel
Collaborator

ngimel commented Nov 10, 2017

@apaszke, it does not touch the current batch norm in any way; it just allows calling cuDNN batch norm with fp16 inputs from a custom Python module. It will have to be changed after Ed's changes go in.
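
Concretely, the enabled pattern looks roughly like the following (a hedged sketch written against today's torch.batch_norm ATen entry point, which postdates this PR; the exact backend call here differed):

```python
import torch

# Hypothetical sketch: fp16 activations with fp32 affine parameters and
# running statistics, dispatched to the cuDNN batch norm kernel.
x = torch.randn(8, 16, 32, 32, device="cuda", dtype=torch.float16)
weight = torch.ones(16, device="cuda")         # fp32
bias = torch.zeros(16, device="cuda")          # fp32
running_mean = torch.zeros(16, device="cuda")  # fp32, updated in place
running_var = torch.ones(16, device="cuda")    # fp32, updated in place

out = torch.batch_norm(x, weight, bias, running_mean, running_var,
                       True,   # training: update running stats
                       0.1,    # momentum
                       1e-5,   # eps
                       True)   # cudnn_enabled: prefer the cuDNN path
assert out.dtype == torch.float16
```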

csarofeen added a commit to csarofeen/examples that referenced this pull request Nov 10, 2017
@csarofeen
Contributor Author

Replaced by #4021 due to the cuDNN rewrite in ATen.

@csarofeen closed this Dec 5, 2017
@csarofeen deleted the BNfp16 branch February 12, 2020