Update from facebook #8384
Conversation
soumith left a comment
Declarations.yaml shouldn't be included. It's auto-generated.
I believe "Resolve conflicts for tools/jit/gen_jit_dispatch.py b8e2d3a" is already in master as well. cc: @jamesr66a
aten/src/ATen/Declarations.yaml (Outdated)
Hotfix for failures in conv_transpose
lint with black
Make ctc_greedy_decoder behave the same as TensorFlow's
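For reference, here is a minimal, illustrative sketch of the greedy CTC decoding semantics the commit above aligns with (per-step argmax, merging of consecutive repeats as in TensorFlow's merge_repeated=True, then blank removal); it is not the Caffe2 operator itself, and the logits below are made up.

```python
# Illustrative greedy CTC decode: argmax per time step, collapse consecutive
# repeats, then drop blank labels. Not the Caffe2 operator implementation.
import numpy as np

def ctc_greedy_decode(logits, blank=0):
    """logits: (time, num_classes) array of per-step class scores."""
    best_path = np.argmax(logits, axis=1)            # best class at each step
    collapsed = []
    for label in best_path:
        if not collapsed or label != collapsed[-1]:  # merge repeated labels
            collapsed.append(int(label))
    return [l for l in collapsed if l != blank]      # remove the blank label

logits = np.array([[0.1, 0.8, 0.1],
                   [0.1, 0.8, 0.1],
                   [0.7, 0.2, 0.1],
                   [0.1, 0.1, 0.8]])
print(ctc_greedy_decode(logits))  # [1, 2]
```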
Allow multiple callbacks per event
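As a rough sketch of the pattern this commit enables, the registry below keeps a list of callbacks per event instead of a single one; the class and event names are illustrative, not the actual Caffe2 interface.

```python
# Minimal sketch of the "multiple callbacks per event" pattern; the registry
# and event names here are illustrative, not the actual Caffe2 API.
from collections import defaultdict

class EventRegistry:
    def __init__(self):
        self._callbacks = defaultdict(list)

    def register(self, event, callback):
        # Append rather than overwrite, so several callbacks can share one event.
        self._callbacks[event].append(callback)

    def fire(self, event, *args):
        for callback in self._callbacks[event]:
            callback(*args)

registry = EventRegistry()
registry.register("net_finished", lambda name: print("log:", name))
registry.register("net_finished", lambda name: print("metrics:", name))
registry.fire("net_finished", "my_net")
```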
The motivation is to do a weighted sum in HoNet/crossnet. In the next diff, I'll replace model.Add with model.WeightedSum in honet: https://fburl.com/f4rmolg2 and crossnet: https://fburl.com/v7awn8se, https://fburl.com/63filbnm
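A hedged example of the Caffe2 WeightedSum operator mentioned above, assuming the usual interleaved (data, weight) input convention with 1-element weight blobs; the blob names and values are illustrative.

```python
# Hedged sketch of Caffe2's WeightedSum operator, which the note above wants to
# use in place of a plain Add. Inputs are interleaved (data, weight) blobs,
# where each weight is a 1-element blob; blob names are illustrative.
import numpy as np
from caffe2.python import core, workspace

workspace.FeedBlob("x0", np.array([1.0, 2.0], dtype=np.float32))
workspace.FeedBlob("w0", np.array([0.25], dtype=np.float32))
workspace.FeedBlob("x1", np.array([10.0, 20.0], dtype=np.float32))
workspace.FeedBlob("w1", np.array([0.75], dtype=np.float32))

op = core.CreateOperator("WeightedSum", ["x0", "w0", "x1", "w1"], ["y"])
workspace.RunOperatorOnce(op)
print(workspace.FetchBlob("y"))  # 0.25 * x0 + 0.75 * x1
```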
Some callers expect RunAsync to block; replicate that behavior in the case of an explicit 'dag' net type.
as title
@pytorchbot retest this please
Overriding dag, async_dag and async_polling with async_scheduling
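A small, hedged sketch of how the executor is selected through the net's type field in the Caffe2 Python API; the net contents are illustrative, and after this change nets requesting 'dag', 'async_dag' or 'async_polling' would be run by the 'async_scheduling' executor instead.

```python
# Sketch of selecting a Caffe2 net executor via the NetDef `type` field; the
# net body is illustrative and only exists to give the net something to run.
from caffe2.python import core, workspace

net = core.Net("example")
net.ConstantFill([], ["blob"], shape=[1], value=1.0)
net.Proto().type = "async_scheduling"  # executor choice lives on the NetDef

workspace.CreateNet(net)
workspace.RunNet(net.Proto().name)
```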
Caffe thread pools currently inherit the thread names from the thread that starts them, which can be misleading. Give them an explicit name instead.
Change argument type to int64_t for shape argument of FillerOp (used in ConstantFill, XavierFill, etc)
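For illustration, here is how the shape argument is passed to a filler op such as ConstantFill from the Caffe2 Python API; with int64_t, individual dimensions beyond the 32-bit range become representable (the sizes below are deliberately small and only illustrative).

```python
# Example of the `shape` argument on a filler op such as ConstantFill. Moving
# the argument to int64_t lets dimensions exceed the 32-bit int range; the
# sizes used here are just illustrative.
from caffe2.python import core, workspace

op = core.CreateOperator(
    "ConstantFill",
    [],                # no input blob: the shape comes from the argument
    ["filled_blob"],
    shape=[2, 3],      # with int64_t, very large dims are representable
    value=0.5,
)
workspace.RunOperatorOnce(op)
print(workspace.FetchBlob("filled_blob").shape)  # (2, 3)
```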
It's not used anywhere and depends on old Lua Torch code that conflicts with ATen. Given PT1 it's not relevant any more (though it was nice and clever code!) #accept2ship
The multiplier needs to be non-negative, not strictly positive.
After 2 years we still do not seem to have a use case for this one, so for the sake of clean API design we should remove it. Removing it would also let us pass in arguments to optionally construct an object, although it is admittedly unclear how we could reuse existing objects if constructor arguments are passed in. In any case, we may want to remove this dangling feature.
Speed up GenerateProposals by using partial_sort. FACEBOOK:
- Saw a speed improvement for training with this op.
- Yanghan benchmarked the op on a small dataset and saw a consistent 100% speed improvement (6ms -> 3ms) at 420 input resolution. See the next diff for details.
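The sketch below illustrates the partial-sort idea in Python: when only the top-k scoring proposals are needed, a selection (heapq.nlargest, analogous to std::partial_sort) avoids sorting every score. It is an illustration of the technique, not the actual C++ code, and the sizes are made up.

```python
# Partial selection vs. full sort for picking the top-k proposal scores.
# heapq.nlargest is O(n log k), a full sort is O(n log n); both give the
# same top-k values.
import heapq
import numpy as np

scores = np.random.rand(100000)
k = 2000

top_k_partial = heapq.nlargest(k, scores)        # selection, no full sort
top_k_full = sorted(scores, reverse=True)[:k]    # full sort for comparison

assert np.allclose(top_k_partial, top_k_full)
```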
Make the C++ version of GenerateProposals more friendly to parallel processing.
This reverts commit f7f434dc5c34ca6058b9765d2ef615453d2276a9 @bypass-lint An infra SEV is better than not reverting this diff. If you copy this password, see you in SEV Review! @cause_a_sev_many_files
…8364)
* [IDEEP] Upgrade IDEEP version
  Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>
* [IDEEP] Fix accuracy issue in conv op
  Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>
* Fix build error due to lack of src in CMakeLists
  Signed-off-by: Gu, Jinghui <jinghui.gu@intel.com>
* ATen fallback for ONNX export
* Move to enum
* Fix model test
* Add comment
* Address comments BC interface
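A hedged example of using the ATen fallback export mode referenced in the commit above (the one moved to an enum): operators without a native ONNX mapping are emitted as ATen ops rather than failing the export. The model, input, and file name are illustrative.

```python
# Sketch of exporting with the ATen fallback mode: ops lacking a native ONNX
# mapping are exported as ATen ops instead of aborting the export.
import torch
import torch.onnx

model = torch.nn.Linear(4, 2)
dummy_input = torch.randn(1, 4)

torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",
    operator_export_type=torch.onnx.OperatorExportTypes.ONNX_ATEN_FALLBACK,
)
```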
* Add hip support for caffe2 core
* Add MIOPEN header/wrapper to caffe2 core
* Add HIP device into caffe2 PB
* top level makefile change for rocm/hip
* makefile scaffolding for AMD/RocM/HIP
* Makefile scaffolding for AMD/RocM/HIP; add makefile/utility for HIP files
* caffe2 PB update for AMD/ROCM HIP device
* Add AMD/RocM/Thrust dependency
* HIP threadpool update
* Fix makefile macro
* makefile fix: duplicate test/binary name
* makefile clean-up
* makefile clean-up
* add HIP operator registry
* add utilities for hip device
* Add USE_HIP to config summary
* makefile fix for BUILD_TEST
* merge latest
* Fix indentation
* code clean-up
* Guard builds without HIP and use the same cmake script as PyTorch to find HIP
* Setup rocm environment variables in build.sh (ideally should be done in the docker images)
* setup locale
* set HIP_PLATFORM
* Revert "set HIP_PLATFORM" This reverts commit 8ec58db.
* continue the build script environment variables mess
* HCC_AMDGPU_TARGET
* Cleanup the mess, has been fixed in the latest docker images
* Assign protobuf field hip_gpu_id a new field number for backward compatibility
* change name to avoid conflict
* Fix duplicated thread pool flag
* Refactor cmake files to not add hip includes and libs globally
* Fix the wrong usage of environment variables detection in cmake
* Add MIOPEN CNN operators
* Revert "Add MIOPEN CNN operators" This reverts commit 6e89ad4.
* Add MIOPEN pooling operator
* Add MIOPEN activation operator
* Add MIOPEN softmax operator
* Add MIOPEN spatial batch norm operator
* Add MIOPEN local response normalization operator
* Add MIOPEN conv operator
* Clean-up LRN ops
* enable fp16 in MIOPEN pool ops
* Enable fp16 for MIOPEN relu op
* Enable fp16 for MIOPEN spatial batch norm op
* code clean-up
* revert float16 support
* Create Caffe2 python binding for AMD/ROCM/HIP
* Add op fallback for HIP operator
* add hip src/test files in cmake
* exclude hip src/test files
* fix python binding for hip backend
* fix MIOPEN pooling op workspace
* hack to compile miopen operators
* fix include path for MIOPEN ops
* Fix include path
* Add HIP math utilities
* Fix path for HIP math utils
* cmake fix
* Cmake fix / hipcc for hip files
* suppress hipcc warning
* cmake fix / replace USE_HIP with USE_ROCM
* revert LoadHIP.cmake change
* fix include for thrust/cub-hip
* include path fix for conversion.h
* Updated with latest upstream changes
* clang format fixes
* Context_hip updates
* Fixed typo in rocblas handle get function
* Updated hipified math utils
* Updated math hip test util
* Updated context hip test
* Updated common_hip
* Updated net async dag for HIP
* Added MIOPEN in operator hip test
* fix
* C2 dependencies clean-up
* fix include path for building custom protobuf
* Decouple miopen pool op and conv_pool_op base
* cmake refactor
* fix operator_hip_test
* move all hip/miopen ops files into caffe2/operators/hip
* sanitize cmake
* permission issue
* remove extra parenthesis
* remove artifact from resolving merge conflict
* cont. sanitize cmake files
* fix syntax error
* sanitize conversion.h
* .
* Revert "." This reverts commit 56020cb.
* clang-format
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
* Enable some of the ONNX backend tests on broadcasting
* Enable gemm broadcast
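A NumPy sketch of the Gemm broadcasting these tests cover: a bias C of shape (N,) is broadcast across the rows of alpha * A @ B. Shapes and values are illustrative.

```python
# Gemm with a broadcast bias: C of shape (N,) is added to every row of A @ B.
import numpy as np

A = np.random.rand(3, 4).astype(np.float32)
B = np.random.rand(4, 5).astype(np.float32)
C = np.random.rand(5).astype(np.float32)   # one bias value per output column
alpha, beta = 1.0, 1.0

Y = alpha * (A @ B) + beta * C             # C broadcasts to shape (3, 5)
print(Y.shape)                             # (3, 5)
```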
* Expose proto utils and ONNX from PyTorch libcaffe2.so
* Try to use protobuf from _C.so
* Fix ONNX proto header include
* Adjust order of imports for ONNX until nanopb goes away
* Set and use ONNX_NAMESPACE for PyTorch builds
* Show protobuf summary for all builds
* Add ONNX_NAMESPACE for cpp_build
* Statically link libprotobuf.a into libtorch.so
* Set ONNX_NAMESPACE on Windows build
* Move core/dispatch up as well
* Add /MD flag for Windows build of _C
* Potential Windows fix for ONNX and protobuf
* Add direct linkage from _C to ONNX on Windows
* Only include protobuf wrapper for PyTorch
* Pass extra_compile_args to _nvrtc ext build
* Remove installation of .a files
…orch into update-from-facebook
except ImportError:
    from tools.shared.module_loader import import_module
    CodeTemplate = import_module('code_template', 'aten/src/ATen/code_template.py').CodeTemplate
tools/autograd/utils.py (Outdated)
from tools.shared.module_loader import import_module
CodeTemplate = import_module('code_template', 'aten/src/ATen/code_template.py').CodeTemplate
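For context, a path-based import helper like the import_module used above can be written with importlib as sketched below; this is the general pattern, not necessarily the exact implementation in tools/shared/module_loader.py.

```python
# Hedged sketch of a path-based import_module helper using importlib; the real
# tools/shared/module_loader.py may differ in detail.
import importlib.util

def import_module(name, path):
    # Load a module directly from a file path, bypassing sys.path lookup.
    spec = importlib.util.spec_from_file_location(name, path)
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)
    return module

# Mirrors the usage in the diff above: pull CodeTemplate out of a file that is
# not importable as a regular package.
CodeTemplate = import_module(
    'code_template', 'aten/src/ATen/code_template.py'
).CodeTemplate
```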
tools/nnwrap/generate_wrappers.py (Outdated)
except OSError:
    pass
with open(os.path.join(install_dir, 'THNN.cwrap'), 'w') as f:
with open('torch/csrc/nn/THNN.cwrap', 'w') as f:
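The snippet above shows THNN.cwrap being written either under a configurable install_dir or a fixed torch/csrc/nn path; below is a hedged sketch of the surrounding create-directory-then-write pattern using the install_dir variant (the directory value and file content are illustrative).

```python
# Sketch of the pattern around the diff above: create the output directory,
# tolerate it already existing (the `except OSError: pass`), then write the
# generated wrapper file under install_dir.
import os

install_dir = 'torch/csrc/nn'   # illustrative value

try:
    os.makedirs(install_dir)
except OSError:
    pass                        # directory already exists

with open(os.path.join(install_dir, 'THNN.cwrap'), 'w') as f:
    f.write('# generated wrapper declarations go here\n')
```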
soumith left a comment
merge after tests pass
Are we sure the codebase is synced properly? For example, the c10 folder was added 6 days ago in D8305610 (see internal), but it is deleted in a more recent PR #8264. It seems that some of the changes got lost along the way.
Did a bit more digging, and it seems the tooling is fine. The fb-internal diff landed on the afternoon of 3/7 (adding the c10 folder), and the GitHub PR landed the night of 3/7 (strictly speaking 3/8). Because the PR added and then deleted the c10 folder during the discussion, the final squash-and-merge was not aware of the c10 folder at all. As a result, a dangling c10 folder needed manual deletion; the sync script itself is correct.
No description provided.