-
Notifications
You must be signed in to change notification settings - Fork 26.3k
Move EmbeddingBag into ATen #4856
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@pytorchbot add to whitelist |
|
RE: the build errors: s/_VariableBase/_VariableFunctions |
tools/autograd/derivatives.yaml
Outdated
This comment was marked as off-topic.
This comment was marked as off-topic.
Sorry, something went wrong.
colesbury
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks pretty good.
- There's a change to pybind11 submodule that doesn't look like it belongs here
- The Windows build failure looks related to this PR:
01:14:54 LINK : fatal error LNK1181: cannot open input file 'src\ATen\CMakeFiles\ATen.dir\native\EmbeddingBag.cpp.obj'
This comment was marked as off-topic.
This comment was marked as off-topic.
Sorry, something went wrong.
This comment was marked as off-topic.
This comment was marked as off-topic.
Sorry, something went wrong.
This comment was marked as off-topic.
This comment was marked as off-topic.
Sorry, something went wrong.
This comment was marked as off-topic.
This comment was marked as off-topic.
Sorry, something went wrong.
This comment was marked as off-topic.
This comment was marked as off-topic.
Sorry, something went wrong.
82e8247 to
7d8faa3
Compare
|
chiming in to say that the relevant issue is #4441 , which should be closed once this PR is approved and merged. |
This comment was marked as off-topic.
This comment was marked as off-topic.
Sorry, something went wrong.
2fc03c6 to
874e3e3
Compare
This diff creates code related to EmbeddingBag in ATen. It also allows sparse gradients.
This is the command I used to create the timings for both
Master
emb_tmp
This is the script I used to benchmark.
EDIT: Added CUDA synchronize
The performance between both branches matches, but this new code has sparse gradients and lives within ATen.
The code was built with
The timings of the first 1000 runs are discarded and the benchmark is executed 10000 times.