Add non-eager registration to dispatch autogen #74557
Conversation
💊 CI failures summary and remediations, as of commit 835a272 (Dr. CI): 💚 Looks good so far! There are no failures yet. 💚
This pull request was exported from Phabricator. Differential Revision: D35051464
Are all the changes here part of this PR? (Or is there any chance you can split the eager changes into their own PR for review?)
Sorry, I have been exporting these PRs from a stack in Phabricator, and I didn't realize that the way GitHub would render them is unusable. If you click on just the top commit shown here, it will show you the small set of changes that actually goes with this PR.
"external backends" -> "everything that isn't LTC"? (Technically I think this applies both our eager backends in-tree like CPU/CUDA, and also any other backends out-of-tree like Intel XPU)
Yes, I meant external LTC backends, such as one developed out of tree. I should probably update it to say more clearly that only the in-tree lazy TorchScript backend gets special behavior, and others don't.
I'm a little confused by this split between DispatchKeyNativeFunctions.h and RegisterDispatchKey.cpp.
Don't we always end up #include-ing DispatchKeyNativeFunctions.h in this file, so the Register${BackendName}${DispatchKey}Modules function will already be declared (and the macro above will already be defined)?
Maybe in practice this is true, but it's not explicitly clear just from the template definition that this is the case, which is why I added the #ifndef clause. If there happens to be a case (either now or in the future) where DispatchKeyNativeFunctions.h is not included, this avoids a compile error here and falls back to the default behaviour.
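For reference, a minimal sketch of the guard being described, assuming EAGER_REGISTRATION is the macro that the generated DispatchKeyNativeFunctions.h defines (the fallback value here is an assumption):

```cpp
// In the RegisterDispatchKey.cpp template: if the generated
// DispatchKeyNativeFunctions.h was included, it already defined
// EAGER_REGISTRATION; if not, default to eager (load-time)
// registration so the template still compiles unchanged.
#ifndef EAGER_REGISTRATION
#define EAGER_REGISTRATION 1
#endif
```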
I think the "delay registration using a lambda" feels like a niche enough use case to me that we should try to implement it with less clutter to this file. Instead of using the macros, how do you feel about unconditionally generating the Register{...}Modules lambda? And still using ${eager_registration} to decide in the codegen whether or not we call it directly in the file or not (to run the registrations at load time).
The initTSBackend() function wouldn't need to check a macro; it would just always call the lambda to run the registrations. You might run into problems if we got the codegen wrong and accidentally registered the TS backend on startup, but you could also just add a unit test for this (assert in one of the C++ tests that there aren't any kernels registered to the Lazy key on startup).
I think that might not be a bad idea, but perhaps it's better left to a follow-up PR, seeing as this is just a port of a previously merged PR?
Perhaps what could be done is: if the Register{...}Modules function is called at C++ static initialization, we could also reset the value to nullptr so it can't be called again. Then we can just run the function only if it hasn't been called before. This would eliminate the need for the EAGER_REGISTRATION macro. Thoughts?
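A minimal sketch of that reset-to-nullptr idea, assuming the generated hook is held in a std::function; all names here are hypothetical stand-ins for the codegen output:

```cpp
#include <functional>

// Always generated: a callable holding the dispatcher registrations.
std::function<void()> RegisterTorchScriptLazyModules = []() {
  // ... dispatcher registrations for the Lazy key ...
};

// Run the registrations at most once; resetting the hook to nullptr
// afterwards makes any later call detectable (and skippable).
void initTorchScriptBackend() {
  if (RegisterTorchScriptLazyModules) {
    RegisterTorchScriptLazyModules();
    RegisterTorchScriptLazyModules = nullptr;
  }
}
```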
Yep, that sounds reasonable.
I kind of don't want to clutter up this file with the extra macros, mostly because this is a pretty core template used in a bunch of different contexts (it's used for most of our eager-mode dispatcher registrations too), so anyone looking at this file for non-LTC purposes would now have to reason about the macros.
@wconstab would you rather make changes to this PR directly, or address feedback later?
I think we can make the changes here. I already landed this as-is on the staging branch, but we are almost at the point of abandoning the staging branch so I think we can start to do things differently here instead of striving to stay in sync there.
It also depends on how we make the changes; if you have some clear ideas of what I should change, I can just make the changes. If you actually want to try to modify it yourself, I could either land it first and you change it, or maybe you could commandeer this diff.
I think what @antoniojkim put sounds reasonable:
(1) kill the macro and unconditionally generate the Register${BackendName}${DispatchKey}Modules();
(2) In the codegen, if the eager_registration argument is true, emit the code to run the function immediately (which this PR already does), and then also re-assign it to a lambda that no-ops, so you won't accidentally call it and register things multiple times. (Although I think the risk of this being a problem in practice is pretty small, since nobody should ever call this function except for the TorchScript one, so if this is hard to do then I think it's OK if you don't add it.)
Technically this should always be set to 1 for TS, right? Maybe we should just assert it.
Yea, I think that's true.
This might just be from a different PR, but I think we technically don't need BackendName here? (the function name will be unique without it). Plumbing a new BackendName argument through the codegen for the name seems like overkill to me, but maybe there are other places in the codegen where BackendName actually is required.
@bdhirsh This is something I added to differentiate between backends. As far as I am aware, there is no guarantee as to the order in which the symbols are statically initialized. This means that if we had just a RegisterLazyModules function being defined by both the TS backend and a custom backend, there would be no way of knowing who assigned the value of that function pointer last. To avoid ambiguity, I plumbed the backend name through to make things clear and unambiguous.
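A minimal sketch of the ambiguity being described; the file names and hook are hypothetical, and only the static-initialization-order problem is the point:

```cpp
#include <functional>

// Shared hook, declared in some common header:
std::function<void()> RegisterLazyModules;

// In ts_backend.cpp, a static initializer assigns it...
static bool ts_init =
    (RegisterLazyModules = []() { /* TS registrations */ }, true);

// ...and in custom_backend.cpp, another one does too:
static bool custom_init =
    (RegisterLazyModules = []() { /* custom registrations */ }, true);

// Across translation units, which assignment runs last is unspecified,
// so the hook's final value is ambiguous. Baking the backend name into
// the symbol (RegisterTorchScriptLazyModules vs. RegisterCustomLazyModules)
// gives each backend its own hook and removes the ambiguity.
```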
Yeah, good point; I can't think of an easy way to get around this. (cc @ezyang if you happen to have a different opinion. Context: this PR updates the codegen'd dispatcher registrations so that they can be delayed and registered through a lambda instead of at load time.)
tools/codegen/gen_lazy_tensor.py
@wconstab I think there's a problem here. According to the function definition of gen_dispatchkey_nativefunc_headers, the positional argument after autograd_dispatch_key is eager_registration, but instead you passed in backend_name. Since you also set the value of eager_registration using a keyword, the following error occurs:
TypeError: gen_dispatchkey_nativefunc_headers() got multiple values for argument 'eager_registration'
Diff context under review:
```
${call_register_dispatchkey_modules};
```
This is pretty invasive, I'm not sure I like it.
@wconstab this seems like a pretty circuitous way to do something that fundamentally should be relatively simple. So I'd expect TORCH_LIBRARY to move into the codegen code, and then you switch between defining a function or using TORCH_LIBRARY, and then all of the extra faffing about with std::function goes away.
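A minimal sketch of that suggestion; the operator, wrapper, and function names are illustrative, not the actual generated output:

```cpp
#include <torch/library.h>

// Illustrative kernel, assumed to be defined elsewhere.
at::Tensor wrapper_add_Tensor(const at::Tensor& self, const at::Tensor& other,
                              const at::Scalar& alpha);

// Eager case: the codegen emits a TORCH_LIBRARY_IMPL block, which
// registers the kernels via a static initializer at load time.
TORCH_LIBRARY_IMPL(aten, CPU, m) {
  m.impl("add.Tensor", &wrapper_add_Tensor);
}

// Deferred case: the codegen instead emits a plain function that the
// backend calls explicitly; no std::function indirection is needed.
TORCH_API void RegisterTorchScriptLazyNativeFunctions() {
  static auto m = MAKE_TORCH_LIBRARY_IMPL(aten, Lazy);
  m.impl("add.Tensor", &wrapper_add_Tensor);
}
```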
tools/codegen/gen_backend_stubs.py
nit: you can probably just use one argument here instead of two in the template file (dispatch_registrations), since right now one of these two arguments is always an empty string.
But I need to put the content in two different places: one is inside the anonymous namespace, and the other is inside the at:: namespace. Maybe I'm missing something?
Ah yep, I forgot about that. Makes sense!
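For context, a rough sketch of the generated file layout this exchange implies; the substitution-point names are partly assumptions (deferred_dispatch_registrations appears in the snippet below, but its static counterpart's name is guessed):

```cpp
// Generated registration file (sketch).
namespace {
// One substitution point: load-time registrations get internal linkage
// inside the anonymous namespace.
// ${static_dispatch_registrations}
} // namespace

namespace at {
// The other substitution point: the deferred registration function has
// to be externally callable, so it lives in the at:: namespace.
// ${deferred_dispatch_registrations}
} // namespace at
```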
tools/codegen/gen_backend_stubs.py
minor nit: most of our codegen uses f-strings now instead of CodeTemplate, e.g.:
```python
deferred_dispatch_registrations = f"""\
TORCH_API void Register{backend_name}{dispatch_key}NativeFunctions() {{
...
```
Is it bad to use CodeTemplate, though? I started off using f-strings, but since I am substituting a multi-line string (with indentation), CodeTemplate actually provides value here.
I think f-strings are pretty much superior to CodeTemplate (you can do multi-line f-strings just fine, like in the codegen here). It's not too big of a deal though, so I don't want to get hung up on it.
Hmm, I seem to have totally broken this (for the TS backend; other keys seem OK). I took the liberty of using MAKE_TORCH_LIBRARY_IMPL instead of literally what @ezyang suggested; maybe I misunderstood. It seems like either nothing is getting registered (in time?) for TS, or maybe just the fallback is not working, and that's breaking virtually all tests, since we fall back for some factory function. It might matter what namespace the deferred registration function is defined in?
Oops, MAKE_TORCH_LIBRARY_IMPL is the right call. I don't see anything obviously wrong; I guess I would stare at the diff between here and the previous working version.
@wconstab has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
test/cpp/lazy/test_lazy_ops.cpp
It looks like ASAN isn't happy (the ASAN test is complaining about initialization-order-fiasco: https://github.com/pytorch/pytorch/runs/5746632034?check_suite_focus=true)
I think the ordering problem might be between this static bool (which eventually calls RegisterTorchScriptLazyNativeFunctions), and the static library object that that function uses in the codegen: static auto m = MAKE_TORCH_LIBRARY_IMPL(aten, $dispatch_key);.
You might need to either access m through a function, or maybe not make this bool static?
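A minimal sketch of the first suggested fix, wrapping the library object in a function-local static so it is constructed on first use rather than at load time; the accessor and kernel names are illustrative:

```cpp
#include <torch/library.h>

// Illustrative kernel, assumed to be defined elsewhere.
at::Tensor wrapper_add_Tensor(const at::Tensor& self, const at::Tensor& other,
                              const at::Scalar& alpha);

// Accessing the library through a function means it is constructed the
// first time it is needed, sidestepping cross-TU static init order.
torch::Library& lazyAtenLibrary() {
  static auto m = MAKE_TORCH_LIBRARY_IMPL(aten, Lazy);
  return m;
}

TORCH_API void RegisterTorchScriptLazyNativeFunctions() {
  lazyAtenLibrary().impl("add.Tensor", &wrapper_add_Tensor);
}
```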
bdhirsh left a comment:
lgtm!
Summary:
Previously, the torchscript backend would be (partially) initialized at startup:
- the dispatcher registrations would be registered,
- but other backend components would not be initialized until explicitly calling the backend init function.
With this change, the torchscript backend is not initialized until its explicit
initialization function is called.
This enables external backends to register their own backend instead of the torchscript
backend to the same (Lazy) key.
Lands a change contributed by @antoniojkim via lazy_tensor_staging branch (#73973)
Differential Revision: D35051464