[dynamo] Rehaul the autograd.Function support by anijain2305 · Pull Request #166788 · pytorch/pytorch

anijain2305 · 2025-11-01T06:31:27Z

Stack from ghstack (oldest at bottom):

We make a rehaul because
(1) we want to support non-proxyable outputs in the fwd method
(2) we saw general softness in the support.

I have put lot of comments in the code.

Follow up

Graph break on backward stride dependent computation. This is BC breaking, so needs care.
Use DynamoAutogradFunctionTraceHelper for backward tracer.
Add test cases for module input, pytree input/outputs, pg groups
Consider unifying automatic and automatic_with_forced_placeholders
Better error messages - especially nonstrict trace for stride dependent backward computation.

[ghstack-poisoned]

pytorch-bot · 2025-11-01T06:31:31Z

🔗 Helpful Links

Note: Links to docs will display an error until the docs builds have been completed.

As of commit cad48c9 with merge base 033659b ():

NEW FAILURES - The following jobs have failed:

trunk / linux-jammy-cuda12.8-py3.10-gcc11-no-ops / build (gh)
ninja: build stopped: subcommand failed
trunk / linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1) (gh)
Process completed with exit code 1.
trunk / linux-jammy-rocm-py3.10 / test (default, 3, 6, linux.rocm.gpu.gfx942.1) (gh)
Process completed with exit code 1.
trunk / linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1) (gh)
Process completed with exit code 1.
trunk / linux-jammy-rocm-py3.10 / test (default, 5, 6, linux.rocm.gpu.gfx942.1) (gh)
Process completed with exit code 1.
trunk / linux-jammy-rocm-py3.10 / test (default, 6, 6, linux.rocm.gpu.gfx942.1) (gh)
Process completed with exit code 1.

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

inductor / inductor-cpu-test / test (cpu_inductor_torchbench, 1, 2, linux.2xlarge.amx) (gh) (similar failure)
doctr_reco_predictor
inductor / inductor-test-cuda13 / test (inductor_torchbench, 1, 2, linux.g5.4xlarge.nvidia.gpu) (gh) (similar failure)
doctr_reco_predictor
pull / linux-jammy-py3.14-clang12 / test (dynamo_wrapped, 3, 3, lf.linux.2xlarge) (gh) (similar failure)
test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_e2e
trunk / linux-jammy-rocm-py3.10 / test (default, 1, 6, linux.rocm.gpu.gfx942.1) (gh) (detected as infra flaky with no log or failing log classifier)
trunk / linux-jammy-rocm-py3.10 / test (distributed, 1, 3, linux.rocm.gpu.gfx942.4) (gh) (detected as infra flaky with no log or failing log classifier)
trunk / linux-jammy-rocm-py3.10 / test (distributed, 2, 3, linux.rocm.gpu.gfx942.4) (gh) (similar failure)
Process completed with exit code 1.
trunk / linux-jammy-rocm-py3.10 / test (distributed, 3, 3, linux.rocm.gpu.gfx942.4) (gh) (similar failure)
Process completed with exit code 1.

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 073736f Pull Request resolved: #166788