[DO NOT REVIEW] Inductor lite mode with CUDAGraph support by BoyuanFeng · Pull Request #166320 · pytorch/pytorch

BoyuanFeng · 2025-10-27T18:11:34Z

This PR tracks the inductor fallback cudagraph backend. The code changes will be landed in smaller PRs.

Cache get_free_symbol_uses for faster compilation time of inductor graph partition [GraphPartition] cache get_free_symbol_uses #166338
Land inductor lite mode without cudagrah Inductor Lite Mode #167115
Receive right static input indices when tensor subclass is used (fix static_input_indices subclass remapping under training #167127 by @bdhirsh and [export] Fix static_input_indices for aot_export_joint #166761 by @angelayi)

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @pragupta @msaroufim @dcci @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @jerryzh168 @aditew01 @ezyang @EikanWang @wenzhe-nrv @voznesenskym @penguinwu @Guobing-Chen @zhuhaozhe @blzheng @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @Lucaskabela @xmfan

pytorch-bot · 2025-10-27T18:11:38Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166320

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[ROCm][CI] Machines under the label linux.rocm.gpu.2, label linux.rocm.gpu.4, linux.rocm.gpu.gfx1100 are undergoing maintenance.

❌ 6 New Failures

As of commit 1d3633a with merge base 9f9dbe0 ():

NEW FAILURES - The following jobs have failed:

Lint / lintrunner-mypy-partial / linux-job (gh)
>>> Lint for torch/fx/passes/regional_inductor.py:
Lint / lintrunner-noclang-partial / linux-job (gh)
>>> Lint for torch/_inductor/output_code.py:
pull / linux-jammy-py3.10-clang12 / test (default, 3, 5, lf.linux.4xlarge) (gh)
test/dynamo/test_subclasses.py::SubclassTests::test_mark_static_with_subclass_desugaring_dynamic_False
pull / linux-jammy-py3.10-clang18-asan / test (default, 6, 7, lf.linux.4xlarge) (gh)
test/dynamo/test_subclasses.py::SubclassTests::test_mark_static_with_subclass_desugaring_dynamic_False
pull / linux-jammy-py3.10-gcc11 / test (default, 5, 5, lf.linux.2xlarge) (gh)
test/dynamo/test_subclasses.py::SubclassTests::test_mark_static_with_subclass_desugaring_dynamic_False
pull / linux-jammy-py3.13-clang12 / test (default, 5, 5, lf.linux.4xlarge) (gh)
test/dynamo/test_subclasses.py::SubclassTests::test_mark_static_with_subclass_desugaring_dynamic_False

This comment was automatically generated by Dr. CI and updates every 15 minutes.

wconstab · 2025-11-05T18:33:17Z

torch/_inductor/__init__.py


    mode_options: dict[str, dict[str, bool]] = {
        "default": {},
+        # lightweight backend


@BoyuanFeng just a question about how the config works. in autoparallel, we directly call compile_fx_inner from aot_export_joint_...'s compile_fn. Can I directly use this 'mode' for light somewhere? or do i just need to manually set these same set of options in the inductor.config module?

for a workaround, we can just set the config for autoparallel for now. I will try to land these config soon.

In general, autoparallel uses aot_export_join_... taking fw_compiler and bw_compiler, which is different from compile_fx. We need a bit refactor on inductor side to provide these api directly.

pytorch-bot bot added ciflow/inductor module: inductor release notes: fx release notes category labels Oct 27, 2025

BoyuanFeng marked this pull request as draft October 27, 2025 18:11

facebook-github-bot added the fx label Oct 27, 2025

BoyuanFeng changed the title ~~[DO NOT REVIEW] Inductor CUDAGraph backend~~ [DO NOT REVIEW] Inductor Fallback CUDAGraph backend Oct 27, 2025

BoyuanFeng changed the title ~~[DO NOT REVIEW] Inductor Fallback CUDAGraph backend~~ [DO NOT REVIEW] Inductor lite mode with CUDAGraph support Oct 31, 2025

BoyuanFeng force-pushed the bf/cg-backend branch from b9c60ed to 8856569 Compare November 1, 2025 22:41

init

cba13b3

BoyuanFeng force-pushed the bf/cg-backend branch 2 times, most recently from b9c60ed to cba13b3 Compare November 1, 2025 22:45

BoyuanFeng removed release notes: distributed (checkpoint) ciflow/rocm Trigger "default" config CI on ROCm module: compiled autograd compiled_autograd release notes: inductor (aoti) ciflow/h100 ciflow/h100-symm-mem ciflow/b200 labels Nov 1, 2025

workaround wrong maybe_subclass_meta.fw_metadata.static_input_indices

1d3633a

pytorch-bot bot added the release notes: fx release notes category label Nov 2, 2025

wconstab reviewed Nov 5, 2025

View reviewed changes

BoyuanFeng closed this Nov 20, 2025

github-actions bot deleted the bf/cg-backend branch December 21, 2025 02:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DO NOT REVIEW] Inductor lite mode with CUDAGraph support#166320

[DO NOT REVIEW] Inductor lite mode with CUDAGraph support#166320
BoyuanFeng wants to merge 2 commits intomainfrom
bf/cg-backend

BoyuanFeng commented Oct 27, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Oct 27, 2025 •

edited

Loading

Uh oh!

wconstab Nov 5, 2025

Uh oh!

BoyuanFeng Nov 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

BoyuanFeng commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166320

❗ 1 Active SEVs

❌ 6 New Failures

Uh oh!

wconstab Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

BoyuanFeng Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

BoyuanFeng commented Oct 27, 2025 •

edited

Loading

pytorch-bot bot commented Oct 27, 2025 •

edited

Loading