[inductor][ez] add overridable env var for disabling fx graph cache #166138
shunting314 wants to merge 1 commit into gh/shunting314/245/base
Conversation
I set TORCHINDUCTOR_FX_GRAPH_CACHE=0 a lot to make sure compilation actually happens by disabling fx graph caching. I even put this in my .bashrc. But this causes a simple vllm script to fail: https://gist.github.com/shunting314/4253b2b5ab5e7d1b0fc9516c84054904 Error log: https://gist.github.com/shunting314/1d04bbeb58bc486f975684f56d65615d

The root cause:
1. vllm patches inductor_config.fx_graph_cache to True here: https://github.com/vllm-project/vllm/blob/e255d929902dcf8968541d2cbf0d18f0fe3f9c49/vllm/compilation/compiler_interface.py#L308 The code in vllm relies on the fx graph cache being on (unless VLLM_DISABLE_COMPILE_CACHE is overridden).
2. Setting TORCHINDUCTOR_FX_GRAPH_CACHE=0 makes inductor_config.fx_graph_cache non-overridable.

I add TORCHINDUCTOR_FX_GRAPH_CACHE_DEFAULT so that we can still use it to skip the fx graph cache while still allowing projects like vllm to override it.
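The precedence change can be sketched with a tiny helper. This is a hypothetical illustration of the two env vars' semantics, not Inductor's actual config code:

```python
def resolve_fx_graph_cache(env, patched=None):
    # Hypothetical sketch of the precedence rules, not PyTorch's real code.
    # TORCHINDUCTOR_FX_GRAPH_CACHE, when set, wins over any config patch --
    # this is what silently defeated vllm's patch to True.
    hard = env.get("TORCHINDUCTOR_FX_GRAPH_CACHE")
    if hard is not None:
        return hard == "1"
    # TORCHINDUCTOR_FX_GRAPH_CACHE_DEFAULT only seeds the default, so a
    # later config patch (like vllm's) still takes effect.
    default = env.get("TORCHINDUCTOR_FX_GRAPH_CACHE_DEFAULT", "1") == "1"
    return default if patched is None else patched

# Old env var: vllm's patch to True is ignored, the cache stays off.
print(resolve_fx_graph_cache({"TORCHINDUCTOR_FX_GRAPH_CACHE": "0"}, patched=True))
# New env var: the cache is off by default, but vllm's patch turns it back on.
print(resolve_fx_graph_cache({"TORCHINDUCTOR_FX_GRAPH_CACHE_DEFAULT": "0"}, patched=True))
```

With the `_DEFAULT` variant, an unset patch still yields the disabled-by-default behavior the env var asks for, so the .bashrc use case keeps working.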
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166138
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (1 Unrelated Failure) As of commit edd15b3 with merge base e20c9bf. UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
In addition to 'VLLM_DISABLE_COMPILE_CACHE=0', we can also remove the cached code under '.cache/vllm/'.
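For reference, clearing that directory can be scripted. The path below assumes vllm's default cache location under the home directory, as the comment suggests; adjust it if your setup relocates the cache:

```python
import shutil
from pathlib import Path

# Assumed default vllm cache location (from the comment above);
# adjust if your environment relocates the cache.
cache_dir = Path.home() / ".cache" / "vllm"

# Remove the compiled-code cache so the next run recompiles from scratch.
# ignore_errors=True makes this a no-op when the cache does not exist.
shutil.rmtree(cache_dir, ignore_errors=True)
```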
|
@BoyuanFeng that triggers compilation (rather than loading from saved artifacts), but this line will still fail since the compiled result is not saved to the cache: https://github.com/vllm-project/vllm/blob/e255d929902dcf8968541d2cbf0d18f0fe3f9c49/vllm/compilation/compiler_interface.py#L476
|
A setting like TORCHINDUCTOR_FX_GRAPH_CACHE=0 forces the fx graph cache off in a way that downstream code cannot override. The new env var avoids that and is safe to put in .bashrc.
|
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
@pytorchbot merge -i

Merge started. Your change will be merged while ignoring the following 1 check: trunk / linux-jammy-py3-clang12-executorch / test (executorch, 1, 1, lf.linux.2xlarge, unstable). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
@pytorchbot merge -i

Merge started. Your change will be merged while ignoring the following 1 check: trunk / linux-jammy-py3-clang12-executorch / test (executorch, 1, 1, lf.linux.2xlarge, unstable). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
@pytorchbot merge -f 'force merge since it fails for so many times..." |
❌ 🤖 pytorchbot command failed:
@pytorchbot merge -f 'force merge since it fails for so many times...' |
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben