
[inductor] fix issue for example value with unbacked strides #163660

Closed

sevenEng wants to merge 2 commits into pytorch:main from sevenEng:q1l1/fix_autotune_unbacked_strides_example


Conversation

@sevenEng
Contributor

@sevenEng sevenEng commented Sep 23, 2025

Issue

During autotuning, size hints are not applied atomically to the example inputs used for benchmarking.

If an unbacked symint shows up in an input's strides, this can lead to a CUDA illegal memory access (IMA).

The added unit test reproduces it: with a stride of [128 * u0, 128, 1] and an unbacked fallback of 8192, calling benchmark_example_value returns a tensor with stride [8192, 128, 1] instead of [128 * 8192, 128, 1].

Fix

Use the atomic API when applying size hints to the input tensors' strides.
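
As a rough illustration of the failure mode (plain sympy, not the Inductor size-hint APIs; `naive_hint` and `atomic_hint` are made-up helpers), this is the difference between hinting the whole unbacked expression with the fallback versus substituting the fallback into it:

```python
# Standalone sketch of the bug described above; helper names are illustrative.
import sympy

u0 = sympy.Symbol("u0", positive=True, integer=True)
FALLBACK = 8192  # unbacked fallback used in the repro
stride = [128 * u0, 128, 1]

def naive_hint(expr, fallback):
    # Non-atomic behavior: an expression containing an unbacked symbol is
    # replaced wholesale by the fallback, dropping the 128x factor.
    if isinstance(expr, sympy.Expr) and expr.free_symbols:
        return fallback
    return int(expr)

def atomic_hint(expr, fallback):
    # Atomic behavior: substitute the fallback for the unbacked symbol
    # inside the expression, so compound strides stay consistent.
    if isinstance(expr, sympy.Expr) and expr.free_symbols:
        return int(expr.subs({s: fallback for s in expr.free_symbols}))
    return int(expr)

print([naive_hint(s, FALLBACK) for s in stride])   # [8192, 128, 1]     (wrong)
print([atomic_hint(s, FALLBACK) for s in stride])  # [1048576, 128, 1]  == [128 * 8192, 128, 1]
```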

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

@pytorch-bot

pytorch-bot bot commented Sep 23, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163660

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 0d8f1f5 with merge base 3a110c9:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot
Contributor

@sevenEng has imported this pull request. If you are a Meta employee, you can view this in D83066085.

@ColinPeppler
Contributor

Dumping some thoughts here, since this would be used for torch.compile GEMM autotuning too:

  1. If we graph break, do the sub-graphs share a single SizeVarAllocator?
  2. If we recompile, do both graphs share a single SizeVarAllocator?

Why do I care about SizeVarAllocator so much? We use it to cache unbacked substitutions when calling atomically_apply_size_hint.

@functools.lru_cache  # noqa: B019
def _sub_unbacked_exprs(self, expr: Expr) -> Expr:
    # it's fine to cache this fn since self is a singleton
    replacements = self._get_unbacked_replacements()
    while True:
        new_expr = expr.subs(replacements)
        if new_expr == expr:
            return new_expr
        expr = sympy.factor(new_expr)

def atomically_apply_size_hint(
    self, expr: Union[Expr, int], *, fallback: Optional[int] = None
) -> Union[Expr, int]:
    if isinstance(expr, (int, sympy.Integer)):
        return int(expr)

    if has_free_unbacked_symbols(expr):
        # Make sure to substitute with the factored version
        # e.g. 10*(s0 + u0) instead of 10*s0 + 10*u0
        expr = self._sub_unbacked_exprs(sympy.factor(expr))

I believe the answer to both (1) and (2) is Yes.

AFAIK every time we run Inductor, we create a brand new GraphLowering.

graph = GraphLowering(

When we create GraphLowering we also create SizeVarAllocator.

self.sizevars = SizeVarAllocator(shape_env)
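
To make the caching concern concrete, here is a toy stand-in for that pattern (`ToySizeVarAllocator` and `sub_unbacked_exprs` are invented names, not the real Inductor classes), showing why the lru_cache is only safe when each graph gets its own allocator instance:

```python
# Toy illustration of the caching pattern under discussion; simplified stand-ins,
# not the actual Inductor types.
import functools
import sympy

class ToySizeVarAllocator:
    def __init__(self, replacements):
        # Mirrors the fact that a fresh SizeVarAllocator is built per GraphLowering,
        # so this per-instance state is fixed for the lifetime of one graph.
        self._replacements = dict(replacements)

    @functools.lru_cache  # noqa: B019 -- acceptable only because instances are per-graph
    def sub_unbacked_exprs(self, expr):
        # Fixed-point substitution loop, mirroring the excerpt quoted above.
        while True:
            new_expr = expr.subs(self._replacements)
            if new_expr == expr:
                return new_expr
            expr = sympy.factor(new_expr)

u0, u1 = sympy.symbols("u0 u1", positive=True, integer=True)

# Each graph gets its own allocator, so cached results computed against one
# graph's replacements can never leak into another graph.
graph_a = ToySizeVarAllocator({u0: u1})
graph_b = ToySizeVarAllocator({u0: 2 * u1})
print(graph_a.sub_unbacked_exprs(128 * u0))  # 128*u1
print(graph_b.sub_unbacked_exprs(128 * u0))  # 256*u1
```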

@sevenEng sevenEng force-pushed the q1l1/fix_autotune_unbacked_strides_example branch from 865aca9 to 5e0f4ac on September 25, 2025 18:36
V.graph.sizevars.size_hints(
    node.get_stride(),
    fallback=config.unbacked_symint_fallback,
    hint_override=hint_override,
Contributor

Yeah, we should definitely keep hint_override here.

We should either make atomically_apply_size_hint available as an option in size_hints, or the other way around.

cc: @bobrenjc93 @pianpwk @ezyang for any thoughts
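
A purely hypothetical sketch of what such a unified entry point could look like; `size_hints_atomic` is an illustrative name rather than the existing Inductor API, and the assumed hint_override semantics (an explicit per-call override for symbolic expressions) may differ from what size_hints actually does:

```python
# Hypothetical sketch only: not the real Inductor API.
from typing import Optional, Sequence, Union

import sympy

def size_hints_atomic(
    sizevars,                                 # stand-in for V.graph.sizevars
    exprs: Sequence[Union[sympy.Expr, int]],  # e.g. node.get_stride()
    *,
    fallback: Optional[int] = None,
    hint_override: Optional[int] = None,
):
    hints = []
    for expr in exprs:
        if isinstance(expr, (int, sympy.Integer)):
            hints.append(int(expr))
        elif hint_override is not None:
            # Keep honoring an explicit override when the caller supplies one.
            hints.append(hint_override)
        else:
            # Otherwise take the atomic path so unbacked symbols inside
            # compound stride expressions are substituted consistently.
            hints.append(sizevars.atomically_apply_size_hint(expr, fallback=fallback))
    return tuple(hints)
```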

@sevenEng sevenEng force-pushed the q1l1/fix_autotune_unbacked_strides_example branch from 84b80e3 to 0d8f1f5 on October 14, 2025 16:46
@meta-codesync

meta-codesync bot commented Oct 14, 2025

@sevenEng has imported this pull request. If you are a Meta employee, you can view this in D83066085.

@sevenEng
Contributor Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk (Trigger trunk jobs on your pull request) label Oct 14, 2025
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@sevenEng sevenEng deleted the q1l1/fix_autotune_unbacked_strides_example branch October 14, 2025 20:26
zhudada0120 pushed a commit to zhudada0120/pytorch that referenced this pull request Oct 15, 2025
…#163660)

Pull Request resolved: pytorch#163660
Approved by: https://github.com/ColinPeppler
Chao1Han pushed a commit to Chao1Han/pytorch that referenced this pull request Oct 21, 2025
…#163660)

Pull Request resolved: pytorch#163660
Approved by: https://github.com/ColinPeppler
