[fx] Add strict argument validation to Interpreter.boxed_run #166784

Closed
meghendra6 wants to merge 4 commits into pytorch:main from meghendra6:fx-interpreter-boxed-run-strict-args

Conversation

@meghendra6 (Contributor)

Summary

This PR fixes an issue where torch.fx.Interpreter.boxed_run would silently ignore extra input arguments instead of validating the argument count.

Previously, boxed_run would only consume as many inputs as there were placeholder nodes and then clear the entire args_list, hiding potential bugs. This change introduces a strict check to ensure len(args_list) matches the number of placeholder nodes, raising a RuntimeError on a mismatch.
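
As an illustrative example of the behavior change (a minimal sketch assuming a simple two-input traced function; not code taken from the PR itself):

import torch
from torch import fx

def f(x, y):
    return x + y

interp = fx.Interpreter(fx.symbolic_trace(f))

args = [torch.randn(2), torch.randn(2), torch.randn(2)]  # one extra input
# Before this change: the extra tensor was silently dropped and args was cleared.
# After this change: RuntimeError, because 3 inputs were supplied for 2
# placeholders, and args is left intact for inspection.
interp.boxed_run(args)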

Fixes #166583.

Changes

  • Validate len(args_list) against the number of placeholder nodes at the beginning of boxed_run.
  • Raise a RuntimeError with a clear message ("extra arguments" or "missing arguments") if the counts do not match.
  • Move args_list.clear() so it only executes after successful validation and environment setup. If an error is raised, args_list is preserved for debugging (see the sketch after this list).
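
A rough sketch of the intended control flow inside boxed_run (illustrative only: self and args_list belong to the surrounding method, and the exact error wording differs in the real diff discussed below):

placeholder_nodes = [n for n in self.graph.nodes if n.op == "placeholder"]
if len(args_list) != len(placeholder_nodes):
    # Fail before touching args_list so the caller can still inspect its inputs.
    raise RuntimeError(
        f"Interpreter.boxed_run expected {len(placeholder_nodes)} arguments "
        f"but received {len(args_list)}"
    )
env = dict(zip(placeholder_nodes, args_list))
# Clear only after validation and environment setup have succeeded.
args_list.clear()
return self.run(initial_env=env)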

Testing

  • Added test_interpreter_boxed_run_argument_validation to test/test_fx.py.
  • This test covers three scenarios (a rough test sketch follows the list):
    1. Correct number of arguments (succeeds, args_list is cleared).
    2. Extra arguments (raises RuntimeError, args_list is preserved).
    3. Missing arguments (raises RuntimeError, args_list is preserved).
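
A rough sketch of how these three scenarios might be exercised (the class name and structure here are illustrative, not the exact test added to test/test_fx.py):

import torch
from torch import fx
from torch.testing._internal.common_utils import TestCase, run_tests

class BoxedRunValidationSketch(TestCase):
    def test_interpreter_boxed_run_argument_validation(self):
        def f(x, y):
            return x + y

        gm = fx.symbolic_trace(f)
        a, b, c = torch.randn(3), torch.randn(3), torch.randn(3)

        # 1. Correct number of arguments: runs and clears args_list.
        args = [a, b]
        out = fx.Interpreter(gm).boxed_run(args)
        self.assertEqual(out, a + b)
        self.assertEqual(len(args), 0)

        # 2. Extra arguments: raises RuntimeError, args_list preserved.
        args = [a, b, c]
        with self.assertRaises(RuntimeError):
            fx.Interpreter(gm).boxed_run(args)
        self.assertEqual(len(args), 3)

        # 3. Missing arguments: raises RuntimeError, args_list preserved.
        args = [a]
        with self.assertRaises(RuntimeError):
            fx.Interpreter(gm).boxed_run(args)
        self.assertEqual(len(args), 1)

if __name__ == "__main__":
    run_tests()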

User-facing impact / BC notes

This is a bug fix. Code that was incorrectly passing the wrong number of arguments to boxed_run will now fail fast with a RuntimeError instead of executing silently with unintended inputs. Correctly written code is unaffected.

cc @ezyang @EikanWang @jgong5 @wenzhe-nrv @chauhang @penguinwu @xmfan

@pytorch-bot

pytorch-bot bot commented Nov 1, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166784

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f157e8b with merge base 7b64ad9:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot bot added the release notes: fx (release notes category) label Nov 1, 2025
@linux-foundation-easycla

linux-foundation-easycla bot commented Nov 1, 2025

CLA Signed

The committers listed above are authorized under a signed CLA.

Comment on lines 223 to 237
placeholder_nodes = [n for n in self.graph.nodes if n.op == "placeholder"]
expected_args = len(placeholder_nodes)
actual_args = len(args_list)
if actual_args != expected_args:
    detail = (
        "extra arguments"
        if actual_args > expected_args
        else "missing arguments"
    )
    raise RuntimeError(
        f"Interpreter.boxed_run expected {expected_args} arguments "
        f"for placeholders but received {actual_args} ({detail})."
    )

env = {n: arg for n, arg in zip(placeholder_nodes, args_list)}

Member

This seems like a lot of changes, could we just add an assertion on line 228?

  args_iter = iter(args_list)
  env = {}
  for n in self.graph.nodes:
      if n.op == "placeholder":
          env[n] = next(args_iter)

+ assert len(args_list) == len(env), "<error message>"

@meghendra6 (Contributor, Author)

Thanks for the suggestion! I went ahead and simplified the code following your idea. The commit (4353144) keeps the original loop and adds a simple check for the arg count. Appreciate the quick feedback!

@ezyang (Contributor) left a comment

Yeah this seems overly complicated

@ezyang (Contributor) left a comment

I think I would prefer simplifying this more by collecting the list of nodes to assign first, and then doing the length check, and then assigning. This eliminates the StopIteration logic.
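
To make the StopIteration point concrete: with the iterator-driven loop, a too-short args_list would need handling along these lines to produce a readable error (a hypothetical sketch, not code from this PR), whereas collecting the placeholder nodes first replaces all of it with a single up-front length check:

args_iter = iter(args_list)
env = {}
for n in self.graph.nodes:
    if n.op == "placeholder":
        try:
            env[n] = next(args_iter)
        except StopIteration:
            raise RuntimeError(
                f"Interpreter.boxed_run ran out of arguments at placeholder {n.name}"
            ) from None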

@meghendra6 (Contributor, Author)

> I think I would prefer simplifying this more by collecting the list of nodes to assign first, and then doing the length check, and then assigning. This eliminates the StopIteration logic.

Good point, that makes sense. I’ve updated the code in 81c6beb to collect the placeholder nodes first and then do the length check before assigning. Thanks for the helpful suggestion!

@ezyang (Contributor)

ezyang commented Nov 3, 2025

@pytorchbot merge

@pytorch-bot bot added the ciflow/trunk (Trigger trunk jobs on your pull request) label Nov 3, 2025
@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@pytorchmergebot (Collaborator)

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team (raised by workflow job)

Failing merge rule: Core Maintainers

@pytorch-bot bot removed the ciflow/trunk (Trigger trunk jobs on your pull request) label Nov 3, 2025
@meghendra6 (Contributor, Author)

@ezyang I've checked the CI failures, and they appear to be unrelated to the changes in this PR.

The main issue seems to be in test_external_module_register_with_existing_backend, which is failing with:
torch._dynamo.exc.InternalTorchDynamoError: RuntimeError: Device 'maia' does not have a corresponding module registered as 'torch.maia'.

This looks like an infrastructure or environment issue, not caused by my code.

Could someone advise on how to proceed? Happy to rebase or try a re-run.

@xmfan (Member)

xmfan commented Nov 4, 2025

dynamo_wrapped contains the most important tests for this PR; could we rebase on viable/strict to let CI try again?

@ezyang (Contributor)

ezyang commented Nov 4, 2025

@pytorchbot merge -r

@pytorch-bot bot added the ciflow/trunk (Trigger trunk jobs on your pull request) label Nov 4, 2025
@pytorchmergebot (Collaborator)

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

Fixes issue where extra arguments were silently ignored, leading to
graphs running with incorrect inputs. Now raises clear RuntimeError
for both extra and missing arguments.

Also improves error messages for better debugging experience.
@pytorchmergebot (Collaborator)

Successfully rebased fx-interpreter-boxed-run-strict-args onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout fx-interpreter-boxed-run-strict-args && git pull --rebase)

@pytorchmergebot force-pushed the fx-interpreter-boxed-run-strict-args branch from 2ff84f6 to f157e8b on November 4, 2025 04:21
@pytorch-bot bot removed the ciflow/trunk (Trigger trunk jobs on your pull request) label Nov 4, 2025
@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@pytorchmergebot (Collaborator)

Merge failed

Reason: 3 mandatory check(s) failed. The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team (raised by workflow job)

Failing merge rule: Core Maintainers

@meghendra6 (Contributor, Author)

It looks like all checks are green, but the PR still shows that the merge failed. I'm a bit confused about what's causing the issue.

Any chance you could see what's going on?

@xmfan (Member)

xmfan commented Nov 5, 2025

@pytorchbot merge

@pytorch-bot bot added the ciflow/trunk (Trigger trunk jobs on your pull request) label Nov 5, 2025
@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

Labels

ciflow/trunk, fx, Merged, open source, release notes: fx

Projects

None yet

Development

Successfully merging this pull request may close this issue: fx.Interpreter.boxed_run silently ignores extra args (#166583)

6 participants