[Inductor] Make combo kernel MAX_NUM_ARGS configurable #166274

Closed

andyanwang wants to merge 1 commit into gh/andyanwang/40/base from gh/andyanwang/40/head

Conversation

@andyanwang (Contributor) commented Oct 26, 2025

@pytorch-bot (bot) commented Oct 26, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166274

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8e29b50 with merge base c7eee49:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot added the ciflow/trunk (Trigger trunk jobs on your pull request) label Oct 27, 2025
@facebook-github-bot (Contributor):

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

@pytorchmergebot (Collaborator):

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: Check the merge workflow status here

pytorchmergebot pushed a commit that referenced this pull request Oct 29, 2025
…uts (#166275)

MTIA Triton currently has a limit: it cannot support cases with too many input/output buffers. This PR adds that limit to prevent large fusions with many input/output buffers.

Differential Revision: [D85509351](https://our.internmc.facebook.com/intern/diff/D85509351/)

Pull Request resolved: #166275
Approved by: https://github.com/eellison
ghstack dependencies: #166274
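
For context, here is a minimal sketch of the kind of guard such a limit implies. The helper name, data shape, and threshold below are illustrative assumptions, not Inductor's actual implementation:

```python
# Illustrative sketch only: names, structure, and the threshold are
# assumptions, not the code from this PR stack.
from dataclasses import dataclass, field

MAX_NUM_ARGS = 250  # hypothetical cap on a combo kernel's combined args

@dataclass
class SubKernel:
    inputs: list = field(default_factory=list)
    outputs: list = field(default_factory=list)

def fits_arg_limit(subkernels: list[SubKernel]) -> bool:
    """Reject a combo-kernel candidate whose fused subkernels would
    together pass more input/output buffers than the backend supports."""
    total_args = sum(len(k.inputs) + len(k.outputs) for k in subkernels)
    return total_args <= MAX_NUM_ARGS
```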
tianrengao pushed a commit that referenced this pull request Oct 30, 2025
ComboKernel's MAX_NUM_ARGS is currently a fixed number. We need to tune it to avoid large fusions on MTIA, so this PR makes it configurable.

Differential Revision: [D85509352](https://our.internmc.facebook.com/intern/diff/D85509352/)

Pull Request resolved: #166274
Approved by: https://github.com/eellison
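
Assuming the new cap is exposed through `torch._inductor.config` (the knob name below is a guess; check the merged diff for the real config path), tuning it for MTIA might look like:

```python
import torch
import torch._inductor.config as inductor_config

# "combo_kernels" is an existing Inductor flag that enables combo kernel
# fusion; "combo_kernel_max_num_args" is a hypothetical name for the knob
# this PR introduces to replace the hard-coded ComboKernel.MAX_NUM_ARGS.
inductor_config.combo_kernels = True
inductor_config.combo_kernel_max_num_args = 64  # assumed knob name

@torch.compile
def fused(a, b, c):
    return a.sin() + b.cos() + c.tanh()
```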
BoyuanFeng pushed a commit that referenced this pull request Oct 31, 2025
…uts (#166275)

MTIA Triton currently has a limit: it cannot support cases with too many input/output buffers. This PR adds that limit to prevent large fusions with many input/output buffers.

Differential Revision: [D85509351](https://our.internmc.facebook.com/intern/diff/D85509351/)

Pull Request resolved: #166275
Approved by: https://github.com/eellison
ghstack dependencies: #166274
Khanaksahu pushed a commit to Khanaksahu/pytorch-fork that referenced this pull request Nov 17, 2025
ComboKernel's MAX_NUM_ARGS is currently a fixed number. We need to tune it to avoid large fusions on MTIA, so this PR makes it configurable.

Pull Request resolved: pytorch/pytorch#166274

ghstack-source-id: 318804069
@exported-using-ghexport

Differential Revision: [D85509352](https://our.internmc.facebook.com/intern/diff/D85509352/)
