[PP] Add optional argument to not save outputs by H-Huang · Pull Request #165822 · pytorch/pytorch

H-Huang · 2025-10-18T04:55:11Z

Stack from ghstack (oldest at bottom):

-> [PP] Add optional argument to not save outputs #165822

Fix #159251

Add an optional argument return_outputs to the schedule step

cc @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @pragupta @msaroufim @dcci

[ghstack-poisoned]

pytorch-bot · 2025-10-18T04:55:15Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/165822

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

AWS was down, GHA infrastructure effected / recovering

✅ No Failures

As of commit c03169d with merge base fe80f03 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 22fed43 Pull-Request: #165822

[ghstack-poisoned]

ghstack-source-id: 8f6556b Pull-Request: #165822

[ghstack-poisoned]

ghstack-source-id: 14a9209 Pull-Request: #165822

[ghstack-poisoned]

ghstack-source-id: e5343a8 Pull-Request: #165822

wconstab · 2025-10-20T22:32:24Z

test/distributed/pipelining/test_schedule_multiproc.py

+        if self.rank == self.world_size - 1:
+            output = schedule.step(target=target, losses=losses, return_outputs=False)
+        else:
+            schedule.step(x)


technically output is None for this line too, right?

yep that is also None (every rank without the last stage returns None)

wconstab

looks good to me.
testing that we actually do not save stuff seems (instead of just that we do not return stuff) seems safer in a paranoid sense, if you can think of a way to do that.

H-Huang · 2025-10-21T00:01:49Z

testing that we actually do not save stuff seems (instead of just that we do not return stuff) seems safer in a paranoid sense, if you can think of a way to do that.

Yeah agree there, maybe there can be a way to verify the memory usage doesn't increase significantly when increasing # of microbatches. I will think about it

H-Huang · 2025-10-21T00:02:11Z

@pytorchbot merge

pytorchmergebot · 2025-10-21T00:03:56Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Fix pytorch#159251 Add an optional argument `return_outputs` to the schedule `step` Pull Request resolved: pytorch#165822 Approved by: https://github.com/wconstab

Uses the API added in pytorch/pytorch#165822, since we do not return any output from PP step(). This allows us to release the memory earlier,

Update

70cb8e9

[ghstack-poisoned]

H-Huang added a commit that referenced this pull request Oct 18, 2025

[PP] Add optional argument to not save outputs

eada7fe

ghstack-source-id: 22fed43 Pull-Request: #165822

pytorch-bot bot added the oncall: distributed Add this issue/PR to distributed oncall triage queue label Oct 18, 2025

H-Huang added the release notes: distributed (pipeline) release notes category label Oct 18, 2025

Update

1f39d7d

[ghstack-poisoned]

H-Huang added a commit that referenced this pull request Oct 18, 2025

[PP] Add optional argument to not save outputs

fd5f0a3

ghstack-source-id: 8f6556b Pull-Request: #165822

H-Huang added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 18, 2025

H-Huang requested review from fegin, kwen2501, wconstab and wwwjn October 18, 2025 15:09

Update

4d21346

[ghstack-poisoned]

H-Huang added a commit that referenced this pull request Oct 18, 2025

[PP] Add optional argument to not save outputs

317a63b

ghstack-source-id: 14a9209 Pull-Request: #165822

Update

c03169d

[ghstack-poisoned]

H-Huang added a commit that referenced this pull request Oct 18, 2025

[PP] Add optional argument to not save outputs

4b274fa

ghstack-source-id: e5343a8 Pull-Request: #165822

wconstab reviewed Oct 20, 2025

View reviewed changes

wconstab approved these changes Oct 20, 2025

View reviewed changes

pytorchmergebot added the merging label Oct 21, 2025

pytorchmergebot added the Merged label Oct 21, 2025

pytorchmergebot closed this in b20deec Oct 21, 2025

pytorchmergebot removed the merging label Oct 21, 2025

H-Huang mentioned this pull request Oct 21, 2025

Update PP to release memory earlier pytorch/torchtitan#1922

Merged

H-Huang added a commit to pytorch/torchtitan that referenced this pull request Oct 22, 2025

Update PP to release memory earlier (#1922)

e5ef99a

Uses the API added in pytorch/pytorch#165822, since we do not return any output from PP step(). This allows us to release the memory earlier,

tianyu-l mentioned this pull request Oct 28, 2025

Break the tests/integration_tests/run_tests.py UT pytorch/torchtitan#1950

Closed

github-actions bot deleted the gh/H-Huang/227/head branch November 20, 2025 02:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PP] Add optional argument to not save outputs#165822

[PP] Add optional argument to not save outputs#165822
H-Huang wants to merge 4 commits intogh/H-Huang/227/basefrom
gh/H-Huang/227/head

H-Huang commented Oct 18, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Oct 18, 2025 •

edited

Loading

Uh oh!

wconstab Oct 20, 2025

Uh oh!

H-Huang Oct 20, 2025 •

edited

Loading

Uh oh!

wconstab left a comment

Uh oh!

H-Huang commented Oct 21, 2025

Uh oh!

H-Huang commented Oct 21, 2025

Uh oh!

pytorchmergebot commented Oct 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

H-Huang commented Oct 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/165822

❗ 1 Active SEVs

✅ No Failures

Uh oh!

wconstab Oct 20, 2025

Choose a reason for hiding this comment

Uh oh!

H-Huang Oct 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wconstab left a comment

Choose a reason for hiding this comment

Uh oh!

H-Huang commented Oct 21, 2025

Uh oh!

H-Huang commented Oct 21, 2025

Uh oh!

pytorchmergebot commented Oct 21, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

H-Huang commented Oct 18, 2025 •

edited

Loading

pytorch-bot bot commented Oct 18, 2025 •

edited

Loading

H-Huang Oct 20, 2025 •

edited

Loading