[PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths #164501
banitag1 wants to merge 1 commit into pytorch:main
Conversation
[PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths

Summary: This diff adds support for allocating a large pinned memory segment upfront, sized by the provided config. This large segment is then used to serve all small pinned memory requests, avoiding expensive device-level APIs (the slow paths).

Example:
PYTORCH_CUDA_ALLOC_CONF=pinned_reserve_segment_size_mb:2048

This reserves a 2 GB pinned memory segment for the process; all incoming small requests are then served from this segment, and no cudaHostAlloc/cudaHostRegister APIs are called.

Differential Revision: D83779074
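As a quick illustration of how this config might be used from Python (a sketch for orientation, not code from this PR; the config key comes from the summary above, and it is assumed the setting must be in place before the first pinned allocation):

```python
import os

# Reserve a 2 GB pinned host segment up front (config key taken from this PR's summary).
# Assumption: the allocator reads PYTORCH_CUDA_ALLOC_CONF before the first pinned
# allocation, so the variable is set before importing torch.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "pinned_reserve_segment_size_mb:2048"

import torch

# Small pinned (page-locked) host tensors like this are the requests the reserved
# segment is meant to serve without extra cudaHostAlloc/cudaHostRegister calls.
staging = torch.empty(1024, 1024, dtype=torch.float32, pin_memory=True)

# Pinned host memory enables asynchronous host-to-device copies.
gpu_tensor = staging.to("cuda", non_blocking=True)
```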
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/164501
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 19eb2d8 with merge base 6b79701.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot label "release notes: cuda"
@pytorchbot merge (Initiating merge automatically since Phabricator Diff has merged)
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed
Reason: 1 job has failed; the first few are: trunk / macos-py3-arm64 / build
Details for Dev Infra team: raised by workflow job.
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
[PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths (pytorch#164501)

Summary: This diff adds support for allocating a large pinned memory segment upfront, sized by the provided config. This large segment is then used to serve all small pinned memory requests, avoiding expensive device-level APIs (the slow paths).

Example:
PYTORCH_CUDA_ALLOC_CONF=pinned_reserve_segment_size_mb:2048

This reserves a 2 GB pinned memory segment for the process; all incoming small requests are then served from this segment, and no cudaHostAlloc/cudaHostRegister APIs are called.

Differential Revision: D83779074
Pull Request resolved: pytorch#164501
Approved by: https://github.com/yangw-dev
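Conceptually, the reserved segment is one big pre-pinned slab from which small requests are sub-allocated, with anything that does not fit falling back to the regular path. A minimal Python sketch of that idea (illustration only; the class name, bump-pointer strategy, and 256-byte alignment are assumptions, not the PR's actual C++ implementation):

```python
# Illustrative sketch of serving small pinned requests from one reserved slab.
class ReservedPinnedSegment:
    def __init__(self, size_bytes: int) -> None:
        self.size = size_bytes
        self.offset = 0  # simple bump pointer; a real allocator would also recycle freed blocks

    def try_alloc(self, nbytes: int, alignment: int = 256):
        start = (self.offset + alignment - 1) // alignment * alignment  # align up
        if start + nbytes > self.size:
            return None  # does not fit: caller falls back to cudaHostAlloc/cudaHostRegister
        self.offset = start + nbytes
        return start  # offset into the pre-pinned slab; no device-level API call needed


segment = ReservedPinnedSegment(2048 * 1024 * 1024)  # 2 GB, as in the example config
print(segment.try_alloc(4096))     # small request: served from the reserved segment
print(segment.try_alloc(4 << 30))  # 4 GB request: None, so the regular slow path is used
```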