[BugFix] chunk_size should always be int64_t #165971

Closed
lingebeng wants to merge 4 commits into pytorch:main from
lingebeng:linhaifeng/bug_fix/Integer-overflow

Conversation

@lingebeng
Contributor

@lingebeng lingebeng commented Oct 21, 2025

Inspired by #156872
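
For context, a minimal sketch (not code from this PR) of the overflow class the title refers to: element counts above 2**31 - 1 no longer fit in a 32-bit chunk_size and silently wrap.

numel = 27000008 * 192          # elements in the repro discussed below
wrapped = numel & 0xFFFFFFFF    # the low 32 bits a 32-bit counter would keep
print(numel, wrapped)           # 5184001536 889034240 -- silently wrong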

@pytorch-bot

pytorch-bot bot commented Oct 21, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/165971

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 4d17f59 with merge base 03f3f78:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot bot added the release notes: cuda label Oct 21, 2025
@cyyever
Collaborator

cyyever commented Oct 21, 2025

Since it is too expensive to create large dense tensors, is it possible to create large sparse tensors to test the ops?
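
A minimal sketch of that idea, with hypothetical sizes: a sparse COO tensor can have a logical shape far beyond INT32_MAX elements while only materializing a few nonzeros, so it is cheap to construct.

import torch

# Two nonzeros inside a logical shape of 2**32 x 2 elements (hypothetical sizes).
indices = torch.tensor([[0, 2**31], [0, 1]])
values = torch.tensor([1.0, 2.0])
big = torch.sparse_coo_tensor(indices, values, size=(2**32, 2))
print(big.shape, big._nnz())  # torch.Size([4294967296, 2]) 2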

@lingebeng changed the title from "[BugFix] chunk_size should always be int64_t for Foreach functors" to "[BugFix] chunk_size should always be int64_t" Oct 21, 2025
@lingebeng
Contributor Author

Of course!

@lingebeng
Contributor Author

import torch
from torch.optim import Adagrad


def test_torch_adagrad():
    # 27000008 * 192 = 5,184,001,536 elements, well past INT32_MAX (2**31 - 1)
    num_params = 27000008
    param_size = 192
    param = torch.randn(num_params, param_size, device="cuda", dtype=torch.float32, requires_grad=True)
    grad = torch.randn_like(param) * 0.01
    param.grad = grad
    optimizer = Adagrad([param], lr=0.01)
    optimizer.step()
    torch.cuda.synchronize()


if __name__ == "__main__":
    test_torch_adagrad()

I am so sorry, I cannot run the code on my GPU; could you run it? @cyyever

torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 19.31 GiB. GPU 0 has a total capacity of 31.74 GiB of which 12.12 GiB is free. Including non-PyTorch memory, this process has 19.61 GiB memory in use. Of the allocated memory 19.31 GiB is allocated by PyTorch, and 0 bytes is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation.  See documentation for Memory Management  (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables)

@cyyever
Collaborator

cyyever commented Oct 21, 2025

@lingebeng

import torch
from torch.optim import Adagrad


def test_torch_adagrad():
    # 27000008 * 150 = 4,050,001,200 elements, still above INT32_MAX,
    # while bfloat16 halves the per-tensor memory cost
    num_params = 27000008
    param_size = 150
    param = torch.randn(num_params, param_size, device="cuda:1", dtype=torch.bfloat16, requires_grad=True)
    grad = torch.randn_like(param) * 0.01
    param.grad = grad
    optimizer = Adagrad([param], lr=0.01)
    optimizer.step()
    torch.cuda.synchronize()


if __name__ == "__main__":
    test_torch_adagrad()

This one allocates less than 40 GB, but no error is raised.
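
For reference, a quick back-of-the-envelope check (not from the PR) that this repro does cross the 32-bit element boundary, and that the memory figure is plausible:

INT32_MAX = 2**31 - 1
numel = 27000008 * 150        # 4050001200 elements in the bfloat16 tensor
print(numel > INT32_MAX)      # True: past the int32 range
# param + grad + Adagrad's accumulator, roughly 2 bytes each in bfloat16:
print(3 * numel * 2 / 1e9)    # ~24.3 GB, consistent with "less than 40 GB"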

@lingebeng
Contributor Author

Thanks, I see. Maybe it's not a bug!

@cyyever
Collaborator

cyyever commented Oct 21, 2025

@lingebeng We can have further chats via e-mail.

@lingebeng
Contributor Author

OK, I have contacted you!

@cyyever requested a review from albanD October 21, 2025 16:05
Collaborator

@albanD left a comment

@cyyever my layman understanding is that int is int64_t on Linux/Mac but int32_t on Windows. So I would only expect to see this fail on Windows.

Generally, we do want to make these types explicit to avoid Windows-only issues.
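
For what it's worth, under the common data models the platform-dependent width is usually long rather than int: LP64 Linux/macOS make long 64-bit, LLP64 Windows keeps it 32-bit, and int is 32-bit on all three. A quick way to check from Python:

import ctypes

print(ctypes.sizeof(ctypes.c_int))    # 4 on Linux, macOS, and Windows
print(ctypes.sizeof(ctypes.c_long))   # 8 on Linux/macOS (LP64), 4 on Windows (LLP64)
print(ctypes.sizeof(ctypes.c_int64))  # 8 everywhere, hence the explicit int64_t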

@Skylion007
Collaborator

@pytorchbot merge

@pytorch-bot bot added the ciflow/trunk label Oct 21, 2025
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

@cyyever
Collaborator

cyyever commented Oct 22, 2025

@albanD Yes, MSVC still recognises int as int32_t, likely for Win32 compatibility.

zhudada0120 pushed a commit to zhudada0120/pytorch that referenced this pull request Oct 22, 2025