[cuBLAS] update cuBLAS determinism docs, remove workspace requirement checks by eqy · Pull Request #161749 · pytorch/pytorch

eqy · 2025-08-28T23:49:08Z

Since CUDA 11.x (need to update the docs for this, current PR is saying 12.2 which is incorrect) we've been allocating cuBLAS workspaces explicitly per handle/stream combination #85447

According to the cuBLAS documentation, this appears to be sufficient for determinism without any explicit workspace requirements to e.g., :4096:8 or :16:8 as was previously expressed in PyTorch docs https://docs.nvidia.com/cuda/cublas/#results-reproducibility

Planning to add an explicit determinism test as well...

cc @ptrblck @msaroufim @jerryzh168 @csarofeen @xwang233 @mruberry @kurtamohler

pytorch-bot · 2025-08-28T23:49:12Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161749

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 896198b with merge base ac7b4e7 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ngimel · 2025-09-04T18:37:46Z

docs/source/notes/randomness.rst

             [ 0.0333, -1.1444]]], device='cuda:0')

-Furthermore, if you are using CUDA tensors, and your CUDA version is 10.2 or greater, you
+Furthermore, if you are using CUDA tensors, and your CUDA version is between 10.2 and 11.0 you


no longer supported, just remove this sentence

ngimel · 2025-09-10T21:54:50Z

Can this one be landed?

eqy · 2025-09-10T22:46:46Z

Sure, let's see CI signal after removing the old determinisic alert test

ngimel · 2025-09-11T18:55:09Z

Now there's a dtype error in _scaled_mm that shouldn't be related?

eqy · 2025-09-15T16:51:14Z

H100 _scaled_mm failure should be addressed by #162022
I think we're seeing it because I manually opted into ciflow/H100 here

ngimel · 2025-09-18T14:08:55Z

ciflow/H100 is still run on trunk (see on HUD), if it doesn't report existing failures that's a problem (and looks like it doesn't).

eqy · 2025-10-02T20:26:18Z

@pytorchmergebot merge

pytorchmergebot · 2025-10-02T20:28:04Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

… checks (pytorch#161749) Since CUDA 11.x (need to update the docs for this, current PR is saying 12.2 which is incorrect) we've been allocating cuBLAS workspaces explicitly per handle/stream combination pytorch#85447 According to the cuBLAS documentation, this appears to be sufficient for determinism without any explicit workspace requirements to e.g., `:4096:8` or `:16:8` as was previously expressed in PyTorch docs https://docs.nvidia.com/cuda/cublas/#results-reproducibility Planning to add an explicit determinism test as well... Pull Request resolved: pytorch#161749 Approved by: https://github.com/ngimel

eqy requested a review from syed-ahmed as a code owner August 28, 2025 23:49

eqy added module: cuda Related to torch.cuda, and CUDA support in general module: cublas Problem related to cublas support module: determinism open source release notes: cuda release notes category ciflow/trunk Trigger trunk jobs on your pull request ciflow/h100 labels Aug 28, 2025

zou3519 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Sep 3, 2025

zou3519 requested a review from ngimel September 3, 2025 16:31

ngimel approved these changes Sep 4, 2025

View reviewed changes

ngimel mentioned this pull request Sep 4, 2025

[AsyncTP] Fixes AsyncMM #162040

Closed

eqy added 4 commits October 2, 2025 16:58

check in

e2e70df

update

a83ced5

Update randomness.rst

3e384ac

delete

896198b

eqy force-pushed the cublasnowdeterministic branch from 3f484ec to 896198b Compare October 2, 2025 18:42

eqy requested a review from Aidyn-A as a code owner October 2, 2025 18:42

pytorchmergebot added the merging label Oct 2, 2025

pytorchmergebot added the Merged label Oct 3, 2025

pytorchmergebot closed this in f7082e9 Oct 3, 2025

pytorchmergebot removed the merging label Oct 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[cuBLAS] update cuBLAS determinism docs, remove workspace requirement checks#161749

[cuBLAS] update cuBLAS determinism docs, remove workspace requirement checks#161749
eqy wants to merge 4 commits intopytorch:mainfrom
eqy:cublasnowdeterministic

eqy commented Aug 28, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Aug 28, 2025 •

edited

Loading

Uh oh!

ngimel Sep 4, 2025

Uh oh!

ngimel commented Sep 10, 2025

Uh oh!

eqy commented Sep 10, 2025

Uh oh!

ngimel commented Sep 11, 2025

Uh oh!

eqy commented Sep 15, 2025

Uh oh!

ngimel commented Sep 18, 2025

Uh oh!

eqy commented Oct 2, 2025

Uh oh!

pytorchmergebot commented Oct 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

eqy commented Aug 28, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161749

✅ No Failures

Uh oh!

ngimel Sep 4, 2025

Choose a reason for hiding this comment

Uh oh!

ngimel commented Sep 10, 2025

Uh oh!

eqy commented Sep 10, 2025

Uh oh!

ngimel commented Sep 11, 2025

Uh oh!

eqy commented Sep 15, 2025

Uh oh!

ngimel commented Sep 18, 2025

Uh oh!

eqy commented Oct 2, 2025

Uh oh!

pytorchmergebot commented Oct 2, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

eqy commented Aug 28, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Aug 28, 2025 •

edited

Loading