@naromero77amd
Collaborator

@naromero77amd naromero77amd commented Oct 30, 2024

This PR increases the hipBLASLt default workspace size to 76 MB, which is the recommended default. It does not contain any bug fixes.

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang

@pytorch-bot

pytorch-bot bot commented Oct 30, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139300

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8f5e23e with merge base ad637a4:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@naromero77amd naromero77amd added the ciflow/rocm label (Trigger "default" config CI on ROCm) Oct 30, 2024
@naromero77amd naromero77amd changed the title from "Increase hipBLASLt default workspace size" to "[ROCM] Increase hipBLASLt default workspace size" Oct 30, 2024
@pytorch-bot pytorch-bot bot added the module: rocm label (AMD GPU support for Pytorch) Oct 30, 2024
@naromero77amd naromero77amd added the topic: not user facing label (topic category) Oct 30, 2024
@naromero77amd naromero77amd force-pushed the increase_hipblaslt_default_workspace_size branch from 74cb4ce to 8f5e23e Oct 30, 2024 19:26
@jeffdaily jeffdaily changed the title from "[ROCM] Increase hipBLASLt default workspace size" to "[ROCm] Increase hipBLASLt default workspace size" Oct 30, 2024
Collaborator

@eqy eqy left a comment

Approving but note that historically this setting is sticky as it would break e.g., Meta internal workloads that are on the edge of memory usage.

@jeffdaily
Collaborator

> Approving but note that historically this setting is sticky as it would break e.g., Meta internal workloads that are on the edge of memory usage.

Is going from 32 MB to 76 MB too much for the TunableOp case? For the non-tunable path, the 8 MB default was too small for adequate performance; it should also have been at least 32 MB. 76 MB is necessary for the highest-performing algorithms.
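
To put the numbers above side by side, here is a minimal Python sketch of the per-path increase. The sizes (8 MB, 32 MB, 76 MB) come from this thread. Everything else is an assumption for illustration: the helper names are hypothetical, 1 MB is treated as 1024 KiB, and the override variable `HIPBLASLT_WORKSPACE_SIZE` (assumed to be read in KiB) is not mentioned anywhere in this conversation, so check it against the PyTorch source before relying on it.

```python
# Old defaults (MB) per code path, as quoted in the comment above.
OLD_DEFAULTS_MB = {"tunableop": 32, "non_tunable": 8}
# New default (MB) proposed by this PR.
NEW_DEFAULT_MB = 76


def workspace_increase_mb(old_mb: int, new_mb: int = NEW_DEFAULT_MB) -> int:
    """Extra workspace memory (MB) requested once the default changes."""
    return new_mb - old_mb


def mb_to_kib(size_mb: int) -> int:
    """Convert MB to KiB, treating 1 MB as 1024 KiB (assumption)."""
    return size_mb * 1024


def override_env(size_mb: int) -> dict:
    """Hypothetical helper: environment entry that would pin the workspace size.

    Assumes the variable name and KiB units; neither is confirmed in this thread.
    """
    return {"HIPBLASLT_WORKSPACE_SIZE": str(mb_to_kib(size_mb))}


increases = {path: workspace_increase_mb(old) for path, old in OLD_DEFAULTS_MB.items()}
print(increases)         # {'tunableop': 44, 'non_tunable': 68}
print(override_env(32))  # {'HIPBLASLT_WORKSPACE_SIZE': '32768'}
```

Under these assumptions, a workload close to its memory limit could pass the result of `override_env(32)` into its process environment to keep the previous TunableOp size.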

@eqy
Collaborator

eqy commented Oct 30, 2024

We've had the 31MiB increase (1 -> 32 MiB) get reverted on the cuBLASLt side a few times, see e.g., #120925

@naromero77amd
Collaborator Author

> We've had the 31MiB increase (1 -> 32 MiB) get reverted on the cuBLASLt side a few times, see e.g., #120925

Is there a Meta workload that can be run before this is merged? I was planning on merging once the CI checks pass.

@naromero77amd
Collaborator Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk label (Trigger trunk jobs on your pull request) Oct 30, 2024
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
This PR increases hipBLASLt default workspace size to 76 MB which is the recommended default. This PR does not contain any bug fixes.

Pull Request resolved: pytorch#139300
Approved by: https://github.com/jeffdaily, https://github.com/eqy
@naromero77amd naromero77amd deleted the increase_hipblaslt_default_workspace_size branch October 29, 2025 22:35

Labels

ciflow/rocm (Trigger "default" config CI on ROCm)
ciflow/trunk (Trigger trunk jobs on your pull request)
Merged
module: rocm (AMD GPU support for Pytorch)
open source
topic: not user facing (topic category)

5 participants