Skip to content

[ROCm][CI] Add docker caching for MI250 runners#169300

Closed
jithunnair-amd wants to merge 2 commits intomainfrom
jithunnair-amd-patch-9
Closed

[ROCm][CI] Add docker caching for MI250 runners#169300
jithunnair-amd wants to merge 2 commits intomainfrom
jithunnair-amd-patch-9

Conversation

@jithunnair-amd
Copy link
Collaborator

@jithunnair-amd jithunnair-amd commented Dec 1, 2025

@pytorch-bot
Copy link

pytorch-bot bot commented Dec 1, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/169300

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit c5c6145 with merge base 481e5ab (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added ciflow/rocm Trigger "default" config CI on ROCm module: rocm AMD GPU support for Pytorch topic: not user facing topic category labels Dec 1, 2025
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR extends ROCm CI infrastructure to enable docker image caching on MI250 GPU runners, matching the existing functionality available for gfx942 runners. The changes add the new linux.rocm.mi250.docker-cache runner to both the workflow matrix and the actionlint configuration file.

Key Changes

  • Added MI250 docker-cache runner to the workflow matrix for parallel caching across both MI250 and gfx942 hardware
  • Registered the new runner label in actionlint configuration to prevent linting errors

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File Description
.github/workflows/docker-cache-rocm.yml Added linux.rocm.mi250.docker-cache to the runner matrix alongside the existing gfx942 docker-cache runner
.github/actionlint.yaml Registered linux.rocm.mi250.docker-cache in the self-hosted runner labels list under the MI2xx runners section

@jithunnair-amd
Copy link
Collaborator Author

@pytorchbot merge -f "Tested via manual trigger of caching workflow. Doesn't impact any other workflows"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

JacobSzwejbka pushed a commit that referenced this pull request Dec 8, 2025
Enables docker image caching for MI250 CI runners.

Tested via: https://github.com/pytorch/pytorch/actions/runs/19841041285

Pull Request resolved: #169300
Approved by: https://github.com/jeffdaily
@jithunnair-amd jithunnair-amd deleted the jithunnair-amd-patch-9 branch December 18, 2025 20:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/rocm Trigger "default" config CI on ROCm Merged module: rocm AMD GPU support for Pytorch open source topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants