Skip to content

[ROCm][CI] Add distributed testing back to trunk.yml#166915

Closed
jithunnair-amd wants to merge 1 commit intomainfrom
jithunnair-amd-patch-1
Closed

[ROCm][CI] Add distributed testing back to trunk.yml#166915
jithunnair-amd wants to merge 1 commit intomainfrom
jithunnair-amd-patch-1

Conversation

@jithunnair-amd
Copy link
Collaborator

@jithunnair-amd jithunnair-amd commented Nov 4, 2025

Adding distributed testing back to trunk since we have been observing reasonable queueing based on current MI3xx capacity.

Partially addresses #166108.

cc @jeffdaily @sunway513 @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd

@pytorch-bot
Copy link

pytorch-bot bot commented Nov 4, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166915

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

⏳ No Failures, 6 Pending

As of commit 95e2fd0 with merge base 79ff2c6 (image):
💚 Looks good so far! There are no failures yet. 💚

UNSTABLE - The following jobs are marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added ciflow/rocm Trigger "default" config CI on ROCm module: rocm AMD GPU support for Pytorch topic: not user facing topic category labels Nov 4, 2025
@jithunnair-amd jithunnair-amd added ciflow/trunk Trigger trunk jobs on your pull request and removed ciflow/rocm Trigger "default" config CI on ROCm labels Nov 4, 2025
@jithunnair-amd
Copy link
Collaborator Author

Successfully launched a 4-GPU MI3xx runner: https://github.com/pytorch/pytorch/actions/runs/19053940277/job/54421571731?pr=166915

@jithunnair-amd jithunnair-amd marked this pull request as ready for review November 4, 2025 01:47
@jithunnair-amd jithunnair-amd requested a review from a team as a code owner November 4, 2025 01:47
@pytorch-bot pytorch-bot bot added the ciflow/rocm Trigger "default" config CI on ROCm label Nov 4, 2025
@jeffdaily
Copy link
Collaborator

@pytorchbot merge -f "lint good, added ROCm CI dist shard is running"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/rocm Trigger "default" config CI on ROCm ciflow/trunk Trigger trunk jobs on your pull request Merged module: rocm AMD GPU support for Pytorch open source topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants