[ROCm][CI] Add distributed testing back to trunk.yml#166915
[ROCm][CI] Add distributed testing back to trunk.yml#166915jithunnair-amd wants to merge 1 commit intomainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166915
Note: Links to docs will display an error until the docs builds have been completed. ❗ 1 Active SEVsThere are 1 currently active SEVs. If your PR is affected, please view them below: ⏳ No Failures, 6 PendingAs of commit 95e2fd0 with merge base 79ff2c6 ( UNSTABLE - The following jobs are marked as unstable, possibly due to flakiness on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
Successfully launched a 4-GPU MI3xx runner: https://github.com/pytorch/pytorch/actions/runs/19053940277/job/54421571731?pr=166915 |
|
@pytorchbot merge -f "lint good, added ROCm CI dist shard is running" |
Merge startedYour change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Adding distributed testing back to trunk since we have been observing [reasonable queueing](https://hud.pytorch.org/queue_time_analysis?dateRange=30&startDate=2025-10-05T01%3A44%3A55.924Z&endDate=2025-11-04T01%3A44%3A55.925Z&granularity=week&chartType=bar&repos=pytorch%2Fpytorch&category=machine_type&machineTypes=linux.rocm.gpu.gfx942.1&items=linux.rocm.gpu.gfx942.1) based on current MI3xx capacity. Partially addresses #166108. Pull Request resolved: #166915 Approved by: https://github.com/jeffdaily
Adding distributed testing back to trunk since we have been observing reasonable queueing based on current MI3xx capacity.
Partially addresses #166108.
cc @jeffdaily @sunway513 @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd