[ROCm][inductor] heuristic improvements for reduction kernels#161280
[ROCm][inductor] heuristic improvements for reduction kernels#161280naromero77amd wants to merge 8 commits intopytorch:mainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161280
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New Failure, 1 Unrelated FailureAs of commit bb52a1d with merge base d74f9ec ( NEW FAILURE - The following job has failed:
FLAKY - The following job failed but was likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
To add the ciflow label This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows. |
|
To add the ciflow label This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows. |
d08dd22 to
9b6ed63
Compare
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Rebase failed due to Command Raised by https://github.com/pytorch/pytorch/actions/runs/17948121352 |
645f758 to
4eda531
Compare
3fd021b to
1eb723a
Compare
1eb723a to
bb52a1d
Compare
|
Resolved conflict and will try to merge. |
|
@pytorchbot merge |
|
@pytorchbot rebase |
Improvements to reduction kernel heuristics for MI350. Contributions from several members of the AMD Inductor and Triton teams: @jataylo @iupaikov-amd @AmdSampsa @xiaohuguo2023 **Duplicate of this PR:** #161280 (which has already been approved multiple times, but we are unable to merge due to some Meta Internal Check that cannot be cleared). Pull Request resolved: #170931 Approved by: https://github.com/jeffdaily
|
Duplicate PR here was landed: #170931 FWIW, I think the real issue might have been the pytorchbot's token not confirming to AMD security policy. |
Improvements to reduction kernel heuristics for MI350. Contributions from several members of the AMD Inductor and Triton teams: @jataylo @iupaikov-amd @AmdSampsa @xiaohuguo2023 **Duplicate of this PR:** #161280 (which has already been approved multiple times, but we are unable to merge due to some Meta Internal Check that cannot be cleared). Pull Request resolved: #170931 Approved by: https://github.com/jeffdaily
…h#170931) Improvements to reduction kernel heuristics for MI350. Contributions from several members of the AMD Inductor and Triton teams: @jataylo @iupaikov-amd @AmdSampsa @xiaohuguo2023 **Duplicate of this PR:** pytorch#161280 (which has already been approved multiple times, but we are unable to merge due to some Meta Internal Check that cannot be cleared). Pull Request resolved: pytorch#170931 Approved by: https://github.com/jeffdaily
…h#170931) Improvements to reduction kernel heuristics for MI350. Contributions from several members of the AMD Inductor and Triton teams: @jataylo @iupaikov-amd @AmdSampsa @xiaohuguo2023 **Duplicate of this PR:** pytorch#161280 (which has already been approved multiple times, but we are unable to merge due to some Meta Internal Check that cannot be cleared). Pull Request resolved: pytorch#170931 Approved by: https://github.com/jeffdaily (cherry picked from commit 5eceb87)
Improvements to reduction kernel heuristics for MI350.
Contributions from several members of the AMD Inductor and Triton teams: @jataylo @iupaikov-amd @AmdSampsa @xiaohuguo2023
cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @jataylo @hongxiayang @pragupta @jerrymannil @xinyazhang @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @mlazos @dllehr-amd @chenyang78