[ROCm] OffsetCalc Unroll Optimization#161700
[ROCm] OffsetCalc Unroll Optimization#161700amd-hhashemi wants to merge 3 commits intopytorch:mainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/161700
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (1 Unrelated Failure)As of commit a190954 with merge base 5babb4d ( UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
| offsets[arg] = divmod.mod * strides_[0][arg]; | ||
| if (dims >= 2) { | ||
| divmod = sizes_[1].divmod(divmod.div); | ||
| #pragma unroll |
There was a problem hiding this comment.
Do you want to add a comment to link to the compiler issue?
There was a problem hiding this comment.
We're investigating what is exactly happening in compiler here. Will provide an issue number later when we've rootcaused.
cherry-pick of pytorch#161700 Our compiler is generating inefficient code for the offsetCalc in certain situations. The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf.
cherry-pick of pytorch#161700 Our compiler is generating inefficient code for the offsetCalc in certain situations. The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf. Fixes SWDEV-545713, SWDEV-545710
cherry-pick of pytorch#161700 Our compiler is generating inefficient code for the offsetCalc in certain situations. The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf. Fixes SWDEV-545713, SWDEV-545710
cherry-pick of pytorch#161700 Our compiler is generating inefficient code for the offsetCalc in certain situations. The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf. Fixes SWDEV-545713, SWDEV-545710
|
@pytorchbot rebase |
1 similar comment
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Successfully rebased |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
842d540 to
0f1927e
Compare
|
Successfully rebased |
0f1927e to
a190954
Compare
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Our compiler is generating inefficient code for the offsetCalc in certain situations. The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf. Pull Request resolved: pytorch#161700 Approved by: https://github.com/jeffdaily
Our compiler is generating inefficient code for the offsetCalc in certain situations. The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf. Pull Request resolved: pytorch#161700 Approved by: https://github.com/jeffdaily
Our compiler is generating inefficient code for the offsetCalc in certain situations. The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf. Pull Request resolved: pytorch#161700 Approved by: https://github.com/jeffdaily
Our compiler is generating inefficient code for the offsetCalc in certain situations. The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf. Pull Request resolved: pytorch#161700 Approved by: https://github.com/jeffdaily
Our compiler is generating inefficient code for the offsetCalc in certain situations.
The root-cause for this needs to be identified. For now specialized unrolling based on 'dims' notably helps perf.
cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd