Performance improvement for sgemm column major TN (transposeA = T, transposeB = N) case by TimmyLiu · Pull Request #54 · clMathLibraries/clBLAS

TimmyLiu · 2014-10-30T22:30:52Z

Even with the use of tuning tool, the current sgemm TN still pose a poor performance comparing to sgemm NN, sgemm NT and sgemm TT. This pull request propose a wrapper from sgemm TN to sgemm NN by doing the transposition of A in a separate kernel, so that the sgemm TN can benefit from the performance of sgemm NN.

Note that since a out-of-place transposition was implemented, an extra opencl buffer was created within this wrapper. This might be a issue for really big matrix sizes.

To enable this wrapper, one would need to set env CLBLAS_FAST_SGEMM_TN=1. The code was only tested on "Spectre", "Tahiti" and "Hawaii" devices. Thus, at the moment, if the environment variable was not set or if the hardware device is anything other than "Spectre", "Tahiti" and "Hawaii", the "old" kernel without transposition will be called.

…N kernel by doing transpose separately

Performance improvement for sgemm column major TN (transposeA = T, transposeB = N) case

enable sgemm column major TN case to take advantage of faster sgemm N…

37aeff5

…N kernel by doing transpose separately

TimmyLiu pushed a commit that referenced this pull request Nov 6, 2014

Merge pull request #54 from TimmyLiu/develop

045ec55

Performance improvement for sgemm column major TN (transposeA = T, transposeB = N) case

TimmyLiu merged commit 045ec55 into clMathLibraries:develop Nov 6, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance improvement for sgemm column major TN (transposeA = T, transposeB = N) case #54

Performance improvement for sgemm column major TN (transposeA = T, transposeB = N) case #54
TimmyLiu merged 1 commit intoclMathLibraries:developfrom
TimmyLiu:develop

TimmyLiu commented Oct 30, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

TimmyLiu commented Oct 30, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant