Skip to content

Conversation

@umar456
Copy link
Member

@umar456 umar456 commented Feb 21, 2020

Fixes a couple of issues with the CPU OpenCL blas functions. The batching functionality was not working correctly for righ hand side batches. The matmul function was throwing errors in case the same input was passed in. This was caused only on the Intel GPU OpenCL implementation because mapping the same buffer as a READ and WRITE mapped pointer caused an error. These issues have been fixed and the getMappedPtr is refactored to return unique_ptr instead of shared_ptr.

Fixes #1711

@umar456 umar456 force-pushed the opencl_cpu_batch_matmul_fix branch 3 times, most recently from d94382d to a4eb27a Compare February 21, 2020 08:13
The Intel OpenCL mapping the same buffer for write access caused an
error. This caused the matmul operation to fail when the same
array was passed in. To fix this only the READ flag is passed into
the map function instead of the READ and WRITE flags
The new getMappedPtr is now (correctly) returning a unique_ptr.
This commit removes implicit conversions from unique_ptr to
shared ptr.
@umar456 umar456 force-pushed the opencl_cpu_batch_matmul_fix branch from a4eb27a to 2356ec9 Compare February 21, 2020 08:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

matmulTN fails when the two parameters are the same array (Intel/OpenCL)

2 participants