-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CLBlast support #32
CLBlast support #32
Conversation
This reverts commit a67b5a4.
/cc @CNugteren |
@psyhtest |
@psyhtest |
@naibaf7 Thanks! You may notice that in blocks dispatching calls into CLBlast I use different formatting and explicitly define some constants (e.g. Another thing is that even similar code blocks use different styles e.g.
|
@naibaf7 How this is different from autotuining code that you are writing. |
@bhack |
@naibaf7
I've implemented support for Cedric Nugteren's CLBlast library. The 0.6.0 version had a few issues but the most recent 0.7.0 version seems to have addressed them. In addition, 0.7.0 added support for xASUM which helped to keep integration clean.
I've tested this integration on the Samsung Chromebook 2 with the ARM Mali-T628 GPU and version v6.0 of the driver, skipping the known test failures (#28, #29, #30) that are currently opened for that platform.
Please review.