
CLBlast support #32

Merged: 27 commits merged into naibaf7:master from psyhtest:CLBlast.support, May 13, 2016

Conversation

@psyhtest commented May 12, 2016

@naibaf7

I've implemented support for Cedric Nugteren's CLBlast library. The 0.6.0 version had a few issues, but the most recent 0.7.0 version seems to have addressed them. In addition, 0.7.0 added support for xASUM, which helped to keep the integration clean.
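
For reference, a host-side xASUM dispatch through CLBlast's C++ API looks roughly like the sketch below. This is illustrative only; the names (asum_on_device, bufX, bufSum) are mine, not the code in this PR, and the call pattern assumes the 0.7.0 C++ interface.

        #include <clblast.h>

        // Sketch of a host-side asum via CLBlast. CLBlast writes the result
        // of xASUM into a device buffer, so a one-element scratch buffer is
        // allocated and read back afterwards.
        float asum_on_device(const size_t N, const cl_mem bufX,
                             const size_t offX, cl_context ctx,
                             cl_command_queue queue) {
          cl_int err = CL_SUCCESS;
          cl_mem bufSum = clCreateBuffer(ctx, CL_MEM_READ_WRITE,
                                         sizeof(float), nullptr, &err);
          auto status = clblast::Asum<float>(
            N,
            bufSum, 0,       // result buffer and its offset
            bufX, offX, 1,   // input vector, offset, increment
            &queue);
          float result = 0.0f;
          if (status == clblast::StatusCode::kSuccess) {
            // Blocking read of the single-element result back to the host.
            clEnqueueReadBuffer(queue, bufSum, CL_TRUE, 0, sizeof(float),
                                &result, 0, nullptr, nullptr);
          }
          clReleaseMemObject(bufSum);
          return result;
        }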

I've tested this integration on the Samsung Chromebook 2 with the ARM Mali-T628 GPU and v6.0 of the driver, skipping the known test failures (#28, #29, #30) that are currently open for that platform.

Please review.

@bhack commented May 12, 2016

/cc @CNugteren

@naibaf7 (Owner) commented May 13, 2016

@psyhtest
Nice one, thanks. Will be reviewed over the weekend.

@naibaf7 merged commit a95a523 into naibaf7:master on May 13, 2016
@naibaf7 (Owner) commented May 13, 2016

@psyhtest
Merged this into my branch for now, for people who want to test the cutting edge.
I'll do some cleanup and make lint corrections before pushing it to the BVLC repository.

@psyhtest deleted the CLBlast.support branch on May 14, 2016 at 13:17
@psyhtest (Author) commented

@naibaf7 Thanks!

You may notice that in the blocks dispatching calls into CLBlast I use different formatting and explicitly define some constants (e.g. incX, offY). I believe the clBLAS code would benefit from this too, as it would make it more readable, but I understand if you need to follow an established Caffe style.
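
For example, the explicit-constants style looks roughly like this (an illustrative axpy dispatch with hypothetical names, not a literal excerpt from the PR):

        #include <clblast.h>

        // Illustrative only: the offsets and increments are named constants
        // rather than bare literals at the call site.
        void axpy_example(const size_t N, const float alpha, const cl_mem X,
                          cl_mem Y, cl_command_queue queue) {
          const size_t offX = 0;  // offset into X
          const size_t incX = 1;  // stride of X
          const size_t offY = 0;  // offset into Y
          const size_t incY = 1;  // stride of Y
          clblast::Axpy<float>(
            N, alpha,
            X, offX, incX,
            Y, offY, incY,
            &queue);
        }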

Another thing is that even similar code blocks use different styles, e.g.:

        clblast::Scal<float>(
          N, // uppercase
          alpha,
          x, offx, incx, // all lowercase
          &queue);

        clblast::Asum<float>(
          n, // lowercase
          Z, offZ,
          X, offX, incX, // uppercase X, mixed case offX and incX
          &queue);

@bhack commented May 14, 2016

@naibaf7 How is this different from the autotuning code that you are writing?

@naibaf7 (Owner) commented May 14, 2016

@bhack
Greentea LibDNN autotuning, you mean? There I attempt to autotune a fused kernel that does not need an intermediate convolution buffer.
CLBlast is an autotuned BLAS that can be tested against ViennaCL and clBLAS for regular GEMM convolutions.
And of course a BLAS is also needed for other auxiliary operations in the network.
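
For context, a regular GEMM convolution dispatched through a BLAS reduces to a single Gemm call over the im2col-transformed input. A minimal sketch with CLBlast's C++ API follows; the names, dimensions, and row-major layout are illustrative assumptions, not the actual Caffe code:

        #include <clblast.h>

        // Sketch: forward convolution as one GEMM over im2col data.
        // A: weights (M x K), B: im2col buffer (K x N), C: output (M x N).
        void conv_gemm(const size_t M, const size_t N, const size_t K,
                       const cl_mem weights, const cl_mem col_buffer,
                       cl_mem output, cl_command_queue queue) {
          auto status = clblast::Gemm<float>(
            clblast::Layout::kRowMajor,
            clblast::Transpose::kNo, clblast::Transpose::kNo,
            M, N, K,
            1.0f,
            weights, 0, K,     // A, offset, leading dimension
            col_buffer, 0, N,  // B
            0.0f,
            output, 0, N,      // C
            &queue);
          (void)status;  // clblast::StatusCode::kSuccess indicates success
        }

A fused approach like LibDNN, by contrast, avoids materializing col_buffer entirely.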
