Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

_prod_TT fails with CL_OUT_OF_RESOURCE on Mali-T628 #60

Open
psyhtest opened this issue Feb 10, 2017 · 1 comment
Open

_prod_TT fails with CL_OUT_OF_RESOURCE on Mali-T628 #60

psyhtest opened this issue Feb 10, 2017 · 1 comment

Comments

@psyhtest
Copy link

psyhtest commented Feb 10, 2017

After checking that Caffe no longer requests zero-sized OpenCL buffers (#59), I still encounter the same result on SqueezeNet 1.1:

      *** Aborted at 1486746813 (unix time) try "date -d @1486746813" if you are using GNU date ***
      PC: @ 0xb5c8798a mcl_entrypoints_valid_event_list
      *** SIGSEGV (@0x45ccc412) received by PID 29568 (TID 0xb01c6000) from PID 1171047442; stack trace: ***
          @ 0xb5774270 (unknown)

(GoogleNet behaves similarly to what I report for SqueezeNet, just doesn't segfault. AlexNet works fine.)

The first call that goes bad is:

  "call": "clEnqueueNDRangeKernel",
  "name": "_prod_TT",
  "queue": "0xd70c0",
  "kernel": "0x70f100",
  "gwo": [0, 0, 0],
  "gws": [16, 3200, 1],
  "lws": [8, 8, 1],
  "event_wait_list": [],
  "event": "0",
  "timestamp": {
   "start": "2017-02-10T16:55:05.482380",
   "end": "2017-02-10T16:55:05.523968"
  },
  "output profiling_error": -5,
  "profiling": {
   "queued": 532575944823,
   "submit": 532575944823,
   "start": 13079693137503322112,
   "end": 34359738377
  },
  "errcode": 0
 }

The local work size (8x8) should work for any kernel. I suspect the problem might be in the way the arguments are set, but don't have any other clue at the moment.

Please find attached a trace from dividiti's OpenCL profiler up to the failing call:
naibaf7-caffe-60.dvdt-prof.txt. Please see #59 on how to reproduce.

@tequilaguru
Copy link

I’m finding the same issue on a Vivante GC2000, did you ever find a possible cause/solution?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants