CoreMLGPT2 crashing on iPhone X #3
Comments
iPhone X "only" has 3 GB RAM and the OS pretty aggressively kills your app if it goes above 1.7 GB used memory. Your best bet is to run the model on either an iPhone XS (4 GB RAM) or an iPad Pro (up to 6 GB RAM). |
Makes sense. Thanks! Wondering if we can train a lightweight or dumber version of the GPT2 model that can run on older devices like the X.
Yes – quantization could work here, see #1. It's also built into Core ML, so it shouldn't be too hard to try: https://developer.apple.com/documentation/coreml/reducing_the_size_of_your_core_ml_app
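For reference, a minimal sketch of what half-precision weight quantization with coremltools might look like (this assumes the coremltools 3.x quantization utilities discussed in this thread; the `.mlmodel` filenames are hypothetical):

```python
# Minimal sketch of Core ML weight quantization with coremltools
# (coremltools 3.x API). The .mlmodel filenames below are hypothetical.
import coremltools
from coremltools.models.neural_network import quantization_utils

# Load the converted GPT-2 Core ML model.
model = coremltools.models.MLModel("gpt2.mlmodel")

# Quantize weights to half precision (16 bits). Smaller nbits values
# (8, 4, ...) shrink the model further at some cost in output quality.
quantized = quantization_utils.quantize_weights(model, nbits=16)

# On macOS quantize_weights returns an MLModel; on other platforms
# (e.g. a Colab runtime) it returns the raw model spec instead.
if isinstance(quantized, coremltools.models.MLModel):
    quantized.save("gpt2-fp16.mlmodel")
else:
    coremltools.utils.save_spec(quantized, "gpt2-fp16.mlmodel")
```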
Thanks @julien-c. I attempted to quantize this to half precision as suggested by the Apple docs. I'm getting this error, which might or might not be related to the model itself -
Refer to this Colab notebook to try it out: https://colab.research.google.com/drive/1QC90lE-LUDEUXMUGer-5HHLLFRG6dF7F
Hmm, I think your version of coremltools is way too old :) Also related to this issue, I believe that @LysandreJik converted a smaller version of GPT2.
You are right. But I only managed to get a different error with the latest version of coremltools (3.0b4).
Is the smaller model available in the Resources folder of this project? I will check it out.
Message from debugger: Terminated due to memory issue
Is the GPT2-512 model working better on newer devices?