Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any plan for pretraining? #2

Open
jbmaxwell opened this issue Jul 27, 2019 · 4 comments
Open

Any plan for pretraining? #2

jbmaxwell opened this issue Jul 27, 2019 · 4 comments

Comments

@jbmaxwell
Copy link

I'm curious whether there's any plan to support pretrianing models from scratch?

@julien-c
Copy link
Member

So, one way you could go about it, would be to use pytorch-transformers to pre-train or fine-tune your model, then use the script in model_generation as a starting point to convert to CoreML.

We do not have any plan to experiment with training on device, because realistically those models are way too large to be trained on anything other than a cutting-edge GPU 🙃

@jbmaxwell
Copy link
Author

Excellent, thanks for the reply! I wondered about using gpt2.py as a starting point... Of course, I wouldn't expect to train one of these beasts on the device! That would be madness! Maybe one day... haha...
Just curious; would a model generated in this way be iOS 13+ only, or are the basic layer/objects compatible with iOS 12?

@julien-c
Copy link
Member

julien-c commented Jul 29, 2019

Check out this tweet: https://twitter.com/julien_c/status/1154894146328563715

Short answer: iOS 13+ only

@jbmaxwell
Copy link
Author

Ah, too bad, but not surprising. The new hotness is the new hotness for a reason! ;-)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants