Model didn't converge. #2
Comments
Dear @rAm1n,
Hi @ahkarami, thanks for pointing out the issue with the ResNet version. I am aware of it, but unfortunately I had no luck getting any number better than 65% on LFW. Regardless of the encoder network, something around 90%+ is definitely achievable with triplet loss. I think the link that you shared is an implementation of this paper, which is a bit different from FaceNet. I've stopped working on it for a while, but I recommend this paper to you: How to Train Triplet Networks with 100K Identities? Also, if you are really interested in embeddings and in solving face verification in an open-set configuration, make sure to have a look at recent work based on angular loss: insightface, sphereface.
Dear @rAm1n,
Hi @rAm1n,
Hi @magwyz, I didn't really continue working on this. If you really want to make this work, maybe start with a softmax version and then fine-tune it using triplet loss. Re-implementing the triplet selection from the TensorFlow repository might also help. And don't forget to play with the learning rate. I would guess it will take time to converge, and most probably the loss will drop rapidly after a few hours of training.
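For readers who want to experiment with the triplet selection mentioned above, here is a minimal sketch of semi-hard negative selection as described in the FaceNet paper: for each anchor-positive pair, keep negatives that are farther from the anchor than the positive but still inside the margin. This is not a port of the TensorFlow repository's code; the function names, the pure-Python distance helper, and the `margin=0.2` default are assumptions made for illustration.

```python
import random


def squared_distance(u, v):
    """Squared Euclidean distance between two embedding vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def select_semi_hard_triplets(embeddings, labels, margin=0.2, rng=None):
    """Return (anchor, positive, negative) index triplets.

    A negative counts as semi-hard when
        d(anchor, positive) < d(anchor, negative) < d(anchor, positive) + margin
    with d the squared Euclidean distance. The margin value is an assumption.
    """
    rng = rng or random.Random()
    n = len(embeddings)
    triplets = []
    for a in range(n):
        for p in range(n):
            # A positive must be a different sample with the same identity.
            if p == a or labels[p] != labels[a]:
                continue
            d_ap = squared_distance(embeddings[a], embeddings[p])
            candidates = [
                neg
                for neg in range(n)
                if labels[neg] != labels[a]
                and d_ap < squared_distance(embeddings[a], embeddings[neg]) < d_ap + margin
            ]
            # Pairs with no semi-hard negative contribute no triplet.
            if candidates:
                triplets.append((a, p, rng.choice(candidates)))
    return triplets
```

The nested loops are quadratic in the batch size, so in practice this would be vectorized over a distance matrix; the loop form is only meant to make the selection rule easy to read.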
Thanks @rAm1n for the hints!
Hi rAm1n, I achieved 90% accuracy on the LFW dataset with https://github.com/tbmoon/facenet. If you are interested in my code, don't hesitate to refer to it.
Hi @liorshk
Thanks for sharing your code. It seems clean and well written; however, I had problems getting it to converge.
I trained it on a filtered version of MsCeleb with 5 million images and 79K identities. Your hyper-parameters seem to be identical to those of the TensorFlow implementation davidsandberg/facenet. I also tried different ones, but I never got more than 65% accuracy on LFW.
I think it's mostly because of the way triplet selection has been implemented. The paper suggests building batches of 1800 images from a certain number of identities (40-45), rather than choosing them completely at random. I tried this, but only with 180 images at most, and it still didn't converge.
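The identity-grouped batching described above could be sketched roughly as follows: pick a number of identities first, then draw several images from each, so every batch contains enough same-identity pairs to form triplets. The function name and the defaults (45 identities with 40 images each, giving 1800 images) are assumptions chosen to match the numbers quoted above, not values taken from any repository.

```python
import random
from collections import defaultdict


def identity_balanced_batch(labels, num_identities=45, images_per_identity=40, rng=None):
    """Return dataset indices for one batch grouped by identity.

    labels[i] is the identity of image i. Identities with fewer than two
    images are skipped because they cannot form an anchor-positive pair.
    """
    rng = rng or random.Random()

    # Group image indices by identity.
    by_identity = defaultdict(list)
    for index, label in enumerate(labels):
        by_identity[label].append(index)

    eligible = [ident for ident, idxs in by_identity.items() if len(idxs) >= 2]
    chosen = rng.sample(eligible, min(num_identities, len(eligible)))

    batch = []
    for ident in chosen:
        idxs = by_identity[ident]
        # Take fewer images when an identity does not have enough of them.
        batch.extend(rng.sample(idxs, min(images_per_identity, len(idxs))))
    return batch
```

With a smaller memory budget, something like `identity_balanced_batch(labels, num_identities=18, images_per_identity=10)` would give the 180-image variant, at the cost of fewer candidate negatives per anchor.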
Do you have any idea that can help me? If you had any success training the model, could you please share your weights too?
Thanks,