-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
European Dataset #26
Comments
Hi! Great to receive that feedback about the new model! I might release the full data in the future, it was trained with 41k plates.
When training the
That's a good question. I tried this architecture (which has attention blocks) to see if we could obtain good results without the rectifier network, i.e. STN. The dataset I used for the European plates angles were much more difficult than the Argentinian dataset I had, and the model I used for the Argentinian ones didn't satisfy me in terms of accuracy. It would be nice to try the STN module before the MobileViTv2 and see the results if it provides noticeable improvement. Having that said, I didn't have time to rework STN layer with Keras 3, but if you want to contribute that to the repo it would be great!
That seems like a nice idea. Did you apply the affine transformation before feeding the crop to an OCR? |
I'm not tied down to the MobileViTv2 architecture I used for the European plates dataset, I mainly used it because I wanted to try with attention blocks, to have more global context at an early stage of the network. Also chose it because it is available in I think for speed (which is a key criterion for me), it might be better to use a STN (rectifier) + ConvNet kind of network, like LPRNet paper does. I'm planning to release a datase for experiments, which should be representative with worldwide plates (let me know if it sounds interesting to you). This way we can test different networks with speed/accuracy in mind. Btw, feel free to contribute other architectures if you find them suitable for the task. |
Hello again! Amazing work with the new updates! I found the model trained on European license plates incredibly interesting and versatile! I even tested it on license plates from Mercosur and Mexico, and it performed very well! However, I’d like to gradually integrate more data to make it even more robust. I noticed you shared the validation dataset—do you plan to release the training dataset as well? Similar to what you did with Argentina, my idea is to generate synthetic data for Uruguay, for example.
On another note, I saw that you implemented object detection in https://github.com/ankandrew/open-image-models. In the past, we discussed how tilted license plates might pose a challenge for detection. With this new MobileViT v2 architecture you're suggesting, do you think it's necessary to correct that distortion before applying OCR? I ask because I’ve noticed that your model performs very well, even with distorted plates. On my end, I addressed that issue by determining the "pose" of the four corners of the plates and then applying an affine transformation to correct the distortion, which I implemented in YOLOv8.
The text was updated successfully, but these errors were encountered: