BetterOCR
π BetterOCR combines results from multiple OCR engines with an π§ LLM to correct & reconstruct the output.
Before | After (β¨ latest at v1.2.0) |
---|---|
- Improved English/Korean text recognition with new Pororo OCR support! π
- Special thanks to @black7375 for https://github.com/black7375/korean_ocr_using_pororo (where he suggested using EasyOCR for text detection and BrainOCR (Pororo's OCR module) for text recognition) and #2.
- Also kudos to the @kakaobrain team and @yunwoong7.
Notes
Pororo is used only if the language options (lang
) specified include either πΊπΈ English (en
) or π°π· Korean (ko
). Also additional dependencies listed in [tool.poetry.group.pororo.dependencies]
must be available. (If not, it'll automatically be excluded from enabled engines.)
What's Changed
- [ImgBot] Optimize images by @imgbot in #7
- Write parser tests by @junhoyeo in #9
- EasyPororoOCR Integration by @junhoyeo in #8
New Contributors
Full Changelog: v1.1.2...v1.2.0