Skip to content

PDF OCR Enhancer is a powerful FastAPI-based web service that combines the strengths of EasyOCR and PDFMiner to provide accurate and enhanced text extraction from PDF documents. This project offers a unique approach to OCR by allowing users to specify keywords, which are then used to improve the accuracy of the extracted text.

License

Notifications You must be signed in to change notification settings

nasser-mallouli/PDF-OCR-Enhancer

Repository files navigation

PDF-OCR-Enhancer

PDF OCR Enhancer is a powerful FastAPI-based web service that combines the strengths of EasyOCR and PDFMiner to provide accurate and enhanced text extraction from PDF documents. This project offers a unique approach to OCR by allowing users to specify keywords, which are then used to improve the accuracy of the extracted text.

Key features:

  • PDF text extraction using EasyOCR
  • Enhanced accuracy with PDFMiner integration
  • Keyword-based text replacement for improved results
  • FastAPI web service for easy integration
  • Supports multiple languages (default: English and German)

About

PDF OCR Enhancer is a powerful FastAPI-based web service that combines the strengths of EasyOCR and PDFMiner to provide accurate and enhanced text extraction from PDF documents. This project offers a unique approach to OCR by allowing users to specify keywords, which are then used to improve the accuracy of the extracted text.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published