Google-OCR

Setup with pip & virtualenv

Clone this repo and follow the steps below

cd Google-OCR
virtualenv .env
source .env/bin/activate
pip install -r requirements.txt

Follow this Quick Start guide to setup Google Vision API, which is necessary for using Google OCR service. There is also video tutorial

Usage

Running OCR on collection of images. Note: This script only works on png image, so be sure to convert PDF or other format into png. But we will be releasing support for other format in future.

cd ocr
python google_ocr.py --input_dir data/W22084/vol-1 --output_dir W22084/vol-1 --n 3

Output of OCR will be stored in txt file at output_dir.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Google-OCR

Setup with pip & virtualenv

Usage

Files

README.md

Latest commit

History

README.md

File metadata and controls

Google-OCR

Setup with pip & virtualenv

Usage