Ubuntu 24.04 - PDF with images and no OCR layer. #5

avoiceofreason · 2025-01-21T15:55:48Z

If you try and process a PDF that is made from images with no OCR text layer or no other text then pdf-narrator is unable to extract any text and gets a bit upset with a few error messages in the log as the .txt file is empty.

Useful tip:

Use ocrmypdf to add an OCR layer to existing PDF files:

ocrmypdf --force-ocr --output-type pdf --rotate-pages --deskew --clean input.pdf output.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ubuntu 24.04 - PDF with images and no OCR layer. #5

Ubuntu 24.04 - PDF with images and no OCR layer. #5

avoiceofreason commented Jan 21, 2025

Ubuntu 24.04 - PDF with images and no OCR layer. #5

Ubuntu 24.04 - PDF with images and no OCR layer. #5

Comments

avoiceofreason commented Jan 21, 2025