Intelligence is simply talking to many people

Convert PDF to text (OCRmyPDF)

OCRmyPDF relies on Tesseract for its OCR (https://ocrmypdf.readthedocs.io/en/latest/languages.html), so to recognize languages other than the default English you need to install the matching Tesseract language packs (http://tttthis.com/blog/convert-image-to-text-tesseract-ocr).

OCRmyPDF "will add a text layer to a scanned PDF making it searchable"

FOSS

sudo apt-get install ocrmypdf

ocrmypdf input.pdf output.pdf

SPANISH (pass the language so Spanish characters are recognized; otherwise characters such as ¿ won't copy-paste correctly)

ocrmypdf -l spa input.pdf output-spa.pdf
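Since the -l option only works when the matching Tesseract language pack is present, a small wrapper can check first. This is a rough sketch, not part of OCRmyPDF: the function name ocr_with_lang is made up for illustration, and the suggested package name assumes the Debian/Ubuntu convention tesseract-ocr-<lang> (with underscores turned into hyphens), which may differ on other systems.

```shell
# Hypothetical helper: run ocrmypdf only if Tesseract reports the language as installed.
# Usage: ocr_with_lang LANG INPUT.pdf OUTPUT.pdf
ocr_with_lang() {
  lang="$1"
  input="$2"
  output="$3"

  # "tesseract --list-langs" prints one installed language code per line.
  # If tesseract itself is missing, grep simply finds no match.
  if ! tesseract --list-langs 2>&1 | grep -qFx "$lang"; then
    # Assumption: Debian/Ubuntu package naming (e.g. spa -> tesseract-ocr-spa).
    pkg="tesseract-ocr-$(printf '%s' "$lang" | tr '_' '-')"
    echo "Language '$lang' not available; try: sudo apt-get install $pkg" >&2
    return 1
  fi

  ocrmypdf -l "$lang" "$input" "$output"
}
```

For example, ocr_with_lang spa input.pdf output-spa.pdf either runs the Spanish OCR or prints which package is probably missing. For a document that mixes languages, ocrmypdf accepts several codes joined with a plus sign, as in -l eng+spa.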

