Sam Wilson's Website

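A good trick from dpeach for getting text out of a scanned PDF with Tesseract: convert the PDF to a TIFF with ImageMagick, then run Tesseract on the TIFF (the text ends up in OutputFileName.txt):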

convert -density 300 file.pdf -depth 8 file.tiff

tesseract file.tiff OutputFileName

http://www.mythoughtspot.com/2014/10/23/use-tesseract-ocr-with-pdf-file/
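
The two commands above can be wrapped in a small shell function so the file names only have to be typed once. This is just a sketch: ocr_pdf and the DRY_RUN switch are names made up for this example, not part of ImageMagick or Tesseract, and it assumes both tools are installed and that the input name ends in .pdf.

```shell
#!/bin/sh
# Sketch only: ocr_pdf and DRY_RUN are hypothetical names for this example.
# Usage: ocr_pdf file.pdf [output-base]

ocr_pdf() {
    pdf="$1"
    # Default the output base to the PDF name without its .pdf extension.
    base="${2:-${pdf%.pdf}}"
    tiff="${base}.tiff"

    if [ -n "${DRY_RUN:-}" ]; then
        # Dry run: print the commands instead of running them.
        echo "convert -density 300 $pdf -depth 8 $tiff"
        echo "tesseract $tiff $base"
        return 0
    fi

    # Render the PDF at 300 DPI with 8 bits per channel.
    convert -density 300 "$pdf" -depth 8 "$tiff" || return 1
    # Tesseract adds the .txt extension to the output base itself.
    tesseract "$tiff" "$base"
}
```

Running ocr_pdf file.pdf should leave the recognised text in file.txt next to the intermediate file.tiff. Setting DRY_RUN=1 first prints the two commands without running them, which is handy for checking the file names. On ImageMagick 7 the convert command may need to be replaced with magick.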
