Documentation
read an image file and convert it to text
get the text content of images embedded in a pdf document
get text from a pdf, resorting to ocr as needed
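
The "resort to ocr as needed" idea can be sketched as follows. This is a minimal illustration, not the package's actual code: the function name `text_with_ocr_fallback`, the `min_chars` threshold, and the shape of `pages` (pairs of embedded text and page image) are all assumptions made for the example. The OCR step is passed in as a callable so the sketch runs without any OCR engine installed; in practice it might wrap something like `pytesseract.image_to_string`.

```python
from typing import Callable, Iterable, List, Optional, Tuple


def text_with_ocr_fallback(
    pages: Iterable[Tuple[Optional[str], object]],
    ocr: Callable[[object], str],
    min_chars: int = 1,
) -> List[str]:
    """Return one text string per page, calling OCR only where needed.

    `pages` yields (embedded_text, page_image) pairs (hypothetical shape).
    `ocr` is any callable that turns a page image into text.
    """
    results = []
    for embedded_text, page_image in pages:
        text = (embedded_text or "").strip()
        # Too little embedded text suggests a scanned page: fall back to OCR.
        if len(text) < min_chars:
            text = ocr(page_image).strip()
        results.append(text)
    return results
```

Pages that already carry a text layer never reach the OCR callable, which matters because OCR is usually the slow step.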
Modules
read an image with tesseract and return its output
get images from a pdf document
get images and their ocr text out of a pdf file
extract text from a pdf document, resorting to ocr as needed
save ocr to text file for easy retrieval
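
Saving ocr output to a text file might look like the sketch below. It is an assumption about how such a cache could work, not the module's real interface: the name `cached_ocr_text`, the `run_ocr` callable, and the choice to store the text next to the source file with an extra `.txt` suffix are all hypothetical.

```python
from pathlib import Path
from typing import Callable


def cached_ocr_text(source: Path, run_ocr: Callable[[Path], str]) -> str:
    """Return OCR text for `source`, reusing a saved .txt file if present."""
    # e.g. scan.pdf -> scan.pdf.txt (naming scheme assumed for illustration)
    cache = source.with_suffix(source.suffix + ".txt")
    if cache.exists():
        return cache.read_text(encoding="utf-8")
    text = run_ocr(source)
    cache.write_text(text, encoding="utf-8")
    return text
```

The sketch does not check whether the source file changed after the text file was written; comparing modification times would be one way to handle that.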