"No one is harder on a talented person than the person themselves" - Linda Wilkinson ; "Trust your guts and don't follow the herd" ; "Validate direction not destination" ;

June 09, 2018

Day #114 - OCR with Tesseract

Architecture for Tesseract link

Step 1 - Download the latest Tesseract version for Windows (link)

Step 2 - PDF to Image Conversion using ImageMagick (Refer to Previous Posts)

convert.exe -density 300 -trim D:\OCR\TestData\DEC17.pdf -quality 100 D:\OCR\TestData\DEC17.jpg

convert.exe -density 300 -trim D:\OCR\TestData\DEC17.pdf -quality 60 D:\OCR\TestData\DEC17.jpg
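The two commands above differ only in the JPEG quality setting, so the conversion is easy to script. Here is a minimal Python sketch, assuming convert.exe is on the PATH; the helper names build_convert_command and convert_pdf are made up for illustration. For a multi-page PDF, ImageMagick numbers the output files (DEC17-0.jpg, DEC17-1.jpg, ...), which is where the DEC17-0.jpg used in Step 3 comes from.

```python
import subprocess


def build_convert_command(pdf_path, jpg_path, density=300, quality=100,
                          exe="convert.exe"):
    """Return the ImageMagick argument list used in Step 2.

    The argument order mirrors the commands in this post:
    -density goes before the input PDF, -quality before the output JPG.
    """
    return [
        exe,
        "-density", str(density),
        "-trim",
        pdf_path,
        "-quality", str(quality),
        jpg_path,
    ]


def convert_pdf(pdf_path, jpg_path, **options):
    """Run the conversion; raises CalledProcessError if ImageMagick fails."""
    subprocess.run(build_convert_command(pdf_path, jpg_path, **options),
                   check=True)


if __name__ == "__main__":
    # Print both variants from this post without running them.
    for quality in (100, 60):
        command = build_convert_command(r"D:\OCR\TestData\DEC17.pdf",
                                        r"D:\OCR\TestData\DEC17.jpg",
                                        quality=quality)
        print(" ".join(command))
```

Passing the arguments as a list (instead of one shell string) avoids quoting problems with Windows paths.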

Step 3 - Tesseract Options using LSTM Networks

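# --oem 2 combines the legacy engine with LSTM; use --oem 1 for LSTM only
# Tesseract appends .txt to the output base, so this writes DEC171.csv.txt (plain text, not CSV)
tesseract.exe --oem 2 D:\OCR\TestData\DEC17-0.jpg D:\OCR\TestData\DEC171.csv

# The tsv config writes word-level boxes and confidences to <output base>.tsv
tesseract.exe --oem 2 D:\OCR\TestData\DEC17-0.jpg D:\OCR\TestData\ tsv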
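The tsv output is tab-separated, with one row per page, block, paragraph, line and word, and the columns level, page_num, block_num, par_num, line_num, word_num, left, top, width, height, conf, text. A rough Python sketch for pulling out just the words above a confidence cutoff is below. The function name parse_tesseract_tsv and the sample rows are made up for illustration; they are not real output from DEC17-0.jpg.

```python
import csv
import io


def parse_tesseract_tsv(tsv_text, min_conf=0.0):
    """Return word-level rows (level 5) with confidence >= min_conf."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t",
                            quoting=csv.QUOTE_NONE)
    words = []
    for row in reader:
        text = (row.get("text") or "").strip()
        conf = float(row["conf"])  # non-word rows carry a conf of -1
        if row["level"] != "5" or not text or conf < min_conf:
            continue
        words.append({
            "text": text,
            "conf": conf,
            "left": int(row["left"]),
            "top": int(row["top"]),
            "width": int(row["width"]),
            "height": int(row["height"]),
        })
    return words


# Made-up sample in the TSV layout (not real OCR output).
SAMPLE_TSV = (
    "level\tpage_num\tblock_num\tpar_num\tline_num\tword_num\t"
    "left\ttop\twidth\theight\tconf\ttext\n"
    "1\t1\t0\t0\t0\t0\t0\t0\t640\t480\t-1\t\n"
    "5\t1\t1\t1\t1\t1\t36\t92\t60\t24\t96\tInvoice\n"
    "5\t1\t1\t1\t1\t2\t110\t92\t40\t24\t41\tN0.\n"
)

if __name__ == "__main__":
    for word in parse_tesseract_tsv(SAMPLE_TSV, min_conf=60):
        print(word["text"], word["conf"])
```

To use it on a real run, read the .tsv file Tesseract wrote and pass its contents as tsv_text.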

Intelligent Photo OCR that reads better than you (Or not)
Deep Learning Fun - Crazy Food Image Classifier
OCR and OpenCV with RNN

Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai.
Build your own OCR (Optical Character Recognition) for free
An OCR System for recognition of Urdu text in Nastaliq Font
Optical Character Recognition
  • Image Thresholding
  • Contour formation
  • Detecting individual characters and forming a Region of Interest (ROI) around each detected character.
  • Passing the ROIs through our trained model and finding out the predicted character
  • Appending each predicted character to a string, which becomes the final output.
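
The five steps above can be sketched in plain Python on a tiny toy image. This is only an illustration under loud assumptions: connected-component grouping stands in for contour detection (in OpenCV you would normally use threshold and findContours), and toy_classify is a placeholder for the trained model, not a real one.

```python
from collections import deque


def threshold(image, cutoff=128):
    """Step 1: binarize, 1 for ink (dark pixels), 0 for background."""
    return [[1 if px < cutoff else 0 for px in row] for row in image]


def find_character_boxes(binary):
    """Steps 2-3: group touching ink pixels and return one bounding box
    (left, top, right, bottom) per group, sorted left to right."""
    height, width = len(binary), len(binary[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if binary[y][x] == 0 or seen[y][x]:
                continue
            queue = deque([(y, x)])
            seen[y][x] = True
            left = right = x
            top = bottom = y
            while queue:
                cy, cx = queue.popleft()
                left, right = min(left, cx), max(right, cx)
                top, bottom = min(top, cy), max(bottom, cy)
                # Visit the 8 neighbours of the current pixel.
                for ny in (cy - 1, cy, cy + 1):
                    for nx in (cx - 1, cx, cx + 1):
                        if (0 <= ny < height and 0 <= nx < width
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((left, top, right, bottom))
    return sorted(boxes)  # reading order for a single line of text


def crop(binary, box):
    """Cut the ROI for one character out of the binary image."""
    left, top, right, bottom = box
    return [row[left:right + 1] for row in binary[top:bottom + 1]]


def recognize(image, classify):
    """Steps 4-5: classify each ROI and append the result to a string."""
    binary = threshold(image)
    text = ""
    for box in find_character_boxes(binary):
        text += classify(crop(binary, box))
    return text


def toy_classify(roi):
    """Placeholder for the trained model: a one-pixel-wide ROI is 'I',
    anything wider is 'L'."""
    return "I" if len(roi[0]) == 1 else "L"


B, K = 255, 0  # background and ink grey levels
IMAGE = [
    [B, K, B, K, B, B, B],
    [B, K, B, K, B, B, B],
    [B, K, B, K, K, K, B],
]

if __name__ == "__main__":
    print(recognize(IMAGE, toy_classify))
```

Sorting the boxes by their left edge only gives the right reading order for a single line of text; multi-line pages need the lines grouped first.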

Happy Learning!!!
