Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database): Day #114

June 09, 2018

Day #114 - OCR with Tesseract

Architecture for Tesseract link

Step 1 - Download Latest version for windows link

Step 2 - Pdf to Image Conversion using ImageMagik (Refer Previous Posts)

convert.exe -density 300 -trim D:\OCR\TestData\DEC17.pdf -quality 100 D:\OCR\TestData\DEC17.jpg

convert.exe -density 300 -trim D:\OCR\TestData\DEC17.pdf -quality 60 D:\OCR\TestData\DEC17.jpg

Step 3 - Tesseract Options using LSTM Networks

#use LSTM Mode
tesseract.exe --oem 2 D:\OCR\TestData\DEC17-0.jpg D:\OCR\TestData\DEC171.csv

tesseract.exe --oem 2 D:\OCR\TestData\DEC17-0.jpg D:\OCR\TestData\ tsv

Intelligent Photo OCR that reads better than you (Or not)
Deep Learning Fun - Crazy Food Image Classifier
OCR and OpenCV with RNN

Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai.
Build your own OCR(Optical Character Recognition) for free
An OCR System for recognition of Urdu text in Nastaliq Font
Optical Character Recognition

Image Thresholding
Contour formation
Detecting individual characters and forming a Region of Image (ROI) around detected characters.
Passing the ROIs through our trained model and finding out the predicted character
Saving/Appending this character in a string, which is basically going to be your output.

Happy Learning!!!

Data Science, Database, AI Startups and Domain Learning's (Video-Image-Text-Data-Database)

June 09, 2018

Day #114 - OCR with Tesseract

No comments:

Git Code Repository

About Me

What is your Expertise

Search This Blog

Translate

About Me and Disclaimer

Labels

Data Science Good Reads

Cloud, Datacentre, BigData and NOSQL Blogs

SQL Links

Archecture Blog List

Programming Problems

Startup - Reads

Perl-Python-Ruby-Linux-Oracle

Management + Leadership Blogs

Research Papers & Podcasts

My Wordpress

Interesting Reads

Useful Links - C# and .NET

Java, Selenium, QTP and Test Tools Learning

Agile Testing

Reverse Logistics Reads

Biztalk Blogs

MS BI Links

Process - Learnt it :)

Usability Guidelines - Building Better Sites

.NET Test Tools and Other Interesting Reads

Review Checklist

Blog Archive

Live Traffic

Total Pageviews

Popular Posts