Architecture for Tesseract link
Step 1 - Download the latest Tesseract version for Windows link
Step 2 - PDF to Image Conversion using ImageMagick (refer to previous posts)
convert.exe -density 300 -trim D:\OCR\TestData\DEC17.pdf -quality 100 D:\OCR\TestData\DEC17.jpg
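The conversion above can also be scripted. The following is a minimal sketch, not the author's tooling: the helper names build_convert_command and page_image_name are hypothetical, and the sketch only builds the argument list shown above rather than running ImageMagick. It also illustrates the naming the next step relies on: for a multi-page PDF, ImageMagick numbers the output images, so page 1 of DEC17.pdf ends up as DEC17-0.jpg.

```python
from pathlib import PureWindowsPath


def build_convert_command(pdf_path, density=300, quality=100, exe="convert.exe"):
    """Build the ImageMagick argument list used in Step 2 (hypothetical helper)."""
    pdf = PureWindowsPath(pdf_path)
    jpg = pdf.with_suffix(".jpg")  # same folder and base name, .jpg extension
    return [
        exe,
        "-density", str(density),  # render the PDF at this DPI
        "-trim",                   # crop the surrounding blank border
        str(pdf),
        "-quality", str(quality),  # JPEG quality, 1-100
        str(jpg),
    ]


def page_image_name(pdf_path, page_index):
    """Name of the image for a zero-based page of a multi-page PDF."""
    pdf = PureWindowsPath(pdf_path)
    return str(pdf.with_name(f"{pdf.stem}-{page_index}.jpg"))


command = build_convert_command(r"D:\OCR\TestData\DEC17.pdf")
first_page = page_image_name(r"D:\OCR\TestData\DEC17.pdf", 0)
```

The resulting list can be passed to subprocess.run(command, check=True) on a machine where convert.exe is on the PATH.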
Step 3 - Tesseract Options using LSTM Networks
#--oem 2 runs the legacy engine combined with LSTM; for LSTM only, use --oem 1
tesseract.exe --oem 2 D:\OCR\TestData\DEC17-0.jpg D:\OCR\TestData\DEC171.csv
tesseract.exe --oem 2 D:\OCR\TestData\DEC17-0.jpg D:\OCR\TestData\ tsv
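The tsv option writes one row per detected element, with columns for position, confidence and text. As a rough sketch of how that output might be consumed, the snippet below keeps only word-level rows (level 5) above a confidence cutoff. The function name confident_words, the cutoff of 60 and the sample rows are all made up for illustration; real output comes from the command above.

```python
import csv
import io

# Made-up sample in Tesseract's TSV column layout (not real OCR output).
SAMPLE_TSV = (
    "level\tpage_num\tblock_num\tpar_num\tline_num\tword_num\t"
    "left\ttop\twidth\theight\tconf\ttext\n"
    "1\t1\t0\t0\t0\t0\t0\t0\t2480\t3508\t-1\t\n"
    "5\t1\t1\t1\t1\t1\t120\t95\t210\t48\t96\tInvoice\n"
    "5\t1\t1\t1\t1\t2\t350\t95\t130\t48\t41\tN0.\n"
    "5\t1\t1\t1\t1\t3\t500\t95\t160\t48\t91\t2017\n"
)


def confident_words(tsv_text, min_conf=60.0):
    """Return word-level texts whose confidence is at least min_conf."""
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    words = []
    for row in reader:
        text = (row["text"] or "").strip()
        # Level 5 rows are words; page/block/line rows carry conf -1 and no text.
        if row["level"] == "5" and text and float(row["conf"]) >= min_conf:
            words.append(text)
    return words


words = confident_words(SAMPLE_TSV)
```

Filtering on confidence is one simple way to drop doubtful reads such as the "N0." row in the sample; the right cutoff depends on the documents being scanned.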
Intelligent Photo OCR that reads better than you (Or not)
Deep Learning Fun - Crazy Food Image Classifier
OCR and OpenCV with RNN
Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai.
Build your own OCR(Optical Character Recognition) for free
An OCR System for recognition of Urdu text in Nastaliq Font
Optical Character Recognition
#Step 2 variant: same conversion at lower JPEG quality (-quality 60), which gives smaller files
convert.exe -density 300 -trim D:\OCR\TestData\DEC17.pdf -quality 60 D:\OCR\TestData\DEC17.jpg
- Image thresholding
- Contour formation
- Detecting individual characters and forming a Region of Interest (ROI) around each detected character
- Passing the ROIs through our trained model to get the predicted character
- Appending each predicted character to a string, which becomes the output
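The steps above can be sketched end to end in plain Python. This is an illustration under stated assumptions, not the linked article's implementation: a flood fill over connected pixels stands in for contour detection (a library such as OpenCV would normally do this), the image is a tiny hand-made grid, and stub_model is a hypothetical placeholder for a trained classifier.

```python
from collections import deque


def threshold(gray, cutoff=128):
    """Binarize: dark pixels (ink) become 1, light background becomes 0."""
    return [[1 if px < cutoff else 0 for px in row] for row in gray]


def find_rois(binary):
    """Bounding boxes (left, top, right, bottom) of connected ink blobs.

    A 4-connected flood fill stands in for contour detection here.
    """
    height, width = len(binary), len(binary[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if not binary[y][x] or seen[y][x]:
                continue
            left, top, right, bottom = x, y, x, y
            queue = deque([(x, y)])
            seen[y][x] = True
            while queue:
                cx, cy = queue.popleft()
                left, right = min(left, cx), max(right, cx)
                top, bottom = min(top, cy), max(bottom, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    inside = 0 <= nx < width and 0 <= ny < height
                    if inside and binary[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((left, top, right, bottom))
    return sorted(boxes)  # left-to-right order, enough for a single text line


def recognize(gray, predict_char):
    """Threshold, locate ROIs, classify each one, and append to the output."""
    binary = threshold(gray)
    text = ""
    for left, top, right, bottom in find_rois(binary):
        roi = [row[left:right + 1] for row in binary[top:bottom + 1]]
        text += predict_char(roi)
    return text


def stub_model(roi):
    """Hypothetical stand-in for a trained model: narrow blob is I, else O."""
    return "I" if len(roi[0]) == 1 else "O"


W, B = 255, 0  # white background, black ink
image = [
    [W, W, W, W, W, W, W],
    [W, B, W, B, B, B, W],
    [W, B, W, B, W, B, W],
    [W, B, W, B, B, B, W],
    [W, W, W, W, W, W, W],
]
result = recognize(image, stub_model)
```

Sorting the boxes by their left edge only gives a sensible reading order for one line of text; multi-line pages would need the ROIs grouped into lines first.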
Happy Learning!!!