"No one is harder on a talented person than the person themselves" - Linda Wilkinson ; "Trust your guts and don't follow the herd" ; "Validate direction not destination" ;
Showing posts with label OCR. Show all posts
Showing posts with label OCR. Show all posts

June 22, 2023

Text Detection Models - Vision - GCP - Azure - Tesseract

 GCP - Link


Azure



Tesseract Results





Keep Exploring!!!

September 25, 2022

Document Q&A

From OCR, Document Extraction, Understanding, Hugging face has come a long way :)

DocQnA Pipeline very impressive




Results


Keep Exploring!!!

  • TesserOCR
  • MMOCR
  • OCRmypdf
  • EasyOCR
  • PaddleOCR
  • Kraken
  • OCRopus
  • PyOCR
  • Tesseract


Keep Learning!!!

August 16, 2022

OCR again - 2022 Updates

Donut 🍩, Document understanding transformer, is a new method of document understanding that utilizes an OCR

  • Based on the transformer concept
  • Experimented with the sample colab code
  • Gradio is like streamlit 

Samples and Results











Demo codes Link

Keep Exploring!!!

June 21, 2021

Interesting observations tesseract

While extracting digits from analog meters below two links we use to get the values

Lesson #1 - Setting the path to a folder vs complete executable, Minor thing took a while since not using it often

Ref - Link

Lesson #2 - Very useful for different situations on how it can be interpreted, 11 worked best. 6 was ok


Ref - Link

Keep Exploring!!!

June 09, 2018

Day #114 - OCR with Tesseract

Architecture for Tesseract link

Step 1 - Download Latest version for windows link

Step 2 - Pdf to Image Conversion using ImageMagik (Refer Previous Posts)

convert.exe -density 300 -trim D:\OCR\TestData\DEC17.pdf -quality 100 D:\OCR\TestData\DEC17.jpg

convert.exe -density 300 -trim D:\OCR\TestData\DEC17.pdf -quality 60 D:\OCR\TestData\DEC17.jpg

Step 3 - Tesseract Options using LSTM Networks

#use LSTM Mode
tesseract.exe --oem 2 D:\OCR\TestData\DEC17-0.jpg D:\OCR\TestData\DEC171.csv

tesseract.exe --oem 2 D:\OCR\TestData\DEC17-0.jpg D:\OCR\TestData\ tsv

Intelligent Photo OCR that reads better than you (Or not)
Deep Learning Fun - Crazy Food Image Classifier
OCR and OpenCV with RNN

Ready-to-use OCR with 40+ languages supported including Chinese, Japanese, Korean and Thai.
Build your own OCR(Optical Character Recognition) for free
An OCR System for recognition of Urdu text in Nastaliq Font
Optical Character Recognition
  • Image Thresholding
  • Contour formation
  • Detecting individual characters and forming a Region of Image (ROI) around detected characters.
  • Passing the ROIs through our trained model and finding out the predicted character
  • Saving/Appending this character in a string, which is basically going to be your output.

Happy Learning!!!

May 23, 2018

Day #109 - PDF to JPG Conversion

  • ImageMagick-6.9.9-Q16 (https://legacy.imagemagick.org/script/binary-releases.php)
  • Python 3.5 Environment on Anaconda and OCR followed as steps listed in https://sqlandsiva.blogspot.in/2018/03/day-101-ocr-and-python.html
Steps Goto - C:\Program Files\ImageMagick-6.9.9-Q16> in Administrator Mode

Different command line options to translate into jpeg with sharpening, density values
  • convert.exe -density 300 -trim D:\PetProject\OCR\pdfs\TestA.pdf -quality 100 D:\PetProject\OCR\pdfs\Pages\test.jpg
  • convert.exe -density 300 -trim D:\PetProject\OCR\pdfs\TestA.pdf -quality 100 -sharpen 0x1.0 D:\PetProject\OCR\pdfs\Pages\test.jpg
  • convert.exe -density 150 -trim D:\PetProject\OCR\pdfs\TestA.pdf -quality 100 -sharpen 0x1.0 D:\PetProject\OCR\pdfs\Pages\test.jpg
Happy Learning!!!

May 21, 2018

Day#108 - OCR for Hindi

OCR for Hindi

1. Download data from https://github.com/tesseract-ocr/tessdata/blob/3.04.00/hin.traineddata

2. Copy it to C:\opencv\Tesseract-OCR\tessdata/hin.traineddata


3. Test Data



3. Output

Code

More Reads - Link, Link1

Happy Learning!!!

March 09, 2018

Day #101 - OCR and Python


pytesseract installation steps

1. Download for windows https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM#400-alpha-for-windows
2. Installed - Windows Installer made with MinGW-w64 from UB Mannheim
3. This post was useful - https://stackoverflow.com/questions/34225927/pytesseract-cannot-find-the-file-specified
4. Pip install of pytesseract
5. Created Env Variable - TESSDATA_PREFIX=C:\Program Files (x86)\Tesseract-OCR

More Reads
RNN for OCR


Happy Learning!!!