August 01, 2023

PDF Data Extraction

  • Evaluate if there is problem in data extraction. unistructured - https://pypi.org/project/unstructured/
  • yMuPDF, also known as Fitz, is a Python binding for the MuPDF library
  • pdfplumber - https://pypi.org/project/pdfplumber/
  • Camelot - https://camelot-py.readthedocs.io/en/master/
  • img2table - https://github.com/xavctn/img2table
Keep Exploring!!!

No comments:

Post a Comment