Created
June 28, 2024 02:24
-
-
Save jsoma/430e3fc6b70aa1d91640dd563d8f6128 to your computer and use it in GitHub Desktop.
How to use pdfminer.six, PaddleOCR and OpenAI's GPT to OCR and extract text from PDFs and save them into a CSV (or Excel) file for later analysis.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment