Skip to content

Instantly share code, notes, and snippets.

@rsalaza4
Created March 20, 2024 17:28
Show Gist options
  • Select an option

  • Save rsalaza4/5492e587acdfb8a60d70aaac8fd9d595 to your computer and use it in GitHub Desktop.

Select an option

Save rsalaza4/5492e587acdfb8a60d70aaac8fd9d595 to your computer and use it in GitHub Desktop.
# Import libraries and dependencies
import re
import fitz
import PyPDF2
# Open resume PDF file
with fitz.open('Roberto_Salazar_Resume.pdf') as doc:
# Declare text string variable
text = ""
# Extract text from every page from the file
for page in doc:
text += page.get_text()
# Replace "\n" with an empty space
text = text.replace("\n"," ")
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment