nullx5/convertir pdf o imagen a texto y pdf buscables con OCR.md

Last active December 16, 2025 23:19

Star (0) You must be signed in to star a gist
Fork (0) You must be signed in to fork a gist

Select an option

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/nullx5/f77614430600edd65180673e0cab489c.js"></script>
Save nullx5/f77614430600edd65180673e0cab489c to your computer and use it in GitHub Desktop.

Download ZIP

Raw

convertir pdf o imagen a texto y pdf buscables con OCR.md

convertir pdf o imagen a texto y pdf hacer pdf buscables con OCR.

sudo apt install gimagereader

sudo apt install tesseract-ocr
sudo apt install tesseract-ocr-spa    #lenguaje español
sudo apt install tesseract-ocr-eng    #lenguaje ingles

#permite convertir de pdf a texto plano - me gusta este de pdf a texto FACIL ✅
#NO exportar a .odt
# permite crear pdf buscable, varias coincidencias de una busqueda NO funciona tambien

sudo apt install ocrmypdf

ocrmypdf archivo_escaneado.pdf salida_final.pdf # permite crear pdf buscable, varias coincidencias de una busqueda

ocrmypdf --image-dpi 300 "imagen.jpg" "salida.pdf" # de imagen a pdf buscable

flatpak install flathub org.gnome.OCRFeeder
flatpak run org.gnome.OCRFeeder

#permite exportar a ODT y luego abrir con libre office writer y es editable luego exportar a .pdf o .docx
#permite convertir de pdf a texto plano - me gusta este de pdf a texto FACIL ✅
#exportar directamente a pdf NO HACE PDF BUSCABLE no funciona

OCR Python 
https://github.com/JaidedAI/EasyOCR

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment