@FrankGrimm
Created July 22, 2021 09:38
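#!/bin/bash
# Download the MoWaS message archive (July 2021) from bbk.bund.de and convert each PDF to plain text.
# Requires wget, unzip, and pdftotext (from xpdf/poppler).
set -xe  # trace each command and abort on the first failure

mkdir -p pdfs texts
wget -O "pdfs/meldungen.zip" "https://www.bbk.bund.de/SharedDocs/Downloads/DE/Warnung-Vorsorge/MoWaS_Meldungen_HW-Lage_Jul_2021.zip?__blob=publicationFile&v=1"
cd "./pdfs"
unzip "meldungen.zip"
rm "meldungen.zip"
# Write one .txt per PDF into ../texts, keeping the base file name
for f in *.pdf; do
  pdftotext "$f" "../texts/${f%.*}.txt"
done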
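
The script relies on external tools (wget, unzip, pdftotext), so a check before the download may save a half-finished run. The following is only a sketch: the helper name `missing_tools` is hypothetical and not part of the original gist. It uses the shell built-in `command -v` to test whether each tool is on the PATH.

```shell
#!/bin/bash
# Hypothetical helper (an assumption, not from the original script):
# prints the name of every tool that is not found on PATH and
# returns non-zero if at least one is missing.
missing_tools() {
  local missing=0 tool
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "$tool"
      missing=1
    fi
  done
  return "$missing"
}
```

Placed near the top of the script, a call such as `missing_tools wget unzip pdftotext || { echo "install the tools listed above" >&2; exit 1; }` would stop before any files are created.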