Skip to content

Instantly share code, notes, and snippets.

@kba
Last active April 10, 2018 16:27
Show Gist options
  • Save kba/44fa17351ad4d5ad2162be0e57a22d2e to your computer and use it in GitHub Desktop.
Save kba/44fa17351ad4d5ad2162be0e57a22d2e to your computer and use it in GitHub Desktop.

Clone and virtualenv-install ocr-d/ocrd_tesserocr

git clone https://github.com/OCR-D/ocrd_tesserocr
cd ocrd_tesserocr
virtualenv venv3
source venv3/bin/activate
pip install -r requirements.txt
pip install -e .

Clone ocrd-assets

git clone https://github.com/OCR-D/ocrd-assets

Run

Through ocrd process

ocrd process \
   -T ocrd-tool.json \
   -m ocrd-assets/data/SBB0000F29300010000/mets.xml \
   -I OCR-D-IMG \
   -w /tmp/workdir \
   ocrd_tesserocr_segment_region

Or directly through the MP CLI

ocrd_tesserocr_segment_region \ 
   -m https://github.com/OCR-D/ocrd-assets/raw/master/data/SBB0000F29300010000/mets.xml \
   -I OCR-D-IMG \
   -w /tmp/workdir

Inspect results

find /tmp/workdir
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment