Skip to content

Instantly share code, notes, and snippets.

@gaboelnuevo
Created September 22, 2017 19:48
Show Gist options
  • Save gaboelnuevo/0062accf849176fa41186088db27846f to your computer and use it in GitHub Desktop.
Save gaboelnuevo/0062accf849176fa41186088db27846f to your computer and use it in GitHub Desktop.
tesseract aws lambda bin
#source: https://github.com/skylander86/lambda-text-extractor
sudo yum install libtool
sudo yum install libjpeg-devel libpng-devel libtiff-devel zlib-devel
curl http://www.leptonica.com/source/leptonica-1.73.tar.gz | tar xzv
cd leptonica-1.73 && ./configure && make && sudo make install && cd ..
curl -L https://github.com/tesseract-ocr/tesseract/archive/3.05.tar.gz | tar xzv
cd tesseract-3.05/ && ./autogen.sh && ./configure && make && sudo make install && cd ..
mkdir text-extractor/lib-linux_x64/tesseract
cp /usr/local/lib/{libtesseract.so.3,liblept.so.5} text-extractor/lib-linux_x64/tesseract/
cp /lib64/{librt.so.1,libz.so.1,libpthread.so.0,libm.so.6,libgcc_s.so.1,libc.so.6,ld-linux-x86-64.so.2} text-extractor/lib-linux_x64/tesseract/
cp /usr/lib64/{libpng12.so.0,libjpeg.so.62,libtiff.so.5,libstdc++.so.6,libjbig.so.2.0} text-extractor/lib-linux_x64/tesseract/
cp /usr/local/share/tessdata/eng.traineddata text-extractor/lib-linux_x64/tesseract/
cp /usr/local/bin/tesseract text-extractor/bin-linux_x64/
mkdir text-extractor/lib-linux_x64/tesseract/tessdata
curl -L https://github.com/tesseract-ocr/tessdata/archive/3.04.00.tar.gz | tar xzv
cp tessdata-3.04.00/eng.* text-extractor/lib-linux_x64/tesseract/tessdata/
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment