Last active
September 3, 2015 22:09
-
-
Save webplumbr/64a150f595bcbac5b49c to your computer and use it in GitHub Desktop.
How to make a searchable PDF
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# | |
# This pertains to making a searchable PDF | |
# | |
# (1) Make sure your scanner setting is set to either Greyscale or Full-Colour and 200 dpi. | |
# (2) You need to have the following installed on your Ubuntu (Debian) system | |
# | |
sudo apt-get install imagemagick tesseract-ocr | |
# (3) Once you get the scanned PDF sent to your Univ e-mail address, download it and run the following | |
convert -normalize -density 200 -depth 8 <scanned.pdf> <newname.tif> | |
tesseract <newname.tif> <newpdfname> pdf | |
# (4) Open the newly created pdf and start searching |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment