Skip to content

Instantly share code, notes, and snippets.

@RobbieClarken
Last active December 22, 2015 17:09
Show Gist options
  • Save RobbieClarken/6503945 to your computer and use it in GitHub Desktop.
Save RobbieClarken/6503945 to your computer and use it in GitHub Desktop.

Method 1

  1. If you don't have xpdf already:
brew install xpdf
  1. Convert the PDF to a postscript file:
pdftops in.pdf
  1. Open the PS file in a text editor and locate the watermark elements. Look for a way to identify them throughout the file. For example, the watermark code may be a fixed length and contain some keyword.
  2. Strip the postscript of the watermark code. For example, if the watermark code stretches 35 lines before and 70 lines after PATTERN you could use:
vim -c 'g/PATTERN/-35,+70d' -c 'w! out.ps|q!' in.ps
  1. Compile the postscript back into PDF:
ps2pdf out.ps

Tips

  • Finding the start and end of the watermark code can be tricky. If you notice it is preceeded by opening tags q which are later closed with Q you can ensure you have captured all closing tags with the following command:

    ack -B 35 -A 70 'PATTERN' in.ps | tee >(ack -c '^Q$') >(ack -c '^q$') >/dev/null

Method 2

To remove an annoying link:

pdftk in.pdf output temp.pdf uncompress
vim -c 'g#\V/URI (http://www.example.com/)#-6,+7d' -c 'w! temp.pdf|q!' temp.pdf
LANG=C sed -E -i '' 's/www.example.com/               /' temp.pdf
pdftk temp.pdf output out.pdf compress
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment