RobbieClarken/RemovePDFWatermark.md

Last active December 22, 2015 17:09

Star (0) You must be signed in to star a gist
Fork (0) You must be signed in to fork a gist

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/RobbieClarken/6503945.js"></script>
Save RobbieClarken/6503945 to your computer and use it in GitHub Desktop.

Download ZIP

Raw

RemovePDFWatermark.md

Method 1

If you don't have xpdf already:

brew install xpdf

Convert the PDF to a postscript file:

pdftops in.pdf

Open the PS file in a text editor and locate the watermark elements. Look for a way to identify them throughout the file. For example, the watermark code may be a fixed length and contain some keyword.
Strip the postscript of the watermark code. For example, if the watermark code stretches 35 lines before and 70 lines after PATTERN you could use:

vim -c 'g/PATTERN/-35,+70d' -c 'w! out.ps|q!' in.ps

Compile the postscript back into PDF:

ps2pdf out.ps

Tips

Finding the start and end of the watermark code can be tricky. If you notice it is preceeded by opening tags q which are later closed with Q you can ensure you have captured all closing tags with the following command:
```
ack -B 35 -A 70 'PATTERN' in.ps | tee >(ack -c '^Q$') >(ack -c '^q$') >/dev/null
```

Method 2

To remove an annoying link:

pdftk in.pdf output temp.pdf uncompress
vim -c 'g#\V/URI (http://www.example.com/)#-6,+7d' -c 'w! temp.pdf|q!' temp.pdf
LANG=C sed -E -i '' 's/www.example.com/               /' temp.pdf
pdftk temp.pdf output out.pdf compress

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment