@nathancolgate
Created April 21, 2015 17:45
The modern Web is a dynamic place. However, sometimes it's necessary (or desirable) to remove the dynamic functionality of a website, while preserving its static content.
wget -P /path/to/destination/directory/ -mpck --user-agent="" -e robots=off --wait 1 -E https://www.example.com/
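For readers who don't have the short flags memorized, the same command can be spelled out with wget's long options. This is only a sketch: the destination path and URL are the placeholders from the command above, `mirror_args` is a name made up for illustration, and the script just prints the command (a dry run) so it can be checked before anything is downloaded.

```shell
#!/usr/bin/env bash
# Long-option spelling of: wget -P <dir> -mpck --user-agent="" -e robots=off --wait 1 -E <url>
# mirror_args is a hypothetical variable name used only for this illustration.
mirror_args=(
  --directory-prefix=/path/to/destination/directory/  # -P: where to save files
  --mirror                # -m: recursion, infinite depth, timestamping
  --page-requisites       # -p: also fetch images, CSS, and scripts each page needs
  --continue              # -c: resume partially downloaded files
  --convert-links         # -k: rewrite links so the copy works offline
  --user-agent=""         # send an empty User-Agent header
  --execute robots=off    # -e: ignore robots.txt
  --wait=1                # pause one second between requests
  --adjust-extension      # -E: add .html/.css suffixes where the type calls for it
)

# Dry run: print one argument per line instead of downloading.
printf '%s\n' wget "${mirror_args[@]}" https://www.example.com/

# To actually mirror the site, run this instead:
# wget "${mirror_args[@]}" https://www.example.com/
```

Keeping the flags in an array makes it easy to comment each one and to add or drop options without re-quoting the whole line.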
@mattkirkland

I refer to this every time I download a static copy of an old Engage site. BUT it didn't get any assets for a site that was on Aecom-engage!

Worked with Nate to finesse this, and finally arrived at this command, which successfully downloaded the images:

wget --recursive --level=inf --page-requisites --convert-links -P i91/ -mpck --user-agent="" -e robots=off \
    --adjust-extension --span-hosts --domains=i-91charteroakbridge.com,aecom-engage.com i-91charteroakbridge.com
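The flags that likely made the difference are `--span-hosts` and `--domains`: by default a recursive wget stays on the starting host, so assets served from a second domain are skipped. `--span-hosts` lets wget follow links to other hosts, and `--domains` restricts that to the listed ones. As a side note, `-mpck` already covers `--recursive --level=inf`, `--page-requisites`, and `--convert-links`, so a trimmed equivalent might look like the sketch below. It has not been run against the original site; `hosts` and `args` are names made up for illustration, and the script only prints the command.

```shell
#!/usr/bin/env bash
# Trimmed sketch of the command above; hosts and args are hypothetical names.
hosts="i-91charteroakbridge.com,aecom-engage.com"

args=(
  -P i91/                 # save everything under i91/
  -mpck                   # mirror, page requisites, continue, convert links
  --user-agent=""
  -e robots=off
  --adjust-extension
  --span-hosts            # allow following links to other hosts...
  --domains="$hosts"      # ...but only to these domains
)

# Dry run: print one argument per line instead of downloading.
printf '%s\n' wget "${args[@]}" i-91charteroakbridge.com

# To actually download:
# wget "${args[@]}" i-91charteroakbridge.com
```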
