@crates
Created November 11, 2020 16:51
Here are some examples of wget commands you can use to recursively spider (crawl) the contents of a website:
wget -r -l5 -U "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:39.0) Gecko/20100101 Firefox/39.0" -c -D english.grimoar.cz --max-redirect 0 --no-parent -w .5 http://english.grimoar.cz/
wget -r -l5 -U "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:39.0) Gecko/20100101 Firefox/39.0" -c -D krishnamurti.abundanthope.org --max-redirect 0 --no-parent -w .5 http://krishnamurti.abundanthope.org/index_htm_files/
wget -r -l5 -U "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:39.0) Gecko/20100101 Firefox/39.0" -c -D www.teachittome.com --max-redirect 0 --no-parent -w .5 http://www.teachittome.com/Chinuch/
wget -r -l5 -U "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:39.0) Gecko/20100101 Firefox/39.0" -c -D fas.org --max-redirect 0 --no-parent -w .5 http://fas.org/irp/doddir/
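
The commands above differ only in the domain passed to -D and the start URL, so the shared options can be wrapped in a small shell function. The following is a minimal sketch, not part of the original gist: the function name spider, the UA variable, and the DRY_RUN switch are hypothetical names chosen for illustration. It assumes the domain to restrict the crawl to is simply the host part of the start URL, which holds for every command listed here.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from the original gist): wraps the options
# shared by the wget commands above.
UA="Mozilla/5.0 (Windows NT 6.1; WOW64; rv:39.0) Gecko/20100101 Firefox/39.0"

spider() {
  local url="$1"
  # Derive the host from the URL: drop the scheme, then everything
  # from the first slash onward.
  local host="${url#*://}"
  host="${host%%/*}"

  # -r -l5            recurse, following links up to 5 levels deep
  # -U "$UA"          send a browser-like User-Agent string
  # -c                continue partially downloaded files
  # -D "$host"        restrict followed links to this domain
  # --max-redirect 0  do not follow HTTP redirects
  # --no-parent       never ascend above the starting directory
  # -w .5             wait half a second between requests
  # DRY_RUN (an assumption of this sketch): when set to a non-empty
  # value, the command is printed instead of executed.
  ${DRY_RUN:+echo} wget -r -l5 -U "$UA" -c -D "$host" \
    --max-redirect 0 --no-parent -w .5 "$url"
}
```

With the function loaded, `spider http://fas.org/irp/doddir/` runs the same crawl as the last command above, and `DRY_RUN=1 spider http://fas.org/irp/doddir/` only prints the command (without the original quoting) so the derived domain can be checked first.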