Skip to content

Instantly share code, notes, and snippets.

@edsu
Last active April 6, 2025 15:07
Show Gist options
  • Save edsu/27819e3ea2a6ad9b3a2c6cf5831d23cf to your computer and use it in GitHub Desktop.
Save edsu/27819e3ea2a6ad9b3a2c6cf5831d23cf to your computer and use it in GitHub Desktop.
docker run \
--publish 9037:9037 \
-v $PWD/crawls:/crawls/ \
webrecorder/browsertrix-crawler crawl \
--seeds https://www.womenonweb.org/af/ \
--seeds https://www.womenonweb.org/ar/ \
--seeds https://www.womenonweb.org/de/ \
--seeds https://www.womenonweb.org/en/ \
--seeds https://www.womenonweb.org/es/ \
--seeds https://www.womenonweb.org/fa/ \
--seeds https://www.womenonweb.org/fr/ \
--seeds https://www.womenonweb.org/hu/ \
--seeds https://www.womenonweb.org/id/ \
--seeds https://www.womenonweb.org/it/ \
--seeds https://www.womenonweb.org/ja/ \
--seeds https://www.womenonweb.org/ko/ \
--seeds https://www.womenonweb.org/lt/ \
--seeds https://www.womenonweb.org/ms/ \
--seeds https://www.womenonweb.org/nl/ \
--seeds https://www.womenonweb.org/pl/ \
--seeds https://www.womenonweb.org/pt/ \
--seeds https://www.womenonweb.org/ru/ \
--seeds https://www.womenonweb.org/si/ \
--seeds https://www.womenonweb.org/sk/ \
--seeds https://www.womenonweb.org/sv/ \
--seeds https://www.womenonweb.org/ta/ \
--seeds https://www.womenonweb.org/th/ \
--seeds https://www.womenonweb.org/tl/ \
--seeds https://www.womenonweb.org/tr/ \
--seeds https://www.womenonweb.org/uk/ \
--seeds https://www.womenonweb.org/xh/ \
--generateWACZ \
--text \
--collection womenonweb \
--workers 6 \
--screencastPort 9037 \
--exclude '.*/language/switch.*'
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment