This is a collection of scripts to restore a full GHTorrent MongoDB database from the dumps available
at http://ghtorrent-downloads.ewi.tudelft.nl
.
To do the restore:
-
Open a MongoDB terminal and run the
createCollections.js
script to create the necessary collections. You canblock_compressor
to eithersnappy
orzlib
to make your databases compressed. I am using none here, as I am using compression at the filesystem level. -
Run
restore-cummulative-dumps.sh
to restore the cummulative dumps. Wait 3-4 days. -
Run
restore-daily-dumps.sh
to restore all daily dumps. Run therestore-daily-dump.sh
with a date argument to restore a single daily dump