Parler Data & Tools
Data & Tools:
Many contributors. Thanks to all.
[email protected]
Channel: #parlerparsers at https://webirc.hackint.org/
#parlerparsers-video for video IDing
FBI Tip: https://tips.fbi.gov/digitalmedia/aad18481a3e8f02
Want to help but don't know how?
Download copies and rehost them elsewhere, and seed torrents.
Help make this file easier for others to understand.
================================
(1) Metadata json files with EXIF data on all MP4 videos scraped from Parler:
(down) https://gofile.io/d/Wf0JPg
magnet:?xt=urn:btih:1723e27bc79186c4574ff056ddb458d771c26e2f&dn=metadata.tar.gz&tr=wss%3A%2F%2Ftracker.btorrent.xyz&tr=wss%3A%2F%2Ftracker.openwebtorrent.com&tr=udp%3A%2F%2Ftracker.leechers-paradise.org%3A6969&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337&tr=udp%3A%2F%2Fexplodie.org%3A6969&tr=udp%3A%2F%2
SHA256: 66809d9ae0a5a6577a3c80bb623562274ceccd96b35519f15f568d09cefc56f8 metadata.tar.gz
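To check a downloaded copy of metadata.tar.gz against the SHA256 above, something like the following should work. This is only a minimal sketch using Python's standard hashlib; the function names and the chunk size are made up for illustration and are not part of any of the tools listed here.

```python
import hashlib

# Digest copied from the SHA256 line above.
EXPECTED_SHA256 = "66809d9ae0a5a6577a3c80bb623562274ceccd96b35519f15f568d09cefc56f8"


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the hex SHA-256 of a file, reading it in chunks to keep memory use low."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps calling read() until it returns b"" (end of file).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's digest equals the expected hex digest (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

Calling matches_expected("metadata.tar.gz") returns True only if the download is byte-for-byte identical to the file the hash was computed from. If it returns False, re-download or try another mirror before using the data.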
(2) Script to download WARCS from archive.org once they process:
https://github.com/ozywog/parler-data-tools
(3) Magnet URI for a torrent of a file containing 1.8 million text posts scraped from Parler; it is a subset of the full data.
Originally hosted on https://parler-archive.deadops.de/
magnet:?xt=urn:btih:FF29970B902657A32D561C0720E70FACFB8C4284&dn=parler_2020-01-06_posts-partial&tr=udp%3a%2f%2ftracker.opentrackr.org%3a1337%2fannounce&tr=udp%3a%2f%2ftracker.internetwarriors.net%3a1337%2fannounce
(4) Script to scrape usernames:
https://github.com/billstrobl/Prooter/blob/master/prooter.py
(5) Script to scrape videos:
https://github.com/Nithindanday/parlervideoscraper
You will need the metadata.tar.gz from (1) to use this.
(6) JSON / CSV / KML Scrapes:
https://gofile.io/d/p8RxUC - CSV with all non-zero lat/long from donk's JSON
https://gofile.io/d/WVmqhR - quick 'n dirty KML made from the CSV
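For anyone who wants to rebuild a KML from the CSV themselves, here is a rough sketch of one way to do it. It is not the script that produced the KML above. The column names (latitude, longitude, name) are assumptions, so check the real CSV header and pass the right names. Treating a row as "zero" only when both values are 0 is also an assumption.

```python
import csv
import io
from xml.sax.saxutils import escape


def csv_to_kml(csv_text, lat_col="latitude", lon_col="longitude", name_col="name"):
    """Turn CSV text into a KML string with one Placemark per usable row.

    Column names are hypothetical defaults; adjust them to the actual header.
    """
    placemarks = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        try:
            lat = float(row[lat_col])
            lon = float(row[lon_col])
        except (KeyError, TypeError, ValueError):
            continue  # skip rows with missing or non-numeric coordinates
        if lat == 0.0 and lon == 0.0:
            continue  # skip rows with no real location
        name = escape(row.get(name_col) or "")
        # KML wants longitude first, then latitude, then altitude.
        placemarks.append(
            "<Placemark><name>{}</name>"
            "<Point><coordinates>{},{},0</coordinates></Point>"
            "</Placemark>".format(name, lon, lat)
        )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        '<kml xmlns="http://www.opengis.net/kml/2.2"><Document>\n'
        + "\n".join(placemarks)
        + "\n</Document></kml>\n"
    )
```

Read the CSV file into a string, pass it to csv_to_kml, and write the result to a .kml file that Google Earth or similar tools can open.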
(7) Script to extract images/videos from WARCs:
https://gist.github.com/redd-dedd/9a200a9ba789f312faf53b25ac63e024
HOW TO VIEW WARC/ZSTD from ArchiveTeam's Parler scrape
# How to View Parler Archive "megawarc.warc.zst" files.
These files use the standard zstd compression and WARC archive formats.
They are being uploaded to: https://archive.org/details/archiveteam_neparlepas
1. Install Python 3.7
2. Execute: pip install zstandard==0.10.2
3. Download archive from here: https://archive.org/details/archiveteam_neparlepas?tab=collection
4. Copy this script into a new file called xtract.py: https://hastebin.com/bugedubaxi.py
5. Execute: python ./xtract.py /path/to/parler_blahblah.megawarc.warc.zst > dict
6. Execute: zstd -d /path/to/parler_blahblah.megawarc.warc.zst -D dict
7. Import the decompressed parler_blahblah.megawarc.warc file into this tool: https://github.com/webrecorder/webrecorder-desktop
If you cannot install Python 3.7 for some reason, a dockerfile is available at:
https://gist.github.com/shoghicp/6ce05806ffc805929667ec2d4c62aba2
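Steps 5 and 6 work because these .warc.zst files carry their own zstd dictionary at the start of the file, and zstd needs that dictionary (the -D option) to decompress the rest. The sketch below shows roughly what a dictionary extractor might do. It is not the xtract.py script linked above, and it rests on assumptions: that the dictionary sits in a zstd skippable frame at the very beginning of the file, and that it may itself be zstd-compressed. The function name is made up for illustration.

```python
import struct

# Per the zstd frame format, skippable frames use magic numbers 0x184D2A50..0x184D2A5F.
SKIPPABLE_MAGIC_BASE = 0x184D2A50
# Magic bytes at the start of a regular zstd frame.
ZSTD_MAGIC = b"\x28\xb5\x2f\xfd"


def read_dictionary_frame(stream):
    """Read a leading skippable frame from a binary stream and return its payload.

    Assumption: the payload is the zstd dictionary, optionally zstd-compressed.
    """
    header = stream.read(8)
    if len(header) != 8:
        raise ValueError("file too short to contain a dictionary frame")
    # Both the magic number and the payload size are 4-byte little-endian integers.
    magic, size = struct.unpack("<II", header)
    if magic & 0xFFFFFFF0 != SKIPPABLE_MAGIC_BASE:
        raise ValueError("file does not start with a zstd skippable frame")
    payload = stream.read(size)
    if len(payload) != size:
        raise ValueError("dictionary frame is truncated")
    if payload[:4] == ZSTD_MAGIC:
        # Imported only when needed, so plain dictionaries work without the package.
        import zstandard
        payload = zstandard.ZstdDecompressor().decompressobj().decompress(payload)
    return payload
```

Open the archive with open(path, "rb"), pass the file object to read_dictionary_frame, and write the returned bytes to a file named dict for use with zstd -d ... -D dict as in step 6. If the function raises ValueError, the file probably does not follow this layout and the linked script should be used instead.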