Forked from Parler-Analysis/gist:2c023fd2e053fba5bc85b09209f606eb
Created
January 12, 2021 22:27
-
-
Save hotelzululima/5c8386fb49762ccd53ab98fbe96730cc to your computer and use it in GitHub Desktop.
Parler Data & Tools
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Data & Tools: | |
Many contributors. Thanks to all. | |
[email protected] | |
Channel: #parlerparsers at https://webirc.hackint.org/ | |
#parlerparsers-video for video IDing | |
FBI Tip: https://tips.fbi.gov/digitalmedia/aad18481a3e8f02 | |
Want to help but don't know how? | |
Download copies and rehost them elsewhere, and seed torrents. | |
Help make this file easier for other to understand. | |
================================ | |
(1) Metadata json files with EXIF data on all MP4 videos scraped from Parler: | |
(down) https://gofile.io/d/Wf0JPg | |
magnet:?xt=urn:btih:1723e27bc79186c4574ff056ddb458d771c26e2f&dn=metadata.tar.gz&tr=wss%3A%2F%2Ftracker.btorrent.xyz&tr=wss%3A%2F%2Ftracker.openwebtorrent.com&tr=udp%3A%2F%2Ftracker.leechers-paradise.org%3A6969&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337&tr=udp%3A%2F%2Fexplodie.org%3A6969&tr=udp%3A%2F%2 | |
SHA256: 66809d9ae0a5a6577a3c80bb623562274ceccd96b35519f15f568d09cefc56f8 metadata.tar.gz | |
(2) Script to download WARCS from archive.org once they process: | |
https://github.com/ozywog/parler-data-tools | |
(3) Magnet URI for torrent of file that contains 1.8 million texts scraped from Parler and is subet of full data. | |
Originally hosted on https://parler-archive.deadops.de/ | |
magnet:?xt=urn:btih:FF29970B902657A32D561C0720E70FACFB8C4284&dn=parler_2020-01-06_posts-partial&tr=udp%3a%2f%2ftracker.opentrackr.org%3a1337%2fannounce&tr=udp%3a%2f%2ftracker.internetwarriors.net%3a1337%2fannounce | |
(4) Script to scrape usernames: | |
https://github.com/billstrobl/Prooter/blob/master/prooter.py | |
(5) Script to scrape videos: video scraper: | |
https://github.com/Nithindanday/parlervideoscraper | |
You will need the metadata.tar.gz from (1) to use this | |
(6) JSON / CSV / KML Scrapes: | |
https://gofile.io/d/p8RxUC - csv with all non-zero lat/log from donk's josn | |
https://gofile.io/d/WVmqhR - quick 'n dirty KMLmade from the csv | |
(7) Script to extract images/videos from WARCs: | |
https://gist.github.com/redd-dedd/9a200a9ba789f312faf53b25ac63e024 | |
HOW TO VIEW WARC/ZSTD from ArchiveTeam's Parler scrape | |
# How to View Parler Archive "megawarc.warc.zst" files. | |
These are official zstd archive and warc standards. | |
They are uploading to: https://archive.org/details/archiveteam_neparlepas | |
1. Install Python 3.7 | |
2. Execute: pip install zstandard==0.10.2 | |
3. Download archive from here: https://archive.org/details/archiveteam_neparlepas?tab=collection | |
4. Copy this script into a new file called xtract.py: https://hastebin.com/bugedubaxi.py | |
5. Execute: python ./xtract.py /path/to/parler_blahblah.megawarc.warc.zst > dict | |
6. Execute: zstd -d /path/to/parler_blahblah.megawarc.warc.zst -D dict | |
7. Import the decompressed parler_blahblah.megacarc.warc file into this tool: https://github.com/webrecorder/webrecorder-desktop | |
If you cannot install Python 3.7 for some reason, a dockerfile is available at: | |
https://gist.github.com/shoghicp/6ce05806ffc805929667ec2d4c62aba2 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment