Skip to content

Instantly share code, notes, and snippets.

@theodric
Created October 13, 2024 22:07
Show Gist options
  • Save theodric/44432ce2f02c6ed6b6e2e30c22f8434f to your computer and use it in GitHub Desktop.
Save theodric/44432ce2f02c6ed6b6e2e30c22f8434f to your computer and use it in GitHub Desktop.
Extract your tweets in plaintext from the tweets.js / tweets-part1.js files inside your Twitter archive, and remove old-fashioned retweets, plain URLs, and other crap
cat tweets.js | grep -i full_text | sed -E 's/.*"full_text" *: *"(.*)",/\1/' | egrep -v ^@ | egrep -v ^https:// | egrep -v ^RT > fulltext2.txt
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment