Skip to content

Instantly share code, notes, and snippets.

@ivan
Last active January 13, 2023 08:40
Show Gist options
  • Save ivan/bcf97b680474e6704f9b9bc88d061a16 to your computer and use it in GitHub Desktop.
Save ivan/bcf97b680474e6704f9b9bc88d061a16 to your computer and use it in GitHub Desktop.
Twitter usernames exceeding 15 characters
# cat twitter-usernames-archive.org-sample-stream.txt | rg '^.{16,}$' | awk '{ print length, $0 }' | sort -n -s | cut -d" " -f2-
richardrushfield
vittoriopasteris
Scottcoopersmith
The_Porter_House
thoughtlessbeast
TributetoJohnnyB
shannonerickson1
meinnameistbernd
shelbyunderwood1
VictoriaCardoso_
andrewjohngodley
jacobbarssbailey
camila_balthazar
NapoleonDynamite
Brandonrichards1
sergiomaistrello
insecurerockstar
fifaworldcup_es_
jeklinaprilianda
obviouspseudonym
PrudenceHopeWade
veganstraightedge
colepatrickturner
flaviosinespacios
Eternal_Pessimist
Mohammad_Zangeneh
MargaretRodenberg
Laura_Villas_Boas
toasteroverheated
silverspoonglasses
mfletcherchristian
princessoftheworld
SmittyWigglesworth
AlyssaRamosReynoso
KevinWelcometoTwitte
DarrenStromanDarrenS
@ivan
Copy link
Author

ivan commented Nov 20, 2022

collected from the *.warc.os.cdx.gz files in https://archive.org/details/twitterarchive

@ivan
Copy link
Author

ivan commented Nov 20, 2022

@ivan
Copy link
Author

ivan commented Nov 20, 2022

Twitter did not always limit usernames to 15 characters: in a dataset of 418,329,839 usernames, there are
21 16-character,
8 17-character,
5 18-character, and
2 20-character usernames which tweeted something since ~2017 (and most accounts still exist)

This will be missing some users that tweeted infrequently or never after 2017.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment