| column_name | datatype | description |
|---|---|---|
| user_id | INTEGER | unique identifier for each user in our "impeachment 2020" dataset |
| created_on | DATE | date the user was created |
| screen_name_count | INTEGER | number of screen names used |
| screen_names | STRING | all screen names used |
| is_bot | BOOLEAN | whether or not we classified this user as a "bot" / automated account |
| bot_rt_network | INTEGER | for bots, which retweet network (0:anti-trump, 1:pro-trump) |
| is_q | BOOLEAN | whether or not this user tweeted Q-anon language / hashtags |
| q_status_count | INTEGER | the number of tweets with Q-anon language / hashtags |
| status_count | INTEGER | number of total tweets authoried by this user (in our "impeachment 2020" dataset only) |
| rt_count | INTEGER | number of total retweets authoried by this user (in our "impeachment 2020" dataset only) |
| avg_score_lr | FLOAT | avergage opinion score from our Logistic Regression model (0:anti-trump, 1:pro-trump) |
| avg_score_nb | FLOAT | avergage opinion score from our Naive Bayes model (0:anti-trump, 1:pro-trump) |
| avg_score_bert | FLOAT | avergage opinion score from our BERT Transformer model (0:anti-trump, 1:pro-trump) |
| opinion_community | INTEGER | binary classification of average opinion (0:anti-trump, 1:pro-trump) |
| follower_count | INTEGER | number of followers (in our "impeachment 2020" dataset only) |
| follower_count_b | INTEGER | ... who are bots |
| follower_count_h | INTEGER | ... who are humans |
| friend_count | INTEGER | number of friends (in our "impeachment 2020" dataset only) |
| friend_count_b | INTEGER | ... who are bots |
| friend_count_h | INTEGER | ... who are humans |
| avg_toxicity | FLOAT | average "toxicity" score from the Detoxify model |
| avg_severe_toxicity | FLOAT | average "sever toxicity" score from the Detoxify model |
| avg_insult | FLOAT | average "insult" score from the Detoxify model |
| avg_obscene | FLOAT | average "obscene" score from the Detoxify model |
| avg_threat | FLOAT | average "threat" score from the Detoxify model |
| avg_identity_hate | FLOAT | average "identity hate" score from the Detoxify model |
| urls_shared_count (TODO) | INTEGER | number of tweets with URLs in them (TODO) |
| fact_scored_count | INTEGER | number of tweets with URL domains that we have rankings for |
| avg_fact_score | FLOAT | average fact score of links shared (1: fake news, 5: mainstream media) |
Created
February 4, 2022 16:46
-
-
Save s2t2/0838d97e0b16d29d0ba14519bd0914ba to your computer and use it in GitHub Desktop.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment