This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| TEXT Great Review on #JBSpinshttp://jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| OLD Great Review on #JBSpinshttp :/ / jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| NEW Great Review on #JBSpinshttp :// jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| TEXT nowFOLLOWiNG @youngstunna336 lemme give him a quick s\o!!! Ladies follow him he's #teamfollowback hey chris!!! | |
| OLD nowFOLLOWiNG @youngstunna336 lemme give him a quick so !!! Ladies follow him he's #teamfollowback hey chris !!! | |
| NEW nowFOLLOWiNG @youngstunna336 lemme give him a quick s\o !!! Ladies follow him he's #teamfollowback hey chris !!! | |
| TEXT #LebronShould know his only championship is 'slam dunk'. @infiniteideal | |
| OLD #LebronShould know his only championship is ' slam dunk'. @infiniteideal |
| TEXT Gradpics Today <3 !! | |
| OLD Gradpics Today <3 !! | |
| NEW Gradpics Today <3 !! | |
| TEXT Great Review on #JBSpinshttp://jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| OLD Great Review on #JBSpinshttp :/ / jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| NEW Great Review on #JBSpinshttp :// jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| TEXT RT @kamilaluvsdemi: @IselgomezI dont really know where i would be without you and Demi<3 i luv youuuu | |
| OLD RT @kamilaluvsdemi : @IselgomezI dont really know where i would be without you and Demi <3 i luv youuuu |
| TEXT Gradpics Today <3 !! | |
| OLD Gradpics Today < 3 !! | |
| NEW Gradpics Today <3 !! | |
| TEXT Great Review on #JBSpinshttp://jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| OLD Great Review on #JBSpinshttp :/ / jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| NEW Great Review on #JBSpinshttp :// jbspins.blogspot.com/2010/10/fandom-love-saturday-nightmares.html | |
| TEXT RT @kamilaluvsdemi: @IselgomezI dont really know where i would be without you and Demi<3 i luv youuuu | |
| OLD RT @kamilaluvsdemi : @IselgomezI dont really know where i would be without you and Demi < 3 i luv youuuu |
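The TEXT/OLD/NEW rows above show `<3` split into `< 3` in the OLD output and kept whole in the NEW output. One way to get the NEW behavior is to list specific patterns (URLs, mentions, hearts) ahead of the generic fallbacks in a regex alternation, so the first matching alternative wins. The sketch below is not the tokenizer that produced these rows; `TOKEN_RE` and `tokenize` are hypothetical names, and the pattern list is an assumption chosen only for illustration.

```python
import re

# Hypothetical pattern list (an assumption, not the original tokenizer).
# Order matters: specific alternatives are tried before generic ones.
TOKEN_RE = re.compile(
    r"""
    https?://\S+            # URLs stay in one piece
  | [@\#]\w+                # @mentions and hashtags
  | <3+                     # heart emoticon kept as a single token
  | [:;=][-o]?[()DPp/\\]    # a few western-style emoticons
  | \w+(?:'\w+)?            # words, allowing one internal apostrophe
  | [!?.]+                  # runs of sentence punctuation
  | [^\w\s]                 # any other single symbol
    """,
    re.VERBOSE,
)


def tokenize(text):
    """Return the list of tokens found in text, left to right."""
    return TOKEN_RE.findall(text)


print(" ".join(tokenize("Gradpics Today <3 !!")))
print(" ".join(tokenize("Demi<3 i luv youuuu")))
```

Because the heart pattern sits above the single-symbol fallback, `<` is never emitted on its own when a `3` follows it. Moving that alternative to the bottom of the list would reproduce the OLD-style split.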
| TEXT Wednesday 27th october 2010. 》have a nice day :) | |
| OLD Wednesday 27th october 2010 . 》have a nice day :) | |
| NEW Wednesday 27th october 2010 . „ Ä ãhave a nice day :) | |
| TEXT RT @eye_ee_duh_Esq: LMBO! This man filed an EMERGENCY Motion for Continuance on account of the Rangers game tonight! « Wow lmao | |
| OLD RT @eye_ee_duh_Esq : LMBO ! This man filed an EMERGENCY Motion for Continuance on account of the Rangers game tonight ! « Wow lmao | |
| NEW RT @eye_ee_duh_Esq : LMBO ! This man filed an EMERGENCY Motion for Continuance on account of the Rangers game tonight ! ¬ ´ Wow lmao | |
| TEXT calm heechul.. ^^ RT @Heedictator: http://twitpic.com/317hjw HHHHHHHHHHHHHHHHHHHHHHi~☆ | |
| OLD calm heechul .. ^^ RT @Heedictator : http://twitpic.com/317hjw HHHHHHHHHHHHHHHHHHHHHHi~☆ |
| --- out1 2011-06-29 19:17:40.000000000 -0400 | |
| +++ out2 2011-06-29 19:20:44.000000000 -0400 | |
| @@ -8,7 +8,7 @@ | |
| @iTSbEAUTiFULMe naw we didn't know we went at the last minute | |
| You get recessions & stock market declines . If you can't understand that , you're not ready & you won't do well in the markets . Peter Lynch | |
| Man I want so frye boot ... Imma bout to go on a huge shopping spree for my birthday | |
| -@JustinBieber todays my Birthdayy and it would mean the world to me if i got a dm from you[ : < 3 63 . | |
| +@JustinBieber todays my Birthdayy and it would mean the world to me if i got a dm from you [: < 3 63 . | |
| I hear Lottie's ( 1925 W . Cortland) , official #Blackhawks bar , is a possibility . For the zillionth time , where are people watching ? | |
| college kids who may sit around and complain about everything going on because they can't get atten because of the westside chicks |
| How much text versus metadata is in a tweet? | |
| Brendan O'Connor (brenocon.com), 2011-06-13 | |
| http://twitter.com/brendan642/status/80473880111742976 | |
| What's it mean to compare the amount of text versus metadata? | |
| Let's start with raw size of the data that comes over the wire from Twitter. | |
| ## Get tweets out of a sample stream archive. | |
| ## (e.g. curl http://stream.twitter.com/1/statuses/sample.json) | |
| % cat tweets.2011-05-19 | grep -P '"text":' | head -100000 > 100k_tweets |
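One rough way to compare text against metadata by raw size is to measure, per JSON record, the bytes in the `text` field against the bytes in the whole record. The sketch below assumes one JSON object per line, as the `grep` above implies; `text_vs_total` is a hypothetical helper and the two sample records are made up for illustration, not taken from the archive.

```python
import json


def text_vs_total(json_lines):
    """Sum UTF-8 byte counts of the text field and of the full record.

    Records without a "text" key (for example delete notices) are skipped.
    Returns (text_bytes, total_bytes).
    """
    text_bytes = 0
    total_bytes = 0
    for line in json_lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        if "text" not in record:
            continue
        text_bytes += len(record["text"].encode("utf-8"))
        total_bytes += len(line.encode("utf-8"))
    return text_bytes, total_bytes


# Made-up sample records (assumption: real records carry far more metadata).
sample = [
    json.dumps({"id": 1, "text": "have a nice day :)",
                "user": {"screen_name": "example", "followers_count": 10}}),
    json.dumps({"delete": {"status": {"id": 2}}}),
]

text_bytes, total_bytes = text_vs_total(sample)
print(text_bytes, total_bytes, round(text_bytes / total_bytes, 3))
```

To run it over the file built above, pass an open file handle instead of `sample`, for example `text_vs_total(open("100k_tweets", encoding="utf-8"))`.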
| " Use Vim settings, rather than Vi settings (much better!). | |
| " This must be first, because it changes other options as a side effect. | |
| set nocompatible | |
| " allow backspacing over everything in insert mode | |
| set backspace=indent,eol,start | |
| " set autoindent | |
| set backup " keep a backup file | |
| set history=10000 " lines of command line history |
| 150 235 104 327 360 265 367 348 266 309 227 511 119 493 299 439 201 118 60 115 2 345 312 257 156 315 578 341 316 181 149 500 35 22 42 374 175 135 304 303 89 161 573 543 68 40 504 52 603 447 147 231 64 331 198 3 213 414 120 202 174 411 597 29 491 284 307 34 73 106 333 366 39 78 512 116 154 413 365 200 466 222 86 159 5 207 4 160 377 234 323 387 270 38 105 93 441 380 426 463 144 77 289 379 127 210 25 610 152 13 203 600 172 6 336 71 157 59 65 133 114 110 495 166 239 405 193 182 392 70 180 430 237 220 185 53 75 238 204 226 267 171 218 164 286 369 45 27 245 248 33 85 67 254 496 168 28 46 20 494 281 134 423 129 15 183 280 111 419 607 251 287 206 595 339 9 162 125 361 225 588 139 107 552 602 306 330 224 421 62 184 236 272 211 101 343 256 431 7 532 187 246 268 155 112 221 291 57 232 440 153 334 582 197 186 24 502 318 124 158 21 351 82 404 443 247 215 191 98 472 140 394 19 260 233 432 455 255 137 121 23 563 16 26 480 269 572 344 503 462 375 228 179 470 95 72 74 549 301 311 277 66 275 176 102 192 542 208 551 76 252 81 2 |
| 1980s | |
| 1990s | |
| 19th_century | |
| 20th_centuries | |
| 20th_century | |
| 21st_century | |
| ability | |
| abnormal_returns | |
| abortion | |
| absolute_value |
| feat year ts freq impactraw impactnorm | |
| [abstract]discourse 1980 0.00101933710363 0.123376623377 0.0193674049689 0.000125762369928 | |
| [abstract]discourse 1981 0.00107427232847 0.232142857143 0.0139655402701 0.000249384647681 | |
| [abstract]discourse 1982 0.00113786762909 0.0416666666667 0.00910294103272 4.74111512121e-05 | |
| [abstract]discourse 1983 0.00121956281201 0.147058823529 0.0182934421802 0.000179347472355 | |
| [abstract]discourse 1984 0.00122237204345 0.264 0.040338277434 0.000322706219472 | |
| [abstract]discourse 1985 0.00113985199031 0.271739130435 0.0284962997579 0.000309742388672 | |
| [abstract]discourse 1986 0.00099591218448 0.125541125541 0.0288814533499 0.00012502793658 | |
| [abstract]discourse 1987 0.000890163733688 0.131034482759 0.0169131109401 0.000116642144414 | |
| [abstract]discourse 1988 0.000788717837666 0.0459770114943 0.00946461405199 3.62628890881e-05 |
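The file does not define its columns, but in the rows shown `impactnorm` agrees with `ts * freq` to the printed precision. Whether that is how the column was actually computed is an assumption. The sketch below parses three rows copied from the table and checks the relation; `parse_rows` and `matches_product` are hypothetical helper names.

```python
import math

# Three rows copied from the table above: year ts freq impactraw impactnorm.
ROWS = """\
1980 0.00101933710363 0.123376623377 0.0193674049689 0.000125762369928
1982 0.00113786762909 0.0416666666667 0.00910294103272 4.74111512121e-05
1984 0.00122237204345 0.264 0.040338277434 0.000322706219472
"""


def parse_rows(text):
    """Parse whitespace-separated rows into (year, ts, freq, raw, norm) tuples."""
    rows = []
    for line in text.splitlines():
        year, ts, freq, impactraw, impactnorm = line.split()
        rows.append((int(year), float(ts), float(freq),
                     float(impactraw), float(impactnorm)))
    return rows


def matches_product(row, rel_tol=1e-6):
    """True if impactnorm equals ts * freq within a relative tolerance."""
    _, ts, freq, _, impactnorm = row
    return math.isclose(ts * freq, impactnorm, rel_tol=rel_tol)


rows = parse_rows(ROWS)
for row in rows:
    print(row[0], matches_product(row))
```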