- Hilary Mason's collection of research-quality datasets
- 100+ Interesting Data Sets: seems really great for ML/data science practice or fun side projects.
- Most of the datasets available with R, but here ALSO available in CSV format! 700+ datasets.
- Kaggle Higgs Boson data (I think this is a super cool problem)
- Statistical Sleuth data problems -- pretty good for intro stats concepts
- Hadley's data packages -- baby names by sex (1880-2013), fuel economy of cars, atmospheric measurements from Central America, and info on all NYC flights in 2013. Set up for R, but I bet you could process these with other software if you wanted.
- others? (leave a comment!)
Last active
August 29, 2015 14:03
-
-
Save alyssafrazee/1df264173d81a711ebb5 to your computer and use it in GitHub Desktop.
some sites with cool data
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment