This is a copy of https://github.com/Jay-Oh-eN/datasets
- http://rs.io/2014/05/29/list-of-data-sets.html
- http://archive.ics.uci.edu/ml/
- https://dreamtolearn.com/doc/2HDNJH3XJU6CVGKZ7SDM4MCSW
- https://news.ycombinator.com/item?id=7727474
- http://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
- http://www.quora.com/Data-Analysis/Whats-your-favorite-free-data-source
- Stanford SNAP datasets: networks
- UCI Machine Learning Repository
- Internet Traffic Archive
- Academic Torrents
- NYC MTA transit data
- SFMTA GPS data on vehicles
- Uber Anonymized GPS
- Citi Bike NYC: json
- Capital Bike Share DC
- Bay Area Bike Share: data and gem
- Weather
- San Francisco
- Education
- SF neighborhoods
- New York
- United States
- Census
- United Kingdom
- OpenPrism
- SF Building Permits
- SFPD Incidents
APIs -- public and hidden
- Wikipedia
- Foursquare
- Tumblr
- Rdio
- Delicous
- NYT
- Disqus
- Yelp
- Last.fm
- bitly
- Yahoo Finance (hidden)
- Hunch
- Trulia
- Evernote
- Songkick
- Freebase
- Programmable Web
- CrowdFlower Open Data
- 538 Datasets
- DataMob
- /r/datasets
- Introduction to Data Science (Berkeley): by Jeff (the Hammer) Hammerbacher
- Peter Skomoroch
- Hilary Mason Research Data sets
- Quora post
- StatsSci
- Open Science Data Cloud
- Visual.ly open data
- visualizing.org open data
- music + data
- marketplace: Infochimps
- time series: Quandl
- public datasets: enigma
- location contextualization: factual
- financial modeling: Quantopian
- email contextualization: Rapleaf
- social media: Gnip
- knoema
- Find the Data