Kaggle Wiki - some resources come from here.
ML:
Statistics:
- Introduction to Statistics @ Udacity
- The Elements of Statistical Learning: Data Mining, Inference, and Prediction
- Think Stats: Probability and Statistics for Programmers
- Open Intro Statistics
AI:
- Artificial Intelligence: Programming a Robotic Car @ Udacity
- Artificial Intelligence. A Modern Approach
NLP:
Data Mining:
Data Analysis:
- Python for Data Analysis
- Software for Data Analysis: Programming with R
- Computing for Data Analysis @ John Hopkins
- Data Analysis @ John Hopkins
Hadoop:
- Cloudera
- Cloudera Hadoop Training Videos
- Cloudera's Hadoop Demo VM
- Big Data University
- Running Hadoop on Ubuntu Linux(Single-Node Cluster)
- Hadoop Tutorial Series
- Hadoop: The Definitive Guide
Papers:
- Leakage in Data Mining: Formulation, Detection, and Avoidance
- On Discriminative vs Generative Classifiers
Blogs:
- Kaggle Community Blogs
- Simply Statistics
- Doing Bayesian Data Analysis
- John Myles White
- Zero Intelligence Agents
Misc:
- Algorithms
- Tutorials
- Data Science Toolkit
- Data Science as a Sport
- Very concise notes on ML and Statistics
- Where can I get large datasets open to the public?
- How do I become a data scientist?
- Python Packages for Data Analysis and related work
Sub-reddits:
- OpenData - open APIs and datasets
- Datasets for Data Mining, Analytics and Knowledge Discovery
- Content analysis and visualization
- Statistical theory, software and application
- Machine Learning
- Data Science
- Statistics and Data Analysis with R
- Scientific/numerical/high performance/quantitative computing and/or analysis using SciPy
- Natural Language Processing
- Rational Thinking in philosohpy, thought experiments, debates and academic research
- Hadoop
Thanks for listing nice resources. If anyone find books about python and data science, then visit here for best python data science books.