Talk delivered 2015-07-29 at ICERM workshop on "mathematics and data science"
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
find . | grep '\.py' | grep -v -e '/venv/' -e '(' -e ')' | xargs -I % grep import % | sed -e 's/^[ ]*//' | grep -e '^from ' -e '^import ' | awk '{print $2}' | sed -e 's/^[\.]*//' | cut -d\. -f1 | tr '[A-Z]' '[a-z]' | sed -e 's/,$//' | sort -bfd | uniq -c | sort -nr | grep -n -e numpy -e scipy -e matplotlib -e ipython -e pandas -e sympy -e nose | tr ':' '\t' |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
find . | grep '\.py' | grep -v -e '/venv/' -e '(' -e ')' | xargs -I % grep import % | sed -e 's/^[ ]*//' | grep -e '^from ' -e '^import ' | awk '{print $2}' | sed -e 's/^[\.]*//' | cut -d\. -f1 | tr '[A-Z]' '[a-z]' | sort -bfd | uniq -c | sort -nr | head -20 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
---------- Forwarded message ---------- | |
From: chris wiggins <chris.wiggins@[YYY].edu> | |
Date: Wed, Aug 1, 2012 at 7:26 PM | |
Subject: stats history | |
To: hadley@[XXX].edu | |
Cc: chris wiggins <chris.wiggins@[YYY].edu> | |
Dear Hadley: |
- TimesMachine: Thursday September 18, 1851 - NYTimes.com
- The New York Times Introduces a Web Site - NYTimes.com
- The New York Times Company | The New York Times Company
- The leaked New York Times innovation report is one of the key documents of this media age » Nieman Journalism Lab
- How The New York Times Works
- [Machine Learning For Journalism at The New York Times | Techjaw](http://techjaw.com/2015/02/01/machine-learning-for-journalism-at-the-new-york-times/?utm_conte
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
nice NPR story illustrating a conceptual and methodological | |
difference between AI and ML, using some of the more | |
press-grabbing, (human) game-beating systems: | |
http://www.npr.org/blogs/alltechconsidered/2015/01/08/375736513/look-out-this-poker-playing-computer-is-unbeatable | |
this story's pretty interesting in general but one particular | |
part grabs my attention: | |
Oren Etzioni, the head of Seattle's Allen Institute for |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Q: what are "single tree-based" (as opposed to forest-based) supervised learning methods? | |
A: some of my favorites: | |
- ADT | |
+ wiki: http://en.wikipedia.org/wiki/Alternating_decision_tree | |
+ ref: http://perun.pmf.uns.ac.rs/radovanovic/dmsem/cd/install/Weka/doc/classifiers-papers/trees/ADTree/atrees.pdf | |
- rpart in R | |
+ http://cran.r-project.org/web/packages/rpart/vignettes/longintro.pdf |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
FAQ: | |
where are some fun datasets to play with? | |
1. CMU: | |
http://lib.stat.cmu.edu/datasets/ | |
2. UCI: | |
a) MLR@UCI (machine learning repository / machine learning archive ) |