Nimrod Priell Nimster

Podcasts for Data Science & Stuff

I asked the Twittersphere for data science (and tangentially related) podcast recommendations and got a much bigger response than I expected, including some superb suggestions, so I collected them in this gist. They're arranged alphabetically by name below, along with relevant Twitter accounts, links, and the names of the hosts (where I could find them).

Shoot me a tweet @bennyjtang if you have more suggestions to add to this list!

Original Twitter thread

Adversarial Learning

@dsparks
dsparks / Binarize_categories.R
Created September 18, 2012 22:39
Categorical data to indicator matrix to log odds ratios
# Starting with categorical data, ending with a table of log odds ratios
doInstall <- TRUE # Change to FALSE if you don't want packages installed.
toInstall <- c("plyr", "reshape2")
if (doInstall) {
  install.packages(toInstall, repos = "http://cran.us.r-project.org")
}
lapply(toInstall, library, character.only = TRUE)
# Canonical example of categorical data
HEC <- melt(HairEyeColor)
@Nimster
Nimster / gist:2054777
Created March 17, 2012 03:46
Important piece of functional programming and allocation of objects
# Often when doing functional style programming you get concerned that copying or
# wrapping objects might cost a lot in terms of performance. But the great thing
# about an immutable object is that I don't need to create a new one if I am
# handed an immutable copy.
# Follow me on twitter @nimrodpriell or see my blog at http://www.educated-guess.com
# for more bits of knowledge
# This is what we want to figure out the performance of. It is equivalent to calling
# frozenset(frozenset(frozenset(... m times (l)))...))
def frozens(l,m=100):
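
The preview above is cut off, so the following is only a minimal sketch of the idea described in the comments, not the author's original code. The names `nest_frozenset`, `nest_set`, and `reuses_object` are hypothetical. It relies on a CPython implementation detail: calling `frozenset()` on an object that is already exactly a `frozenset` returns that same object, whereas `set()` always builds a new copy. Other interpreters may behave differently.

```python
import timeit


def nest_frozenset(items, depth=100):
    """Apply frozenset() `depth` times, i.e. frozenset(frozenset(...items...))."""
    result = frozenset(items)
    for _ in range(depth - 1):
        # On CPython this is expected to hand back `result` itself, not a copy.
        result = frozenset(result)
    return result


def nest_set(items, depth=100):
    """Same nesting with the mutable set(), which copies on every call."""
    result = set(items)
    for _ in range(depth - 1):
        result = set(result)
    return result


def reuses_object(fs):
    """Report whether frozenset(fs) returned the very same object as fs."""
    return frozenset(fs) is fs


if __name__ == "__main__":
    data = list(range(1000))
    print("frozenset reused:", reuses_object(frozenset(data)))
    print("frozenset nesting:", timeit.timeit(lambda: nest_frozenset(data), number=200))
    print("set nesting:      ", timeit.timeit(lambda: nest_set(data), number=200))
```

If the interpreter reuses the immutable object, the `frozenset` nesting should cost roughly one construction plus cheap calls, while the `set` nesting pays for a full copy at every level. The timings are left for the reader to run, since they depend on the machine and interpreter.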
@sujee
sujee / emr-wait-for-completion.sh
Created January 2, 2011 07:14
emr-wait-for-completion.sh
#!/bin/bash
## http://sujee.net/tech/articles/amazon-emr-beyond-basics/
echo "=== $JOBID started...."
LOGDIR="/var/logs/hadoop-logs/${JOBNAME}__${JOBID}__${TIMESTAMP}"
mkdir -p "${LOGDIR}"
## the code below waits until the job is done