Tim Hopper tdhopper

Locate the section for your GitHub remote in the .git/config file. It looks like this:

[remote "origin"]
	fetch = +refs/heads/*:refs/remotes/origin/*
	url = git@github.com:joyent/node.git

Now add the line fetch = +refs/pull/*/head:refs/remotes/origin/pr/* to this section. Obviously, change the GitHub URL to match your project's URL. It ends up looking like this:
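
The same edit can also be sketched in Python. This is only an illustration that works on the text of a config file; the function name `add_pr_refspec` and its `remote` parameter are made up for this example and are not part of git.

```python
def add_pr_refspec(config_text, remote="origin"):
    """Return config_text with a pull-request fetch line in the remote's section.

    Hypothetical helper: it treats the config as plain text and does not
    handle every syntax git accepts.
    """
    refspec = "fetch = +refs/pull/*/head:refs/remotes/%s/pr/*" % remote
    header = '[remote "%s"]' % remote
    out = []
    in_section = False
    inserted = False
    for line in config_text.splitlines():
        stripped = line.strip()
        if stripped.startswith("["):
            # A new section header closes the section we were in.
            if in_section and not inserted:
                out.append("\t" + refspec)
                inserted = True
            in_section = stripped == header
        elif in_section and stripped == refspec:
            inserted = True  # the line is already there, do not add it twice
        out.append(line)
    if in_section and not inserted:
        out.append("\t" + refspec)
    return "\n".join(out) + "\n"
```

Applied to the three-line section shown above, the function returns those same three lines followed by a tab-indented `fetch = +refs/pull/*/head:refs/remotes/origin/pr/*` line. Running it a second time leaves the text unchanged.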

What

Roll your own IPython Notebook server on Amazon Web Services (EC2) using the AWS Free Tier.

What are we using? What do you need?

An active AWS account. First-time sign-ups are eligible for the Free Tier for a year.
One Micro Tier EC2 Instance
With AWS we will use the stock Ubuntu Server AMI and customize it.
Anaconda for Python.
Coffee/Beer/Time

Unifying Sketch Monoids

As I discussed in Algebra for Analytics, many sketch monoids, such as Bloom filters, HyperLogLog, and Count-min sketch, can be described as a hashing (projection) of items into a sparse space, followed by two different commutative monoids, one used to write and one used to read. Finally, the read monoids always have the property that (a + b) <= a and (a + b) <= b, while the write monoids have the property that (a + b) >= a and (a + b) >= b.
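
As a rough illustration of that description, here is a minimal Python sketch. The class name `Sketch`, its parameters, and the salted SHA-256 hashing are assumptions made for this example, not the formulation from Algebra for Analytics. A Bloom filter writes with `max` (logical OR on bits) and reads with `min` (logical AND); a Count-min sketch writes with `+` on non-negative counts and reads with `min`.

```python
import functools
import hashlib
import operator


class Sketch:
    """Cells written with one commutative monoid and read with another."""

    def __init__(self, write, read, zero, k, m, shared):
        self.write, self.read = write, read
        self.k, self.m, self.shared = k, m, shared
        # shared=True: k hashes into one space of m cells (Bloom layout).
        # shared=False: k disjoint rows of m cells each (Count-min layout).
        self.cells = [zero] * (m if shared else k * m)

    def _indices(self, item):
        indices = []
        for i in range(self.k):
            # Assumption: salting SHA-256 with i stands in for k hash functions.
            digest = hashlib.sha256(("%d:%s" % (i, item)).encode("utf-8")).hexdigest()
            h = int(digest, 16) % self.m
            indices.append(h if self.shared else i * self.m + h)
        return indices

    def add(self, item, value=1):
        # Write monoid: the result is never smaller than either argument.
        for j in self._indices(item):
            self.cells[j] = self.write(self.cells[j], value)

    def query(self, item):
        # Read monoid: the result is never larger than either argument.
        return functools.reduce(self.read, [self.cells[j] for j in self._indices(item)])


bloom = Sketch(write=max, read=min, zero=0, k=3, m=64, shared=True)
bloom.add("apple")

cms = Sketch(write=operator.add, read=min, zero=0, k=3, m=64, shared=False)
cms.add("apple", 2)
cms.add("apple", 3)
```

Here `bloom.query("apple")` returns 1, since a Bloom filter has no false negatives, and `cms.query("apple")` returns 5 because no other item has been added to collide with it. The `shared` flag is the only difference between the two layouts.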

## Some questions:

Note how similar CMS and Bloom filters are. The difference: Bloom hashes k times onto the same space, while CMS hashes k times onto k orthogonal subspaces. Why the difference? Imagine a fixed-space Bloom filter that hashes onto k orthogonal spaces, or an overlapping CMS that hashes onto a single space of length k * m. How do the error asymptotics change?
CMS has many query modes (dot product, etc.). Can those generalize to other sketches (HLL, Bloom)?
What other sketch or non-sketch algorithms can be expressed in this dual mo

	#!/usr/bin/env python

	#
	# Converts any integer into a base [BASE] number. I have chosen 62
	# as it is meant to represent the integers using all the alphanumeric
	# characters, [no special characters] = {0..9}, {A..Z}, {a..z}
	#
	# I plan on using this to shorten the representation of possibly long ids,
	# a la URL shorteners
	#
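
The conversion this header describes can be sketched as follows. The body of the original script is not shown here, so this is a minimal illustration rather than the author's code; the names `ALPHABET`, `encode` and `decode` are assumptions, and the digit order follows the comment above (0..9, then A..Z, then a..z).

```python
ALPHABET = "0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz"
BASE = len(ALPHABET)  # 62


def encode(number):
    """Convert a non-negative integer to its base-62 string."""
    if number < 0:
        raise ValueError("only non-negative integers are supported")
    if number == 0:
        return ALPHABET[0]
    digits = []
    while number:
        number, remainder = divmod(number, BASE)
        digits.append(ALPHABET[remainder])
    # Remainders come out least significant first, so reverse them.
    return "".join(reversed(digits))


def decode(text):
    """Convert a base-62 string back to an integer."""
    number = 0
    for char in text:
        number = number * BASE + ALPHABET.index(char)
    return number
```

With this alphabet, `encode(61)` is `"z"` and `encode(62)` is `"10"`, and `decode(encode(n))` returns `n` for any non-negative integer.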

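	#!/usr/bin/php
	<?php
	// Collect every directory below the current one that holds a .git folder.
	$repos = array();
	exec('find -type d -name .git | sed -e "s/\.git//"', $repos);
	foreach ($repos as $repo) {
		// Quote the path so directories with spaces survive the shell.
		$status = shell_exec("cd " . escapeshellarg($repo) . " && git status");
		// strpos() returns 0 for a match at offset 0, so compare strictly against false.
		if (false === strpos($status, 'nothing to commit (working directory clean)')) {
			echo "$repo\n" . str_repeat('-', strlen($repo)) . "\n$status\n\n";
		}
	}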
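
For comparison, the same idea can be sketched in Python. This is an assumption-laden variant, not a port of the script above: it uses `git status --porcelain`, which prints nothing for a clean repository, instead of matching a human-readable message whose wording differs between git versions. The function names are hypothetical.

```python
import os
import subprocess


def find_repos(root="."):
    """Yield every directory under root that contains a .git directory."""
    for dirpath, dirnames, _filenames in os.walk(root):
        if ".git" in dirnames:
            dirnames.remove(".git")  # do not descend into git's own metadata
            yield dirpath


def is_dirty(porcelain_output):
    """Porcelain output has one line per change and is empty when clean."""
    return bool(porcelain_output.strip())


def report(root="."):
    """Print the status of every repository under root that has changes."""
    for repo in find_repos(root):
        status = subprocess.check_output(
            ["git", "status", "--porcelain"], cwd=repo, universal_newlines=True
        )
        if is_dirty(status):
            print("%s\n%s\n%s\n" % (repo, "-" * len(repo), status))
```

Calling `report()` from a directory that holds several checkouts prints each repository with uncommitted changes under an underlined heading, much like the PHP version.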

	# Copyright Jehiah Czebotar 2013
	# http://jehiah.cz/

	import tornado.options
	import glob
	import os
	import sqlite3
	import logging
	import datetime
	import csv

	#!/usr/bin/env python
	"""strip outputs from an IPython Notebook

	Opens a notebook, strips its output, and writes the outputless version to the original file.

	Useful mainly as a git filter or pre-commit hook for users who don't want to track output in VCS.

	This does mostly the same thing as the `Clear All Output` command in the notebook UI.

	LICENSE: Public Domain
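
The body of this script is cut off here. As a rough sketch of the operation the docstring describes, the following uses only the standard `json` module and assumes the nbformat 4 file layout (a top-level `cells` list whose code cells carry `outputs` and `execution_count`). The function names are hypothetical, and older notebook formats are laid out differently and are not handled.

```python
import json


def strip_outputs(notebook):
    """Clear outputs and execution counts from a notebook dict, in place."""
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") == "code":
            cell["outputs"] = []
            cell["execution_count"] = None
    return notebook


def strip_file(path):
    """Rewrite the notebook at path without its outputs."""
    with open(path, "r", encoding="utf-8") as handle:
        notebook = json.load(handle)
    strip_outputs(notebook)
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(notebook, handle, indent=1)
        handle.write("\n")
```

Markdown and raw cells are left untouched, so only the parts of the file that change on every run are removed.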

	#!/bin/bash

	# Set up paths and whatnot
	test -e ~/.bashrc && source ~/.bashrc

	# We need tmux. Obvs.
	if ! command -v tmux >/dev/null 2>&1; then echo "You need tmux first!"; exit 1; fi

	# Named variables are much more flexible
	name="$1"

	#!/bin/bash
	#
	# transcode-video.sh
	#
	# Copyright (c) 2013-2015 Don Melton
	#

	about() {
	cat <<EOF
	$program 5.13 of April 8, 2015

	# alias to edit commit messages without using rebase interactive
	# example: git reword commithash message
	reword = "!f() {\n GIT_SEQUENCE_EDITOR=\"sed -i 1s/^pick/reword/\" GIT_EDITOR=\"printf \\\"%s\\n\\\" \\\"$2\\\" >\" git rebase -i \"$1^\";\n git push -f;\n}; f"

	# completely wipe git history
	wipe-history = "!f() { git add . && git reset --soft $(git rev-list --max-parents=0 HEAD) && git commit --amend -m \"${1:-sup}\" && git push --force; }; f"

	# squash the last N commits
	squash = "!f(){ git reset --soft HEAD~${1} && git commit --edit -m\"$(git log --format=%B --reverse HEAD..HEAD@{1})\"; };f"