Skip to content

Instantly share code, notes, and snippets.

i
me
my
myself
we
our
ours
ourselves
you
your
@fnielsen
fnielsen / gist:1373887
Created November 17, 2011 17:50
Co-author mining for papers in the Brede Wiki
import matplotlib.pyplot as plt
import networkx as nx
from pysqlite2 import dbapi2
connection = dbapi2.Connection('bredewiki-templates.sqlite3')
sql = "SELECT DISTINCT pid FROM brede WHERE (template='paper' OR template='conference_paper');"
cursor = connection.cursor()
cursor.execute(sql)
pids = [ row[0] for row in cursor.fetchall() ]
@fnielsen
fnielsen / gist:1392607
Created November 25, 2011 01:29
Computation of co-author distance in the Brede Wiki co-author graph
# wget http://neuro.imm.dtu.dk/services/bredewiki/download/bredewiki-templates.sqlite3
import matplotlib.pyplot as plt
import networkx as nx
from pysqlite2 import dbapi2
connection = dbapi2.Connection('bredewiki-templates.sqlite3')
sql = "SELECT DISTINCT tid FROM brede WHERE (template='paper' OR template='conference_paper');"
cursor = connection.cursor()
cursor.execute(sql)
tids = [ row[0] for row in cursor.fetchall() ]
@fnielsen
fnielsen / gist:1410094
Created November 30, 2011 18:12
Movie review sentiment classifier and the AFINN word list
# Code inspired and developed from:
# http://streamhacker.com/2010/05/10/text-classification-sentiment-analysis-naive-bayes-classifier/
from __future__ import division
import nltk.classify, nltk.corpus, nltk.classify.util
from pylab import *
filebase = '/home/fn'
@futureperfect
futureperfect / edges.csv
Created March 19, 2012 05:25
Sample data for Gephi demonstration
Source Target Type Id Weight Average Degree
1 3 Undirected 1 1.0 1.0
2 10 Undirected 8 1.0 1.0
3 4 Undirected 2 1.0 1.0
4 5 Undirected 3 1.0 1.0
5 10 Undirected 4 1.0 1.0
8 2 Undirected 7 1.0 1.0
8 9 Undirected 11 1.0 1.0
10 1 Undirected 5 1.0 1.0
10 6 Undirected 10 1.0 1.0
def run_pg_fouine():
info = host_info[env.host_string]
db_name = info.tags.get('Name')
sudo('perl -pi -e "s/log_min_duration_statement = .*/log_min_duration_statement = 0/" /etc/postgresql/9.*/main/postgresql.conf')
sudo('/etc/init.d/postgresql reload')
time.sleep(30)
sudo('perl -pi -e "s/log_min_duration_statement = .*/log_min_duration_statement = 500/" /etc/postgresql/9.*/main/postgresql.conf')
sudo('/etc/init.d/postgresql reload')
run('tail -n 100000 /var/log/postgresql/postgresql-9.*-main.log > /tmp/pgfouine.txt')
run('gzip -f /tmp/pgfouine.txt')
@cosmocatalano
cosmocatalano / instagram_scrape.php
Last active February 8, 2025 07:43
Quick-and-dirty Instagram web scrape, just in case you don't think you should have to make your users log in to deliver them public photos.
<?php
//returns a big old hunk of JSON from a non-private IG account page.
function scrape_insta($username) {
$insta_source = file_get_contents('http://instagram.com/'.$username);
$shards = explode('window._sharedData = ', $insta_source);
$insta_json = explode(';</script>', $shards[1]);
$insta_array = json_decode($insta_json[0], TRUE);
return $insta_array;
}
@yanofsky
yanofsky / LICENSE
Last active August 30, 2025 02:53
A script to download all of a user's tweets into a csv
This is free and unencumbered software released into the public domain.
Anyone is free to copy, modify, publish, use, compile, sell, or
distribute this software, either in source code form or as a compiled
binary, for any purpose, commercial or non-commercial, and by any
means.
In jurisdictions that recognize copyright laws, the author or authors
of this software dedicate any and all copyright interest in the
software to the public domain. We make this dedication for the benefit
library(stringr)
library(lubridate)
library(ggplot2)
library(scales)
getwd()
setwd("~/Desktop/Personal_Projects/Immersion/")
i <- read.csv("./immersion.csv", header=TRUE, as.is=TRUE)
## Only get those emails that I sent
@ryanlewis
ryanlewis / google_twunter_lol
Created May 27, 2014 14:28 — forked from jamiew/google_twunter_lol
Naughty word list, compiled by Google and @jamiew
4r5e
5h1t
5hit
a55
anal
anus
ar5e
arrse
arse
ass