Skip to content

Instantly share code, notes, and snippets.

View olp-cs's full-sized avatar

Olga Pustovalova olp-cs

View GitHub Profile
@yanofsky
yanofsky / LICENSE
Last active August 30, 2025 02:53
A script to download all of a user's tweets into a csv
This is free and unencumbered software released into the public domain.
Anyone is free to copy, modify, publish, use, compile, sell, or
distribute this software, either in source code form or as a compiled
binary, for any purpose, commercial or non-commercial, and by any
means.
In jurisdictions that recognize copyright laws, the author or authors
of this software dedicate any and all copyright interest in the
software to the public domain. We make this dedication for the benefit
@gufranco-zz
gufranco-zz / railsInstaller.sh
Last active September 22, 2020 19:48
How to install Ruby, Git, Rails, PostgreSQL and MySQL on Ubuntu
#!/bin/sh
# System update
sudo apt-get update
# Curl
sudo apt-get -y install curl
# Git
sudo apt-get -y install git-core
@dypsilon
dypsilon / frontendDevlopmentBookmarks.md
Last active September 25, 2025 20:45
A badass list of frontend development resources I collected over time.

I wrote this in early January 2012, but never finished it. The research and thinking in this area led to a lot of the design of Yeoman and talks like "Javascript Development Workflow of 2013", "Web Application Development Workflow" and "App development stack for JS developers" (surpisingly little overlap in those talks, btw).

Now it's June 2013 and the state of web app tooling has matured quite a bit. But here's a snapshot of the story from 18 months ago, even if a little ugly and incomplete. :p


In the beginning…

  • Intro to tooling
@elijahmanor
elijahmanor / doctors.js
Last active October 22, 2021 20:21
Reducing Filter and Map with Reduce
var doctors = [
{ number: 1, actor: "William Hartnell", begin: 1963, end: 1966 },
{ number: 2, actor: "Patrick Troughton", begin: 1966, end: 1969 },
{ number: 3, actor: "Jon Pertwee", begin: 1970, end: 1974 },
{ number: 4, actor: "Tom Baker", begin: 1974, end: 1981 },
{ number: 5, actor: "Peter Davison", begin: 1982, end: 1984 },
{ number: 6, actor: "Colin Baker", begin: 1984, end: 1986 },
{ number: 7, actor: "Sylvester McCoy", begin: 1987, end: 1989 },
{ number: 8, actor: "Paul McGann", begin: 1996, end: 1996 },
{ number: 9, actor: "Christopher Eccleston", begin: 2005, end: 2005 },
@dylanjf
dylanjf / gist:7011219
Last active December 25, 2015 17:09
3 rep 10 fold CV
########3 rep 10 fold CV to determine feature sparsity percentage via RFE#########
#X = concatenated text features for training set (title, body, url) transformed via TfIdfVectorizer
#y = training set classification (0, 1)
import numpy as np
import pandas as pd
import sklearn.linear_model as lm
from sklearn.cross_validation import KFold
from sklearn import metrics
@ivant
ivant / WordStats.hs
Last active December 31, 2015 14:48
Build a sorted word frequency list from a file, trimmed to a given quantile.
-- Build a sorted word frequency list from a file, trimmed to a given quantile.
--
-- Usage: WordStats <book.txt> <quantile>
--
-- `quantile` is a number between 0 and 1.
--
-- Example:
-- ./WordStats "Don Quijote.txt" 0.85 > "Don Quijote.words.85"
import Control.Applicative
@debasishg
debasishg / gist:8172796
Last active October 3, 2025 16:28
A collection of links for streaming algorithms and data structures

General Background and Overview

  1. Probabilistic Data Structures for Web Analytics and Data Mining : A great overview of the space of probabilistic data structures and how they are used in approximation algorithm implementation.
  2. Models and Issues in Data Stream Systems
  3. Philippe Flajolet’s contribution to streaming algorithms : A presentation by Jérémie Lumbroso that visits some of the hostorical perspectives and how it all began with Flajolet
  4. Approximate Frequency Counts over Data Streams by Gurmeet Singh Manku & Rajeev Motwani : One of the early papers on the subject.
  5. [Methods for Finding Frequent Items in Data Streams](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.187.9800&amp;rep=rep1&amp;t
@zachguo
zachguo / print_cm.py
Last active May 31, 2022 17:39
Pretty print for sklearn confusion matrix
from sklearn.metrics import confusion_matrix
def print_cm(cm, labels, hide_zeroes=False, hide_diagonal=False, hide_threshold=None):
"""pretty print for confusion matrixes"""
columnwidth = max([len(x) for x in labels]+[5]) # 5 is value length
empty_cell = " " * columnwidth
# Print header
print " " + empty_cell,
for label in labels:
print "%{0}s".format(columnwidth) % label,