Skip to content

Instantly share code, notes, and snippets.

View prateek's full-sized avatar

Prateek Rungta prateek

View GitHub Profile
#!/usr/bin/env ruby
# Uses the Readability Metrics API from http://ipeirotis.appspot.com/readability-api.html
# Accepts text from STDIN (piped) or as an argument
=begin examples
pbpaste|text_score.rb # copy text and run (on OS X) to get the stats for the clipboard.
cat myfile.md|text_score.rb # get scores for the contents of a file
=end
require 'open-uri'
require 'net/http'
@prateek
prateek / doingrc.yml
Last active August 29, 2015 14:01 — forked from ttscoff/doingrc.yml
---
current_section: Currently
doing_file: ~/Dropbox/notes/wwid-??.md
views:
times:
date_format: '%a %_I:%M%P'
section: All
count: 0
wrap_width: 0
template: '%boldblack%date %boldcyan > %boldwhite%title %boldbgwhite%boldblack%interval%default'

Shortcut for OSX dock to run irssi in tmux with iTerm2

Synopsis

I run irssi inside a tmux session on OSX. I often close the terminal as I usually get notified by growl about important stuff. I don't want to open a terminal and write a command every time I want to check IRC.

The solution

1. Create a shell script with the following content

#!/bin/zsh
/usr/local/bin/tmux attach -d -t irssi || /usr/local/bin/tmux new -s irssi irssi

Install Steps for RMR

These steps have been tested with

  • Oracle Linux 6.4
  • RHEL 6.5
  • CDH 5.1

Note I wish this was parceled up.

Step 0 (all nodes)

#!/bin/bash
# restart-cm-service.sh
# Restart a Cloudera-Manager-managed service via the REST API
# -----------------------------------------------------------------------
# Copyright (C) 2014 Cloudera and Ben White
# Cloudera Manager credentials
USERNAME=admin
PASSWORD=admin
#!/bin/bash
# Adapted from
# a) http://blog.cloudera.com/blog/2013/11/approaches-to-backup-and-disaster-recovery-in-hbase/#snapshots
# b) https://blog.cloudera.com/blog/2013/03/introduction-to-apache-hbase-snapshots/
# c) https://groups.google.com/forum/#!topic/nosql-databases/osC58F5PDsE
# 1. create table
hbase shell <<EOF
disable 'testTable'
package mapreduce
/**
* This is an attempt to find a minimal set of type classes that describe the map-reduce programming model
* (the underlying model of Google map/reduce, Hadoop, Spark and others)
* The idea is to have:
* 1) lawful types that fully constrain correctness
* 2) a minimal set of laws (i.e. we can't remove any laws,
* 3) able to fully express existing map/reduce in terms of these types
*

New Distcp Usage Notes


Make sure you can access the S3 bucket with a command such as:

$ hdfs dfs -put localfile s3n://$BUCKET/

If this works, uncompress the attached jar and try the command below.

@prateek
prateek / tree.md
Last active August 29, 2015 14:10 — forked from hrldcpr/tree.md

One-line Tree in Python

Using Python's built-in defaultdict we can easily define a tree data structure:

def tree(): return defaultdict(tree)

That's it!