Skip to content

Instantly share code, notes, and snippets.

@mieitza
mieitza / cypher.py
Created January 3, 2018 15:06 — forked from gregroberts/cypher.py
A function for pandas to get results of a cypher query directly into a DataFrame
from pandas.core.api import DataFrame
from pandas.tseries.tools import to_datetime
#save me at site-packages\pandas\io\cypher.py
def read_cypher(cypher, con, index_col=None, params = {},parse_dates = None, columns= None):
'''
Run a Cypher query against the graph at con, put the results into a df
Parameters
@mieitza
mieitza / mongodb_2_pandas.py
Created January 3, 2018 15:05 — forked from jmquintana79/mongodb_2_pandas.py
Functions to connect and read mongodb data to pandas df
import pandas as pd
from pymongo import MongoClient
# set connection with mongodb
def _connect_mongo(host, port, username, password, db):
""" A util for making a connection to mongo """
if username and password:
mongo_uri = 'mongodb://%s:%s@%s:%s/%s' % (username, password, host, port, db)
conn = MongoClient(mongo_uri)
@mieitza
mieitza / pyspark_to_elasticsearch.py
Created January 3, 2018 15:04 — forked from adrianva/pyspark_to_elasticsearch.py
Save RDD and/or DataFrame from Spark to Elasticsearch
# Elastic configs
es_read_conf = {
"es.nodes" : "localhost",
"es.port" : "9200",
"es.resource" : "twitter/tweet"
}
es_write_conf = {
"es.nodes" : "localhost",
"es.port" : "9200",
@mieitza
mieitza / pyspark_to_elasticsearch.py
Created January 3, 2018 15:04 — forked from adrianva/pyspark_to_elasticsearch.py
Save RDD and/or DataFrame from Spark to Elasticsearch
# Elastic configs
es_read_conf = {
"es.nodes" : "localhost",
"es.port" : "9200",
"es.resource" : "twitter/tweet"
}
es_write_conf = {
"es.nodes" : "localhost",
"es.port" : "9200",
@mieitza
mieitza / keybase.md
Last active November 27, 2016 20:13

Keybase proof

I hereby claim:

  • I am mieitza on github.
  • I am mieitza (https://keybase.io/mieitza) on keybase.
  • I have a public key ASCpl4A69Tk1VjHXdVtvfnJ_c0zRfD7MQscb5SPPhOf7kAo

To claim this, I am signing this object:

@mieitza
mieitza / cmd_flag_parser.go
Created November 1, 2015 19:33 — forked from andreagrandi/cmd_flag_parser.go
Parse command line flags in Go
package main
import (
"flag"
"fmt"
)
var hostName = flag.String("host", "localhost", "Hostname or IP you want to run this service on")
var portNumber = flag.Int("port", 8080, "Port you want this service to listen on (default 8080)")
@mieitza
mieitza / substitute_string
Created November 1, 2015 19:32 — forked from andreagrandi/substitute_string
Substitute a placeholder like {{hostname}} that is inside a file, with the output of the hostname command.
sed -i "s/{{hostname}}/`hostname`/g" /etc/supervisor/conf.d/celeryd.conf
import commands
import time
import sys
from socket import socket
import argparse
import os
import pymongo
from pymongo import Connection
import yaml
@mieitza
mieitza / deploy_spark.sh
Last active October 9, 2015 23:38
Deploy spark
#!/bin/sh
# installation of Oracle Java JDK.
sudo apt-get -y update
sudo apt-get -y install python-software-properties
sudo add-apt-repository ppa:webupd8team/java
sudo apt-get -y update
sudo apt-get -y install oracle-java7-installer
# Installation of commonly used python scipy tools
@mieitza
mieitza / doInstall.sh
Last active August 29, 2015 14:05
Install Docker pre req for Ubuntu12.04
#!/bin/bash -x
sudo apt-get update
sudo apt-get install linux-image-generic-lts-raring linux-headers-generic-lts-raring
sudo apt-get install apt-transport-https
sudo apt-key adv --keyserver hkp://keyserver.ubuntu.com:80 --recv-keys 36A1D7869245C8950F966E92D8576A8BA88D21E9
sudo sh -c "echo deb https://get.docker.io/ubuntu docker main\
> /etc/apt/sources.list.d/docker.list"
sudo apt-get update
sudo groupadd docker