This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
/** | |
* Created by rawkintrevo on 4/5/17. | |
*/ | |
// Only need these to intelliJ doesn't whine | |
import org.apache.mahout.drivers.ItemSimilarityDriver.parser | |
import org.apache.mahout.math._ | |
import org.apache.mahout.math.scalabindings._ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
/** | |
* Created by rawkintrevo on 4/5/17. | |
*/ | |
// Only need these to intelliJ doesn't whine | |
import org.apache.mahout.math._ | |
import org.apache.mahout.math.scalabindings._ | |
import org.apache.mahout.math.drm._ | |
import org.apache.mahout.math.scalabindings.RLikeOps._ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
/** | |
* Created by rawkintrevo on 4/5/17. | |
*/ | |
// Only need these so intelliJ doesn't complain | |
import org.apache.mahout.math._ | |
import org.apache.mahout.math.scalabindings._ | |
import org.apache.mahout.math.drm._ | |
import org.apache.mahout.math.scalabindings.RLikeOps._ | |
import org.apache.mahout.math.drm.RLikeDrmOps._ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/bin/sh | |
sudo apt-get install git | |
sudo apt-get install openssh-server | |
sudo add-apt-repository ppa:openjdk-r/ppa | |
sudo apt-get update | |
sudo apt-get install openjdk-7-jdk openjdk-7-jre-lib | |
sudo apt-get purge maven maven2 | |
wget "http://www.us.apache.org/dist/maven/maven-3/3.3.9/binaries/apache-maven-3.3.9-bin.tar.gz" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
// For parsing Stations | |
// ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/readme.txt | |
// For parsing Observations | |
// ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/by_year/readme.txt | |
case class Observation(station_id: String, date: String, observation_type: String, observation_value: Float, observation_time: String) | |
val obsText = sc.textFile("ftp://anonymous:[email protected]/pub/data/ghcn/daily/by_year/1768.csv.gz") | |
val observations = obsText.map(s=>s.split(","))map(s=> Observation(s(0), s(1), s(2), s(3).toFloat, s(4))) | |
val avgTemps = observations.filter(s => s.observation_type == "TMAX").map(s => (s.station_id, (s.observation_value,1))).reduceByKey((running,next_val) => (running._1 + (next_val._1-running._1)/ (running._2 + 1), running._2 + next_val._2)) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/bin/bash | |
sudo apt-get install git openssh-server openjdk-7-jdk openjdk-7-doc openjdk-7-jre-lib | |
sudo apt-get purge maven maven2 | |
wget "http://www.us.apache.org/dist/maven/maven-3/3.3.3/binaries/apache-maven-3.3.3-bin.tar.gz" | |
tar -zxvf apache-maven-3.3.3-bin.tar.gz | |
sudo mv ./apache-maven-3.3.3 /usr/local | |
sudo ln -s /usr/local/apache-maven-3.3.3/bin/mvn /usr/bin/mvn | |
git clone https://github.com/tillrohrmann/incubator-zeppelin.git |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
%spark // let Zeppelin know what interpretter to use. | |
/* | |
Written by Trevor Grant 10/22/2015 | |
Inspired by word count example at: http://spark.apache.org/examples.html | |
*/ | |
val text = sc.parallelize(List("In the time of chimpanzees, I was a monkey", // some lines of text to analyze | |
"Butane in my veins and I'm out to cut the junkie", | |
"With the plastic eyeballs, spray paint the vegetables", |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
%flink // let Zeppelin know what interpretter to use. | |
val text = env.fromElements("In the time of chimpanzees, I was a monkey", // some lines of text to analyze | |
"Butane in my veins and I'm out to cut the junkie", | |
"With the plastic eyeballs, spray paint the vegetables", | |
"Dog food stalls with the beefcake pantyhose", | |
"Kill the headlights and put it in neutral", | |
"Stock car flamin' with a loser in the cruise control", | |
"Baby's in Reno with the Vitamin D", | |
"Got a couple of couches, sleep on the love seat", |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
import pymc as pm | |
import numpy as np | |
import matplotlib.pyplot as plt | |
from pprint import pprint | |
import pandas as pd | |
def linear_setup(df, ind_cols, dep_col, gb_cols, intercept=True): | |
''' | |
N: Number of observations | |
G: Number of groups |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
""" | |
Bayesian Statistics and Marketing: Rossi, Allenby, McCullough | |
Section 3.7 | |
This is a remarkably poorly defined model. Almost better off going | |
straight for the deep wizardry on his code than dicking with the | |
text. (At least for model definition, because the code is crap too.) | |
D: |
NewerOlder