Skip to content

Instantly share code, notes, and snippets.

@josep2
Created March 9, 2017 23:29
Show Gist options
  • Save josep2/bae9322d816594a46adf955a67b782da to your computer and use it in GitHub Desktop.
Save josep2/bae9322d816594a46adf955a67b782da to your computer and use it in GitHub Desktop.
import edu.berkeley.cs.succinct._
val conf = new SparkConf().setAppName("Ranking Example")
val sc = new SparkContext(conf)
// A large file of raw hip hop lyrics ~ 100 GB
val hipHopRDD = sc.textFile("/hiphopcorpus").map(_.getBytes)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment