Skip to content

Instantly share code, notes, and snippets.

@bryanyang0528
Created October 9, 2014 16:39
Show Gist options
  • Save bryanyang0528/94c61bd8c43f03d49f5d to your computer and use it in GitHub Desktop.
Save bryanyang0528/94c61bd8c43f03d49f5d to your computer and use it in GitHub Desktop.
In [3]:
raw_ratings = sc.textFile('/Users/bryanyang/Documents/Data/Movie Rating/ratings.dat')
raw_ratings.setName("raw ratings")
raw_ratings.cache()
Out[3]:
raw ratings MappedRDD[3] at textFile at NativeMethodAccessorImpl.java:-2
In [4]:
entries = raw_ratings.count()
print "%s entries in ratings" %entries
out[4]
100000 entries in ratings
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment