@dgadiraju
Created September 4, 2019 16:22
Let us perform a word count to understand
Spark in detail. The goal of word count
is to find how many times each word
occurs in the input. To get the count we
use flatMap to split lines into words,
map to pair each word with 1, and then
either countByKey or reduceByKey to get
the count for each word.
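
The steps above can be sketched in plain Python, without a Spark cluster, to show what each stage does to the data. This is only a local simulation under stated assumptions: the sample `lines` and the helper `reduce_by_key` are hypothetical and not part of Spark; the comments name the PySpark RDD call each step stands in for.

```python
from collections import Counter
from itertools import chain

# Hypothetical sample input; in Spark this would typically come from sc.textFile(...)
lines = ["hello world", "hello spark", "spark word count"]

# flatMap: split every line into words and flatten into one collection.
# RDD equivalent: words = lines_rdd.flatMap(lambda line: line.split(" "))
words = list(chain.from_iterable(line.split(" ") for line in lines))

# map: turn each word into a (word, 1) pair.
# RDD equivalent: pairs = words.map(lambda word: (word, 1))
pairs = [(word, 1) for word in words]


def reduce_by_key(kv_pairs, func):
    """Local stand-in (hypothetical helper) for RDD.reduceByKey:
    merge the values of each key with the given function."""
    result = {}
    for key, value in kv_pairs:
        result[key] = func(result[key], value) if key in result else value
    return result


# reduceByKey: add up the 1s for each word.
# RDD equivalent: pairs.reduceByKey(lambda a, b: a + b)
counts_reduce = reduce_by_key(pairs, lambda a, b: a + b)

# countByKey: count how many pairs exist for each key.
# RDD equivalent: pairs.countByKey()
counts_by_key = dict(Counter(key for key, _ in pairs))

print(counts_reduce)
```

In Spark itself the two options differ in kind: reduceByKey is a transformation that returns another RDD, so the result stays distributed, while countByKey is an action that returns the counts to the driver as a dictionary, which suits cases where the number of distinct keys is small.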