Skip to content

Instantly share code, notes, and snippets.

@josephmisiti
Created August 13, 2012 04:17
Show Gist options
  • Save josephmisiti/3336891 to your computer and use it in GitHub Desktop.
Save josephmisiti/3336891 to your computer and use it in GitHub Desktop.
#!/usr/bin/env python
import sys
# input comes from STDIN (standard input)
for line in sys.stdin:
# remove leading and trailing whitespace
line = line.strip()
# split the line into words
words = line.split()
# increase counters
for word in words:
# write the results to STDOUT (standard output);
# what we output here will be the input for the
# Reduce step, i.e. the input for reducer.py
#
# tab-delimited; the trivial word count is 1
print '%s\t%s' % (word, 1)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment