Skip to content

Instantly share code, notes, and snippets.

@gabhi
Created January 26, 2015 09:10
Show Gist options
  • Save gabhi/b805335bd4f4b1c335a8 to your computer and use it in GitHub Desktop.
Save gabhi/b805335bd4f4b1c335a8 to your computer and use it in GitHub Desktop.
emr sample
JAR location: /home/hadoop/contrib/streaming/hadoop-streaming.jar
Main class: None
Arguments: -files s3://bigdataapp/wcmapper.php,s3://bigdataapp/wcreducer.php -mapper wcmapper.php -reducer wcreducer.php -input s3://aws-publicdatasets/common-crawl/parse-output/segment/1341690169105/textData-00112 -output s3://bigdataapp/wctest -inputformat SequenceFileAsTextInputFormat
Action on failure: Continue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment