Skip to content

Instantly share code, notes, and snippets.

View iamavnish's full-sized avatar

Avnish Gupta iamavnish

View GitHub Profile
@airawat
airawat / 00-MultipleOutputs
Last active July 17, 2019 10:29
MultipleOutputs sample program - A program that demonstrates how to generate an output file for each key
********************************
Gist
********************************
Motivation
-----------
The typical mapreduce job creates files with the prefix "part-"..and then the "m" or "r" depending
on whether it is a map or a reduce output, and then the part number. There are scenarios where we
may want to create separate files based on criteria-data keys and/or values. Enter the "MultipleOutputs"
functionality.