- copy files to a directory:
git clone https://gist.github.com/cc7c8cec1188fd387cc2e3ec0f4fed7a.git wordcount
cd wordcount
- see the input files:
cat *.txt
- make sure mapper and reducer are executable:
chmod +x *.scala
- see how mapper works:
cat baa.txt | ./mapper.scala
- see how reducer works:
cat baa.txt | ./mapper.scala | ./reducer.scala
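- optional: mimic the shuffle step locally. Hadoop streaming sorts the mapper output by key before the reducer sees it, so a local dry run can put `sort` between the two stages. The sketch below uses two hypothetical shell stand-ins, `wc_map` and `wc_reduce`, and assumes a mapper that emits `word<TAB>1` and a reducer that sums counts over sorted input. The actual mapper.scala and reducer.scala in the gist may behave differently.

```shell
# Hypothetical stand-ins for mapper.scala and reducer.scala.
# Their behaviour is an assumption, not the gist's actual code.

# Mapper: emit "word<TAB>1" for every whitespace-separated word on stdin.
wc_map() {
  awk '{ for (i = 1; i <= NF; i++) printf "%s\t1\n", $i }'
}

# Reducer: sum the counts of consecutive lines that share a key.
# It relies on its input being sorted by key, as streaming reducer input is.
wc_reduce() {
  awk -F '\t' '
    NR > 1 && $1 != key { printf "%s\t%d\n", key, sum; sum = 0 }
    { key = $1; sum += $2 }
    END { if (NR > 0) printf "%s\t%d\n", key, sum }
  '
}

# Local simulation of map -> sort (shuffle) -> reduce.
printf 'baa baa black sheep\nhave you any wool\n' | wc_map | LC_ALL=C sort | wc_reduce
```

With the real scripts, the equivalent dry run would be cat baa.txt | ./mapper.scala | sort | ./reducer.scala, assuming the reducer expects grouped keys.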
- create a directory on HDFS:
hadoop fs -mkdir -p /wc/in