Skip to content

Instantly share code, notes, and snippets.

@pixie79
Last active December 23, 2015 19:29
Show Gist options
  • Save pixie79/6682532 to your computer and use it in GitHub Desktop.
Save pixie79/6682532 to your computer and use it in GitHub Desktop.
register /home/hadoop/pig/jars/elephant-bird-core-4.1.jar;
register /home/hadoop/pig/jars/elephant-bird-hadoop-compat-4.1.jar;
register /home/hadoop/pig/jars/elephant-bird-pig-4.1.jar;
register /home/hadoop/pig/jars/json-simple-1.1.jar;
register /home/hadoop/pig/jars/hadoop-lzo-0.4.15.jar
set mapred.compress.map.output true;
set mapred.output.compress true;
set mapred.output.compression.codec com.hadoop.compression.lzo.LzoCodec;
set mapred.child.java.opts -Djava.library.path=/home/hadoop/pig/libs/;
view_data = LOAD 's3n://mybucketdata/idb/raw/view/ym=201309/d=24/*' USING com.twitter.elephantbird.pig.load.LzoJsonLoader();
view_data = LIMIT view_data 1000;
describe view_data;
dump view_data;
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment