@danbri
Created September 4, 2011 15:36
grunt> mydir = seqdirectory('ted/txt/', 'ted/foo', IGNORE);
grunt> dump mydir;
2011-09-04 17:33:58,860 [main] INFO org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: NATIVE
2011-09-04 17:33:59,373 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2011-09-04 17:33:59,558 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 3
2011-09-04 17:33:59,558 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 3
2011-09-04 17:33:59,631 [main] INFO org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added to the job
2011-09-04 17:33:59,654 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2011-09-04 17:34:09,497 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - Setting up single store job
2011-09-04 17:34:09,574 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 1 map-reduce job(s) waiting for submission.
2011-09-04 17:34:09,984 [Thread-16] INFO org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2011-09-04 17:34:09,984 [Thread-16] INFO org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
2011-09-04 17:34:09,995 [Thread-16] INFO org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths (combined) to process : 1
2011-09-04 17:34:10,074 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete
2011-09-04 17:34:10,994 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - HadoopJobId: job_201108312028_0274
2011-09-04 17:34:10,995 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - More information at: http://localhost:50030/jobdetails.jsp?jobid=job_201108312028_0274
2011-09-04 17:34:23,526 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 16% complete
2011-09-04 17:34:26,533 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 33% complete
SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
SLF4J: Defaulting to no-operation (NOP) logger implementation
SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details.
Unknown program 'org.apache.mahout.text.SequenceFilesFromDirectory' chosen.
Valid program names are:
baumwelch: : Baum-Welch algorithm for unsupervised HMM training
canopy: : Canopy clustering
cleansvd: : Cleanup and verification of SVD output
dirichlet: : Dirichlet Clustering
eigencuts: : Eigencuts spectral clustering
evaluateFactorization: : compute RMSE of a rating matrix factorization against probes in memory
evaluateFactorizationParallel: : compute RMSE of a rating matrix factorization against probes
fkmeans: : Fuzzy K-means clustering
fpg: : Frequent Pattern Growth
hmmpredict: : Generate random sequence of observations by given HMM
itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
kmeans: : K-means clustering
lda: : Latent Dirchlet Allocation
matrixmult: : Take the product of two matrices
meanshift: : Mean Shift clustering
parallelALS: : ALS-WR factorization of a rating matrix
predictFromFactorization: : predict preferences from a factorization of a rating matrix
recommenditembased: : Compute recommendations using item-based collaborative filtering
rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
seq2sparse: : Sparse Vector generation from Text sequence files
spectralkmeans: : Spectral k-means clustering
splitDataset: : split a rating dataset into training and probe parts
ssvd: : Stochastic SVD
svd: : Lanczos Singular Value Decomposition
testclassifier: : Test Bayes Classifier
trainclassifier: : Train Bayes Classifier
transpose: : Take the transpose of a matrix
vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
viterbi: : Viterbi decoding of hidden states from given output states sequence
2011-09-04 17:34:33,474 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Error running native mapreduce operator job :../../core/target/mahout-core-0.6-SNAPSHOT-job.jar_Native job returned with non-zero return code
2011-09-04 17:34:33,474 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete
2011-09-04 17:34:33,474 [main] ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 0 map reduce job(s) failed!
2011-09-04 17:34:33,475 [main] INFO org.apache.pig.tools.pigstats.SimplePigStats - Script Statistics:
HadoopVersion PigVersion UserId StartedAt FinishedAt Features
0.20.2 0.9.0 danbri 2011-09-04 17:33:59 2011-09-04 17:34:33 NATIVE
Some jobs have failed! Stop running all dependent jobs
Job Stats (time in seconds):
JobId Maps Reduces MaxMapTime MinMapTIme AvgMapTime MaxReduceTime MinReduceTime AvgReduceTime Alias Feature Outputs
job_201108312028_0274 1 0 6 6 6 0 0 0 IGNORE MAP_ONLY
Failed Jobs:
JobId Alias Feature Message Outputs
Input(s):
Successfully read 3 records (37 bytes) from: "hdfs://localhost:9000/user/danbri/migtest/input.txt"
Output(s):
Counters:
Total records written : 0
Total bytes written : 0
Spillable Memory Manager spill count : 0
Total bags proactively spilled: 0
Total records proactively spilled: 0
Job DAG:
job_201108312028_0274 -> null,
null -> null,
null
2011-09-04 17:34:33,479 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Some jobs have failed! Stop running all dependent jobs
2011-09-04 17:34:33,479 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias mydir
Details at logfile: /Users/bandri/working/mahout/trunk/examples/bin/pig_1315150414808.log