danbri · September 4, 2011 15:36
diff --git a/gistfile1.txt b/gistfile1.txt
 grunt> mydir = seqdirectory('ted/txt/', 'ted/foo', IGNORE);                                                                              
 grunt> dump mydir;
 2011-09-04 17:33:58,860 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: NATIVE
 2011-09-04 17:33:59,373 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
 2011-09-04 17:33:59,558 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 3
 2011-09-04 17:33:59,558 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 3
 2011-09-04 17:33:59,631 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added to the job
 2011-09-04 17:33:59,654 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
 2011-09-04 17:34:09,497 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - Setting up single store job
 2011-09-04 17:34:09,574 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 1 map-reduce job(s) waiting for submission.
 2011-09-04 17:34:09,984 [Thread-16] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
 2011-09-04 17:34:09,984 [Thread-16] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
 2011-09-04 17:34:09,995 [Thread-16] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths (combined) to process : 1
 2011-09-04 17:34:10,074 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete
 2011-09-04 17:34:10,994 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - HadoopJobId: job_201108312028_0274
 2011-09-04 17:34:10,995 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - More information at: http://localhost:50030/jobdetails.jsp?jobid=job_201108312028_0274
 2011-09-04 17:34:23,526 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 16% complete
 2011-09-04 17:34:26,533 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 33% complete
 SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
 SLF4J: Defaulting to no-operation (NOP) logger implementation
 SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details.
 Unknown program 'org.apache.mahout.text.SequenceFilesFromDirectory' chosen.
 Valid program names are:
  baumwelch: : Baum-Welch algorithm for unsupervised HMM training
  canopy: : Canopy clustering
  cleansvd: : Cleanup and verification of SVD output
  dirichlet: : Dirichlet Clustering
  eigencuts: : Eigencuts spectral clustering
  evaluateFactorization: : compute RMSE of a rating matrix factorization against probes in memory
  evaluateFactorizationParallel: : compute RMSE of a rating matrix factorization against probes
  fkmeans: : Fuzzy K-means clustering
  fpg: : Frequent Pattern Growth
  hmmpredict: : Generate random sequence of observations by given HMM
  itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
  kmeans: : K-means clustering
  lda: : Latent Dirchlet Allocation
  matrixmult: : Take the product of two matrices
  meanshift: : Mean Shift clustering
  parallelALS: : ALS-WR factorization of a rating matrix
  predictFromFactorization: : predict preferences from a factorization of a rating matrix
  recommenditembased: : Compute recommendations using item-based collaborative filtering
  rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
  seq2sparse: : Sparse Vector generation from Text sequence files
  spectralkmeans: : Spectral k-means clustering
  splitDataset: : split a rating dataset into training and probe parts
  ssvd: : Stochastic SVD
  svd: : Lanczos Singular Value Decomposition
  testclassifier: : Test Bayes Classifier
  trainclassifier: : Train Bayes Classifier
  transpose: : Take the transpose of a matrix
  vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
  viterbi: : Viterbi decoding of hidden states from given output states sequence
 2011-09-04 17:34:33,474 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Error running native mapreduce operator job :../../core/target/mahout-core-0.6-SNAPSHOT-job.jar_Native job returned with non-zero return code
 2011-09-04 17:34:33,474 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete
 2011-09-04 17:34:33,474 [main] ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 0 map reduce job(s) failed!
 2011-09-04 17:34:33,475 [main] INFO  org.apache.pig.tools.pigstats.SimplePigStats - Script Statistics: 

 HadoopVersion	PigVersion	UserId	StartedAt	FinishedAt	Features
 0.20.2	0.9.0	danbri	2011-09-04 17:33:59	2011-09-04 17:34:33	NATIVE

 Some jobs have failed! Stop running all dependent jobs

 Job Stats (time in seconds):
 JobId	Maps	Reduces	MaxMapTime	MinMapTIme	AvgMapTime	MaxReduceTime	MinReduceTime	AvgReduceTime	Alias	Feature	Outputs
 job_201108312028_0274	1	0	6	6	6	0	0	0	IGNORE	MAP_ONLY	

 Failed Jobs:
 JobId	Alias	Feature	Message	Outputs

 Input(s):
 Successfully read 3 records (37 bytes) from: "hdfs://localhost:9000/user/danbri/migtest/input.txt"

 Output(s):

 Counters:
 Total records written : 0
 Total bytes written : 0
 Spillable Memory Manager spill count : 0
 Total bags proactively spilled: 0
 Total records proactively spilled: 0

 Job DAG:
 job_201108312028_0274	->	null,
 null	->	null,
 null


 2011-09-04 17:34:33,479 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Some jobs have failed! Stop running all dependent jobs
 2011-09-04 17:34:33,479 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias mydir
 Details at logfile: /Users/bandri/working/mahout/trunk/examples/bin/pig_1315150414808.log
	grunt> mydir = seqdirectory('ted/txt/', 'ted/foo', IGNORE);
	grunt> dump mydir;
	2011-09-04 17:33:58,860 [main] INFO org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: NATIVE
	2011-09-04 17:33:59,373 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
	2011-09-04 17:33:59,558 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 3
	2011-09-04 17:33:59,558 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 3
	2011-09-04 17:33:59,631 [main] INFO org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added to the job
	2011-09-04 17:33:59,654 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
	2011-09-04 17:34:09,497 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - Setting up single store job
	2011-09-04 17:34:09,574 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 1 map-reduce job(s) waiting for submission.
	2011-09-04 17:34:09,984 [Thread-16] INFO org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
	2011-09-04 17:34:09,984 [Thread-16] INFO org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
	2011-09-04 17:34:09,995 [Thread-16] INFO org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths (combined) to process : 1
	2011-09-04 17:34:10,074 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete
	2011-09-04 17:34:10,994 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - HadoopJobId: job_201108312028_0274
	2011-09-04 17:34:10,995 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - More information at: http://localhost:50030/jobdetails.jsp?jobid=job_201108312028_0274
	2011-09-04 17:34:23,526 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 16% complete
	2011-09-04 17:34:26,533 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 33% complete
	SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
	SLF4J: Defaulting to no-operation (NOP) logger implementation
	SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details.
	Unknown program 'org.apache.mahout.text.SequenceFilesFromDirectory' chosen.
	Valid program names are:
	baumwelch: : Baum-Welch algorithm for unsupervised HMM training
	canopy: : Canopy clustering
	cleansvd: : Cleanup and verification of SVD output
	dirichlet: : Dirichlet Clustering
	eigencuts: : Eigencuts spectral clustering
	evaluateFactorization: : compute RMSE of a rating matrix factorization against probes in memory
	evaluateFactorizationParallel: : compute RMSE of a rating matrix factorization against probes
	fkmeans: : Fuzzy K-means clustering
	fpg: : Frequent Pattern Growth
	hmmpredict: : Generate random sequence of observations by given HMM
	itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
	kmeans: : K-means clustering
	lda: : Latent Dirchlet Allocation
	matrixmult: : Take the product of two matrices
	meanshift: : Mean Shift clustering
	parallelALS: : ALS-WR factorization of a rating matrix
	predictFromFactorization: : predict preferences from a factorization of a rating matrix
	recommenditembased: : Compute recommendations using item-based collaborative filtering
	rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
	seq2sparse: : Sparse Vector generation from Text sequence files
	spectralkmeans: : Spectral k-means clustering
	splitDataset: : split a rating dataset into training and probe parts
	ssvd: : Stochastic SVD
	svd: : Lanczos Singular Value Decomposition
	testclassifier: : Test Bayes Classifier
	trainclassifier: : Train Bayes Classifier
	transpose: : Take the transpose of a matrix
	vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
	viterbi: : Viterbi decoding of hidden states from given output states sequence
	2011-09-04 17:34:33,474 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Error running native mapreduce operator job :../../core/target/mahout-core-0.6-SNAPSHOT-job.jar_Native job returned with non-zero return code
	2011-09-04 17:34:33,474 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete
	2011-09-04 17:34:33,474 [main] ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 0 map reduce job(s) failed!
	2011-09-04 17:34:33,475 [main] INFO org.apache.pig.tools.pigstats.SimplePigStats - Script Statistics:

	HadoopVersion PigVersion UserId StartedAt FinishedAt Features
	0.20.2 0.9.0 danbri 2011-09-04 17:33:59 2011-09-04 17:34:33 NATIVE

	Some jobs have failed! Stop running all dependent jobs

	Job Stats (time in seconds):
	JobId Maps Reduces MaxMapTime MinMapTIme AvgMapTime MaxReduceTime MinReduceTime AvgReduceTime Alias Feature Outputs
	job_201108312028_0274 1 0 6 6 6 0 0 0 IGNORE MAP_ONLY

	Failed Jobs:
	JobId Alias Feature Message Outputs

	Input(s):
	Successfully read 3 records (37 bytes) from: "hdfs://localhost:9000/user/danbri/migtest/input.txt"

	Output(s):

	Counters:
	Total records written : 0
	Total bytes written : 0
	Spillable Memory Manager spill count : 0
	Total bags proactively spilled: 0
	Total records proactively spilled: 0

	Job DAG:
	job_201108312028_0274 -> null,
	null -> null,
	null


	2011-09-04 17:34:33,479 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Some jobs have failed! Stop running all dependent jobs
	2011-09-04 17:34:33,479 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias mydir
	Details at logfile: /Users/bandri/working/mahout/trunk/examples/bin/pig_1315150414808.log