mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Jenkins Server <jenk...@builds.apache.org>
Subject Build failed in Jenkins: Mahout-Examples-Classify-20News #64
Date Fri, 29 Jun 2012 20:15:26 GMT
See <https://builds.apache.org/job/Mahout-Examples-Classify-20News/64/changes>

Changes:

[ssc] RegressionResultAnalyzer must use US locale

------------------------------------------
[...truncated 6207 lines...]
888	441
889	440
890	440
891	440
892	440
893	440
894	440
895	440
896	439
897	439
898	439
899	438
900	437
901	437
902	437
903	434
904	434
905	434
906	434
907	434
908	433
909	433
910	433
911	432
912	431
913	431
914	430
915	430
916	430
917	428
918	428
919	427
920	426
921	426
922	425
923	425
924	425
925	424
926	424
927	424
928	423
929	423
930	423
931	422
932	422
933	422
934	422
935	421
936	420
937	420
938	419
939	419
940	419
941	419
942	419
943	419
944	418
945	417
946	416
947	415
948	415
949	415
950	414
951	413
952	412
953	411
954	410
955	410
956	409
957	408
958	408
959	408
960	407
961	407
962	407
963	407
964	407
965	406
966	406
967	405
968	405
969	404
970	403
971	402
972	402
973	402
974	401
975	400
976	400
977	399
978	398
979	398
980	397
981	396
982	396
983	396
984	396
985	395
986	394
987	394
988	394
989	393
990	393
991	393
992	392
993	392
994	392
995	391
996	391
997	391
998	391
999	390
1000	390
12/06/29 20:14:45 INFO driver.MahoutDriver: Program took 454126 ms (Minutes: 7.568783333333333)
Testing on /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/ with model: /tmp/news-group.model
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:14:46 WARN driver.MahoutDriver: No org.apache.mahout.classifier.sgd.TestNewsGroups.props
found on classpath, will use command-line arguments only
7532 test files
=======================================================
Summary
-------------------------------------------------------
Correctly Classified Instances          :       5520	   73.2873%
Incorrectly Classified Instances        :       2012	   26.7127%
Total Classified Instances              :       7532

=======================================================
Confusion Matrix
-------------------------------------------------------
a    	b    	c    	d    	e    	f    	g    	h    	i    	j    	k    	l    	m    	n    	o    
p    	q    	r    	s    	t    	<--Classified as
37   	23   	0    	4    	38   	82   	17   	32   	1    	2    	5    	0    	0    	3    	10   
11   	18   	3    	4    	95   	 |  385   	a     = comp.sys.mac.hardware
2    	290  	0    	10   	28   	21   	7    	4    	2    	5    	2    	1    	0    	2    	4    
1    	5    	1    	3    	6    	 |  394   	b     = comp.os.ms-windows.misc
0    	1    	285  	1    	2    	0    	3    	1    	3    	7    	6    	16   	3    	16   	1    
2    	3    	6    	2    	6    	 |  364   	c     = talk.politics.guns
1    	30   	0    	287  	39   	3    	5    	12   	2    	2    	3    	0    	0    	0    	1    
0    	2    	2    	1    	5    	 |  395   	d     = comp.windows.x
4    	18   	1    	14   	292  	7    	9    	8    	0    	2    	7    	0    	0    	0    	1    
2    	3    	4    	5    	12   	 |  389   	e     = comp.graphics
1    	52   	0    	4    	18   	229  	2    	14   	0    	1    	1    	1    	0    	1    	4    
2    	1    	1    	0    	60   	 |  392   	f     = comp.sys.ibm.pc.hardware
0    	0    	1    	2    	9    	0    	352  	6    	2    	0    	3    	0    	3    	2    	0    
0    	3    	0    	0    	11   	 |  394   	g     = sci.space
0    	2    	1    	1    	2    	12   	5    	344  	0    	0    	2    	1    	0    	0    	3    
6    	3    	0    	1    	7    	 |  390   	h     = misc.forsale
0    	3    	1    	1    	6    	1    	3    	3    	312  	25   	2    	21   	0    	2    	2    
0    	10   	1    	1    	4    	 |  398   	i     = soc.religion.christian
0    	1    	1    	0    	4    	0    	11   	2    	24   	220  	3    	29   	4    	1    	3    
0    	9    	2    	2    	3    	 |  319   	j     = alt.atheism
0    	2    	0    	1    	3    	0    	0    	5    	3    	1    	356  	0    	0    	1    	1    
1    	0    	0    	15   	8    	 |  397   	k     = rec.sport.baseball
0    	1    	12   	0    	2    	2    	11   	2    	34   	41   	5    	122  	2    	3    	1    
1    	8    	0    	2    	2    	 |  251   	l     = talk.religion.misc
0    	0    	3    	1    	1    	0    	2    	0    	9    	29   	4    	1    	300  	11   	2    
3    	2    	2    	3    	3    	 |  376   	m     = talk.politics.mideast
0    	1    	95   	0    	4    	0    	7    	1    	5    	8    	0    	8    	3    	160  	2    
1    	9    	4    	2    	0    	 |  310   	n     = talk.politics.misc
0    	1    	0    	2    	1    	0    	0    	5    	2    	1    	2    	0    	0    	2    	365  
6    	6    	0    	0    	5    	 |  398   	o     = rec.motorcycles
0    	2    	1    	0    	5    	2    	7    	11   	3    	2    	8    	0    	0    	1    	18   
293  	5    	2    	3    	33   	 |  396   	p     = rec.autos
0    	2    	4    	2    	15   	1    	10   	10   	13   	7    	6    	2    	1    	1    	3    
7    	283  	0    	2    	27   	 |  396   	q     = sci.med
3    	3    	2    	1    	6    	1    	5    	2    	4    	3    	3    	2    	0    	0    	2    
1    	4    	339  	0    	15   	 |  396   	r     = sci.crypt
1    	0    	0    	0    	0    	1    	1    	1    	4    	2    	17   	1    	0    	0    	1    
0    	2    	0    	365  	3    	 |  399   	s     = rec.sport.hockey
0    	3    	1    	2    	20   	14   	16   	9    	2    	2    	3    	0    	1    	1    	4    
6    	6    	13   	1    	289  	 |  393   	t     = sci.electronics



Avg. Log-likelihood: -1.1212616906335722 25%-ile: -1.6696006521059248 75%-ile: -0.5413681618725742
12/06/29 20:15:05 INFO driver.MahoutDriver: Program took 18648 ms (Minutes: 0.3108)
+ echo 2
+ ./examples/bin/classify-20newsgroups.sh
Please select a number to choose the corresponding task to run
1. cnaivebayes
2. naivebayes
3. sgd
4. clean -- cleans up the work area in /tmp/mahout-work-jenkins
ok. You chose 2 and we'll use naivebayes
creating work directory at /tmp/mahout-work-jenkins
+ echo 'Preparing 20newsgroups data'
Preparing 20newsgroups data
+ rm -rf /tmp/mahout-work-jenkins/20news-all
+ mkdir /tmp/mahout-work-jenkins/20news-all
+ cp -R /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/alt.atheism /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.graphics
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.os.ms-windows.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.sys.ibm.pc.hardware
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.sys.mac.hardware /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/comp.windows.x
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/misc.forsale /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.autos
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.motorcycles /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.sport.baseball
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/rec.sport.hockey /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.crypt
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.electronics /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.med
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/sci.space /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/soc.religion.christian
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.guns /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.mideast
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.politics.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-test/talk.religion.misc
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/alt.atheism /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.graphics
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.os.ms-windows.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.sys.ibm.pc.hardware
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.sys.mac.hardware /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/comp.windows.x
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/misc.forsale /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.autos
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.motorcycles /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.sport.baseball
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/rec.sport.hockey /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.crypt
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.electronics /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.med
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/sci.space /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/soc.religion.christian
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.guns /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.mideast
/tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.politics.misc /tmp/mahout-work-jenkins/20news-bydate/20news-bydate-train/talk.religion.misc
/tmp/mahout-work-jenkins/20news-all
+ echo 'Creating sequence files from 20newsgroups data'
Creating sequence files from 20newsgroups data
+ ./bin/mahout seqdirectory -i /tmp/mahout-work-jenkins/20news-all -o /tmp/mahout-work-jenkins/20news-seq
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:15:07 INFO common.AbstractJob: Command line arguments: {--charset=[UTF-8], --chunkSize=[64],
--endPhase=[2147483647], --fileFilterClass=[org.apache.mahout.text.PrefixAdditionFilter],
--input=[/tmp/mahout-work-jenkins/20news-all], --keyPrefix=[], --output=[/tmp/mahout-work-jenkins/20news-seq],
--startPhase=[0], --tempDir=[temp]}
12/06/29 20:15:14 INFO driver.MahoutDriver: Program took 6972 ms (Minutes: 0.1162)
+ echo 'Converting sequence files to vectors'
Converting sequence files to vectors
+ ./bin/mahout seq2sparse -i /tmp/mahout-work-jenkins/20news-seq -o /tmp/mahout-work-jenkins/20news-vectors
-lnorm -nv -wt tfidf
hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-jcl-1.6.6.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: Found binding in [jar:<https://builds.apache.org/job/Mahout-Examples-Classify-20News/ws/trunk/examples/target/dependency/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]>
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Maximum n-gram size is:
1
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Minimum LLR value: 1.0
12/06/29 20:15:15 INFO vectorizer.SparseVectorsFromSequenceFiles: Number of reduce tasks:
1
12/06/29 20:15:15 INFO input.FileInputFormat: Total input paths to process : 1
12/06/29 20:15:16 INFO mapred.JobClient: Running job: job_local_0001
12/06/29 20:15:17 INFO mapred.JobClient:  map 0% reduce 0%
12/06/29 20:15:21 INFO mapred.Task: Task:attempt_local_0001_m_000000_0 is done. And is in
the process of commiting
12/06/29 20:15:21 INFO mapred.LocalJobRunner: 
12/06/29 20:15:21 INFO mapred.Task: Task attempt_local_0001_m_000000_0 is allowed to commit
now
12/06/29 20:15:21 INFO output.FileOutputCommitter: Saved output of task 'attempt_local_0001_m_000000_0'
to /tmp/mahout-work-jenkins/20news-vectors/tokenized-documents
12/06/29 20:15:22 INFO mapred.LocalJobRunner: 
12/06/29 20:15:22 INFO mapred.LocalJobRunner: 
12/06/29 20:15:22 INFO mapred.Task: Task 'attempt_local_0001_m_000000_0' done.
12/06/29 20:15:23 INFO mapred.JobClient:  map 100% reduce 0%
12/06/29 20:15:23 INFO mapred.JobClient: Job complete: job_local_0001
12/06/29 20:15:23 INFO mapred.JobClient: Counters: 8
12/06/29 20:15:23 INFO mapred.JobClient:   File Output Format Counters 
12/06/29 20:15:23 INFO mapred.JobClient:     Bytes Written=27717956
12/06/29 20:15:23 INFO mapred.JobClient:   File Input Format Counters 
12/06/29 20:15:23 INFO mapred.JobClient:     Bytes Read=36979301
12/06/29 20:15:23 INFO mapred.JobClient:   FileSystemCounters
12/06/29 20:15:23 INFO mapred.JobClient:     FILE_BYTES_READ=67766861
12/06/29 20:15:23 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=58778031
12/06/29 20:15:23 INFO mapred.JobClient:   Map-Reduce Framework
12/06/29 20:15:23 INFO mapred.JobClient:     Map input records=18846
12/06/29 20:15:23 INFO mapred.JobClient:     Spilled Records=0
12/06/29 20:15:23 INFO mapred.JobClient:     SPLIT_RAW_BYTES=113
12/06/29 20:15:23 INFO mapred.JobClient:     Map output records=18846
12/06/29 20:15:23 INFO input.FileInputFormat: Total input paths to process : 1
12/06/29 20:15:23 INFO mapred.JobClient: Running job: job_local_0002
12/06/29 20:15:24 WARN mapred.LocalJobRunner: job_local_0002
java.lang.ClassCastException: org.apache.hadoop.mapreduce.lib.input.FileSplit cannot be cast
to org.apache.hadoop.mapred.InputSplit
	at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:412)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
12/06/29 20:15:24 INFO mapred.JobClient:  map 0% reduce 0%
12/06/29 20:15:24 INFO mapred.JobClient: Job complete: job_local_0002
12/06/29 20:15:24 INFO mapred.JobClient: Counters: 0
Exception in thread "main" java.lang.IllegalStateException: Job failed!
	at org.apache.mahout.vectorizer.DictionaryVectorizer.startWordCounting(DictionaryVectorizer.java:360)
	at org.apache.mahout.vectorizer.DictionaryVectorizer.createTermFrequencyVectors(DictionaryVectorizer.java:171)
	at org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.run(SparseVectorsFromSequenceFiles.java:272)
	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
	at org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.main(SparseVectorsFromSequenceFiles.java:55)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
	at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
	at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
Build step 'Execute shell' marked build as failure

Mime
View raw message