mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrew Musselman <andrew.mussel...@gmail.com>
Subject cluster-reuters.sh broken in trunk
Date Fri, 24 Jan 2014 16:18:27 GMT
Last night I had this issue when testing out cluster-reuters.sh with no
flags; anyone seen this recently?

14/01/23 22:03:54 INFO driver.MahoutDriver: Program took 286799 ms
(Minutes: 4.779983333333333)
Running on hadoop, using /home/akm/hadoop-0.20.205.0/bin/hadoop and
HADOOP_CONF_DIR=
MAHOUT-JOB:
/home/akm/mahout/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar
14/01/23 22:03:57 INFO common.AbstractJob: Command line arguments:
{--clustering=null,
--clusters=[/tmp/mahout-work-akm/reuters-kmeans-clusters],
--convergenceDelta=[0.5],
--distanceMeasure=[org.apache.mahout.common.distance.CosineDistanceMeasure],
--endPhase=[2147483647],
--input=[/tmp/mahout-work-akm/reuters-out-seqdir-sparse-kmeans/tfidf-vectors/],
--maxIter=[10], --method=[mapreduce], --numClusters=[20],
--output=[/tmp/mahout-work-akm/reuters-kmeans], --overwrite=null,
--startPhase=[0], --tempDir=[temp]}
14/01/23 22:03:57 WARN util.NativeCodeLoader: Unable to load native-hadoop
library for your platform... using builtin-java classes where applicable
14/01/23 22:03:57 INFO compress.CodecPool: Got brand-new compressor
14/01/23 22:03:57 INFO kmeans.RandomSeedGenerator: Wrote 20 Klusters to
/tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
14/01/23 22:03:57 INFO kmeans.KMeansDriver: Input:
/tmp/mahout-work-akm/reuters-out-seqdir-sparse-kmeans/tfidf-vectors
Clusters In: /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
Out: /tmp/mahout-work-akm/reuters-kmeans Distance:
org.apache.mahout.common.distance.CosineDistanceMeasure
14/01/23 22:03:57 INFO kmeans.KMeansDriver: convergence: 0.5 max
Iterations: 10
14/01/23 22:03:57 INFO compress.CodecPool: Got brand-new decompressor
Exception in thread "main" java.lang.IllegalStateException: No input
clusters found in
/tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed. Check your -c
argument.
at
org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:212)
at
org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:143)
at
org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:103)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at
org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:47)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
$ hls /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
Found 1 items
-rw-r--r--   1 akm supergroup        149 2014-01-23 22:03
/tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed


$ hcat /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
SEQorg.apache.hadoop.io.Text5org.apache.mahout.clustering.iterator.ClusterWritable*org.apache.hadoop.io.compress.DefaultCodec�����M5�0ü���$

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message