mahout-user mailing list archives

From adam35413 <>
Subject Re: Dealing with kmean and meanshift output
Date Fri, 09 Apr 2010 19:01:48 GMT

I took a look at the code, and the only thing that seemed to be required was
the sequence file.  I pulled the part-00000 file from the output/clusterPoints/
folder on my Hadoop cluster and tried the following command:

bin/mahout clusterdump --seqFileDir part-00000 --output testFile.txt

This resulted in the following error:

no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
Apr 9, 2010 3:00:25 PM org.slf4j.impl.JCLLoggerAdapter error
SEVERE: MahoutDriver failed with args: [--seqFileDir, part-00000, --output,
testFile.txt, null]
Exception in thread "main" java.lang.NullPointerException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at java.lang.reflect.Method.invoke(
	at org.apache.hadoop.util.ProgramDriver.driver(
	at org.apache.mahout.driver.MahoutDriver.main(

Strange, since $HADOOP_HOME is actually set.  Thoughts?
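
One way to narrow this down is to check whether the variables are actually
exported to child processes, since a variable that is set in the shell but not
exported is invisible to a launcher script. The sketch below is a minimal
check under that assumption; check_exported is a hypothetical helper, not part
of Mahout or Hadoop, and the wording of the message above suggests (but does
not confirm) that both HADOOP_HOME and HADOOP_CONF_DIR are tested.

```shell
# Hypothetical helper: report whether a variable is visible to child processes.
# Assumption: the launcher script only sees exported variables.
check_exported() {
  name="$1"
  # printenv is an external command, so it only sees exported variables,
  # unlike a plain `echo "$VAR"` run inside the current shell.
  if printenv "$name" >/dev/null 2>&1; then
    echo "$name is exported"
  else
    echo "$name is NOT exported (or unset)"
  fi
}

check_exported HADOOP_HOME
check_exported HADOOP_CONF_DIR
```

If either line reports "NOT exported", running `export HADOOP_HOME` (or
`export HADOOP_CONF_DIR`) in the same shell before retrying would rule this
cause in or out.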

Jeff Eastman wrote:
> The dictionary file contains a list (not sure how it's delimited) of 
> element names for the input Vectors and is optional. See the new code in 
> trunk/utils in TestClusterDumper for some examples. I need to write tests 
> for meanshift and also fuzzy kmeans to make sure they work but I 
> imagine they do. I also need to write tests that include the points, but 
> that appears to be done in memory so it likely won't scale to your 
> 5-node data set.
> Jeff
> adam35413 wrote:
>> I have been able to successfully run the kmean and meanshift examples on a
>> 5-node Hadoop cluster.  However, when it comes to dealing with the output, I
>> am a bit confused.  I found the following page:
>>, but when I went to
>> track down the dictionary file I was unable to find it.  Do I need to
>> generate the dictionary file separately or manually?
>> Thanks!