Message-ID: <27067350.post@talk.nabble.com>
Date: Thu, 7 Jan 2010 13:30:44 -0800 (PST)
From: diveman
To: mahout-user@lucene.apache.org
Subject: Re: Kmeans clustering
In-Reply-To: <8f8e14c41001071257j7b88a788h80dfc289ec94b138@mail.gmail.com>
References: <27066415.post@talk.nabble.com> <8f8e14c41001071257j7b88a788h80dfc289ec94b138@mail.gmail.com>

Thanks!
And when I try to run the dumper, it gives me the following:

hadoop jar mahout-utils-0.3-SNAPSHOT.jar org.apache.mahout.utils.clustering.ClusterDumper -s output/clusters-6/ -o /data/output

Exception in thread "main" java.lang.NullPointerException
    at org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:112)
    at org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:253)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:156)

Drew Farris wrote:
>
> Each iteration of k-means clustering will produce a clusters-X directory.
> In this case, there were 7 iterations prior to the clusters converging.
> The final cluster data can be found in clusters-6.
>
> There is a utility in mahout-utils,
> o.a.m.utils.clustering.ClusterDumper, that can be used to dump the data
> from clusters-6 and points into a json-like format. You could use that
> code as a starting point for discovering how to get at the data you're
> interested in.
>
> On Thu, Jan 7, 2010 at 3:23 PM, diveman wrote:
>>
>> I'm new to Mahout. I installed 0.3 on a 4-node cluster and ran the
>> Mahout k-means example with the syntheticcontrol data. I got outputs
>> like the following:
>>
>> output/canopies
>> output/clusters-0
>> output/clusters-1
>> output/clusters-2
>> output/clusters-3
>> output/clusters-4
>> output/clusters-5
>> output/clusters-6
>> output/data
>> output/points
>>
>> As I understand it, in the points folder each point is labeled with a
>> cluster id. I'm wondering where I can find the cluster center, radius
>> info, etc. And what's in clusters-0~6? BTW, the sample data has 6
>> groups and the result has 7 clusters, any clue?
>>
>> Thanks!
--
View this message in context: http://old.nabble.com/Kmeans-clustering-tp27066415p27067350.html
Sent from the Mahout User List mailing list archive at Nabble.com.
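Since the final cluster data lives in the highest-numbered clusters-N directory (clusters-6 in this run), here is a minimal sketch of picking that directory out of an output listing before pointing the dumper at it. It works on plain directory names only and does not use any Mahout or Hadoop API; the class name FinalClusterDir and the method finalClustersDir are hypothetical helpers made up for illustration, and the names in main are simply copied from the listing in this thread.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Mahout: finds the last k-means iteration
// by looking for the largest N among entries named "clusters-N".
final class FinalClusterDir {

    // Matches names such as "clusters-0" or "clusters-12" (up to 9 digits,
    // so Integer.parseInt cannot overflow).
    private static final Pattern CLUSTERS_DIR = Pattern.compile("clusters-(\\d{1,9})");

    /** Returns the "clusters-N" name with the largest N, or empty if none match. */
    static Optional<String> finalClustersDir(List<String> names) {
        String best = null;
        int bestIteration = -1;
        for (String name : names) {
            Matcher m = CLUSTERS_DIR.matcher(name);
            if (m.matches()) {
                // Compare numerically so that clusters-10 sorts after clusters-9.
                int iteration = Integer.parseInt(m.group(1));
                if (iteration > bestIteration) {
                    bestIteration = iteration;
                    best = name;
                }
            }
        }
        return Optional.ofNullable(best);
    }

    public static void main(String[] args) {
        // Entry names taken from the output listing quoted in this thread.
        List<String> names = Arrays.asList(
                "canopies", "clusters-0", "clusters-1", "clusters-2", "clusters-3",
                "clusters-4", "clusters-5", "clusters-6", "data", "points");
        System.out.println(finalClustersDir(names).orElse("none")); // prints "clusters-6"
    }
}
```

The iteration numbers are compared as integers rather than as strings, so a longer run that reaches clusters-10 or beyond would still resolve to the last iteration.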