mahout-user mailing list archives

From Cui tony <tony.cui1...@gmail.com>
Subject Re: How to get results of K-means
Date Fri, 19 Feb 2010 13:33:22 GMT
Thank you all.
I know how to convert a SequenceFile to a text file now.

I'm sorry, Grant, but your example is still a little complicated for me. How
can I run it with the bin/mahout command?
This is the command line I used:
 hadoop jar data/mahout-examples-0.2.job org.apache.mahout.clustering.syntheticcontrol.kmeans.Job

And, Grant, could you give me some information on how to use the SequenceFile
Dumper or the ClusterDumper class?


Thank you!

On February 18, 2010 at 10:14 PM, Grant Ingersoll <gsingers@apache.org> wrote:

> You can use the ClusterDumper class (I just posted an example at
> http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/)
> or you can use the SequenceFile Dumper.
>
> If you generated from a Lucene index, you can also use a class in utilities
> (ClusterLabels?) to print out labels based on LLR calculations.
>
> HTH,
> Grant
>
> On Feb 18, 2010, at 2:32 AM, Cui tony wrote:
>
> > Hi,
> >  I'm a beginner with Mahout. I have figured out how to run Mahout's k-means,
> > but after that I have no idea how to get the clustered result.
> >
> > My input data is the standard example data: synthetic_control.data
> >
> > After running, I got a points folder, which I am told contains the
> > result. The points folder has two main files: part-00000 and part-00001
> >
> >
> > The file part-00000 looks like this:
> >
> EQ^F^Yorg.apache.hadoop.io.Text^Yorg.apache.hadoop.io.Text^@^@^@^@^@^@羉?[34m~YYVb}?~\~UP_Z~N~V^@^@^C:^@^@^C8~N^C5{"class":"org.apache.mahout.matrix.Sparsor","vector":"{\"values\":{\"indices\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59],\"values\":[28.7812,34.4632,31.3381,31.2834,28.9207,33.7596,25.3969,27.7849,35.2479,27.1159,32.8717,29.2171,36.0253,32.337,34.5249,32.8717,34.1173,26.5235,27.6623,26.3693,25.7744,29.27,30.7326,29.5054,33.0292,25.04,28.9167,24.3437,26.1203,34.9424,25.0293,26.6311,35.6541,28.4353,29.1495,28.1584,26.1927,33.3182,30.9772,27.0443,35.5344,26.2353,28.9964,32.0036,31.0558,34.2553,28.0721,28.9402,35.4973,29.747,31.4333,24.5556,33.7431,25.0466,34.9318,34.9879,32.4721,33.3759,25.4652,25.8717],\"numMappings\":60},\"cardinality\":60,\"lengthSquared\":-1.0,\"name\":\"\"}"}^A2^@^@^C;^@^@^C9~N^C6{"class":"org.apache.mahout.matrix.SparseVector","vector":"{\"values\":{\"indices\":[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59],\"values\":[24.8923,25.741,27.5532,32.8217,27.8789,
> >
> > I'm confused by this result: how can I get the data with the clustered
> > label?
> >
> > thanks~~
>
>
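
For readers who reach this thread with the same question: once the SequenceFile has been converted to text, each record in the dump above appears to be a pair of Text values, with the point as a JSON-formatted vector and a short label after it (the "2" that follows the first vector). Below is a minimal Python sketch of decoding such pairs and grouping points by label. It assumes that the first element is the vector JSON and the second is the cluster id, which is an inference from the dump and not something confirmed in this thread. The function names and the make_key helper are hypothetical, and the sample records are shortened, made-up stand-ins, not real k-means output.

```python
import json
from collections import defaultdict


def decode_vector(key_text):
    """Turn one JSON-formatted vector string into a dense list of floats."""
    outer = json.loads(key_text)
    # The "vector" field is itself a JSON document stored as a string,
    # so it has to be parsed a second time.
    inner = json.loads(outer["vector"])
    dense = [0.0] * inner["cardinality"]
    sparse = inner["values"]
    for index, value in zip(sparse["indices"], sparse["values"]):
        dense[index] = value
    return dense


def group_by_cluster(records):
    """Group (vector_json, cluster_id) pairs into {cluster_id: [vectors]}.

    Assumption: the second element of each pair is the cluster label.
    """
    clusters = defaultdict(list)
    for key_text, cluster_id in records:
        clusters[cluster_id].append(decode_vector(key_text))
    return dict(clusters)


def make_key(values):
    """Hypothetical helper: build a sample key shaped like the dump."""
    inner = {
        "values": {
            "indices": list(range(len(values))),
            "values": values,
            "numMappings": len(values),
        },
        "cardinality": len(values),
        "lengthSquared": -1.0,
        "name": "",
    }
    return json.dumps({
        "class": "org.apache.mahout.matrix.SparseVector",
        "vector": json.dumps(inner),
    })


if __name__ == "__main__":
    # Illustrative records only; the labels here are invented.
    sample = [
        (make_key([28.7812, 34.4632, 31.3381]), "2"),
        (make_key([24.8923, 25.741, 27.5532]), "2"),
        (make_key([1.0, 2.0, 3.0]), "0"),
    ]
    for cluster_id, points in sorted(group_by_cluster(sample).items()):
        print(cluster_id, len(points), points[0])
```

How the (vector, label) pairs are extracted from the text dump depends on the dumper's output layout, so that step is left out here on purpose.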
