mahout-user mailing list archives

From Grant Ingersoll <gsing...@apache.org>
Subject Re: k-Means questions
Date Fri, 26 Jun 2009 11:15:23 GMT

On Jun 25, 2009, at 10:17 PM, Ted Dunning wrote:

> Rephrasing:
>
> Option 1) pick a single (or three) random input vector as the initial
> position of each centroid
>
> Option 2) assign every input vector to some centroid at random and compute
> the resulting centroids
>
> Option (1) is like (2), but it only assigns k input vectors while option (2)
> assigns all input vectors to some cluster.  Many people use (2), but (1)
> generally works better for me.

Gotcha.  Option 1 was what I had in mind.
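
For illustration, the two options could be sketched roughly as below. This is plain Python, not Mahout code; the function names (`init_from_samples`, `init_from_random_partition`) and the empty-cluster fallback in option 2 are assumptions made for this sketch, and option 1 is shown in its "single random vector per centroid" form.

```python
import random


def init_from_samples(vectors, k, rng):
    """Option 1: use k distinct, randomly chosen input vectors as the initial centroids."""
    return [list(v) for v in rng.sample(vectors, k)]


def init_from_random_partition(vectors, k, rng):
    """Option 2: assign every input vector to a random cluster, then average each cluster."""
    dim = len(vectors[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for v in vectors:
        c = rng.randrange(k)  # random cluster index in [0, k)
        counts[c] += 1
        for i, x in enumerate(v):
            sums[c][i] += x
    centroids = []
    for c in range(k):
        if counts[c] == 0:
            # Assumption of this sketch: an empty cluster falls back to a random input vector.
            centroids.append(list(rng.choice(vectors)))
        else:
            centroids.append([s / counts[c] for s in sums[c]])
    return centroids


if __name__ == "__main__":
    rng = random.Random(42)
    points = [[0.0, 0.0], [0.1, 0.2], [5.0, 5.0], [5.2, 4.9], [9.0, 0.5], [9.1, 0.4]]
    print("option 1:", init_from_samples(points, 3, rng))
    print("option 2:", init_from_random_partition(points, 3, rng))
```

One likely reason for the difference: with option 2 each centroid is an average over roughly n/k randomly chosen vectors, so the starting centroids tend to sit close to the overall mean of the data, whereas option 1 starts them at actual, spread-out data points.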
