mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Matt Tanquary <matt.tanqu...@gmail.com>
Subject Re: kmeans vectors
Date Fri, 01 Oct 2010 14:58:44 GMT
I used the InputDriver that Jeff placed in Utils to convert my input
to a SeqFile and ran it through mahout kmeans, now I can specify the
'k' arg. Jeff - I know you tried to tell me, it just didn't sink in
until now. :-)

On Fri, Oct 1, 2010 at 7:22 AM, Matt Tanquary <matt.tanquary@gmail.com> wrote:
> I played around with the t1 and t2 until I got a k that I expected
> with my small set, but if I want to ensure say 3 clusters on a large
> set of data, then how to I use t1 and t2 to set k? Is there a formula
> for that?
>
> On Thu, Sep 30, 2010 at 8:24 PM, Lahiru Samarakoon <lahiruts@gmail.com> wrote:
>> Hi Matt,
>>
>> As Jeff has mentioned earlier, you have to choose t1 and t2 to get the k
>> when you are using * syntheticcontrol.kmeans.Job* program. So what you have
>> experienced is correct.
>>
>> Thanks,
>> Lahiru
>>
>
>
>
> --
> Have you thanked a teacher today? ---> http://www.liftateacher.org
>



-- 
Have you thanked a teacher today? ---> http://www.liftateacher.org

Mime
View raw message