Date: Tue, 8 Feb 2011 14:51:46 -0800
Subject: Re: Clustering with KMeans
From: sharath jagannath
To: user@mahout.apache.org

Oh! So that was the id. Then how can I find out the total number of clusters?

Thanks,
Sharath

On Tue, Feb 8, 2011 at 2:32 PM, Kate Ericson wrote:
> Hi Sharath,
>
> So do you have 197 clusters, or just one cluster where the id is 197?
> The ids don't always correspond to the number of clusters you have.
>
> -Kate
>
> On Tue, Feb 8, 2011 at 2:46 PM, sharath jagannath wrote:
> > Now with t1=800, t2=750, SquaredEuclideanDistanceMeasure, I have 197
> > clusters:
> >
> > C-197{n=1 c=[194:13.118, 346:13.820, 497:13.118, 620:13.118, 1224:11.650]
> > r=[0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000,
> > 0.000, 0.000, 0.000, 0.000, 0.000...
> >
> > From the above sample output you can see the cluster id (197), the
> > centroid, the number of points and the radius.
> >
> > For any value of t1 and t2 I always get n = 1. This is quite strange.
> >
> > Does it have anything to do with my dataset? Sorry for the confusion I
> > created. All this while I have been saying the number of clusters is 1.
> >
> > Thanks,
> >
> > Sharath
>

--
Thanks,
Sharath Jagannath
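
One possible way to answer the "total number of clusters" question is to count the cluster entries in the dump text rather than read the number in the id. The sketch below is only an illustration in Python, not a Mahout API. It assumes the dump is available as plain text and that every cluster starts with an identifier followed by "{n=<points>", as in the C-197{n=1 ...} sample quoted above. The function name summarize_clusters and the second sample entry (C-12) are made up for the example.

```python
import re

# Assumption: each cluster entry begins like "C-197{n=1 ...", i.e. an
# identifier, an opening brace, then the number of points in the cluster.
CLUSTER_HEADER = re.compile(r"\b([A-Z]+-\d+)\{n=(\d+)")


def summarize_clusters(dump_text):
    """Return {cluster_id: point_count} for every cluster entry found."""
    return {
        match.group(1): int(match.group(2))
        for match in CLUSTER_HEADER.finditer(dump_text)
    }


# Shortened sample text. The first entry mirrors the quoted output;
# the second entry is hypothetical and only there to show the counting.
sample = (
    "C-197{n=1 c=[194:13.118, 346:13.820] r=[0.000, 0.000]}\n"
    "C-12{n=3 c=[5:1.000] r=[0.500]}\n"
)

clusters = summarize_clusters(sample)
print("number of clusters:", len(clusters))
for cluster_id, points in clusters.items():
    print(cluster_id, "has", points, "point(s)")
```

The number of clusters is the number of entries found. The id after "C-" is only a label, so a single entry named C-197 still means one cluster.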