Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 76E549289 for ; Mon, 7 May 2012 08:16:36 +0000 (UTC) Received: (qmail 34093 invoked by uid 500); 7 May 2012 08:16:35 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 33853 invoked by uid 500); 7 May 2012 08:16:30 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Delivered-To: moderator for user@mahout.apache.org Received: (qmail 48479 invoked by uid 99); 7 May 2012 07:02:32 -0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of dawid.weiss@gmail.com designates 209.85.213.42 as permitted sender) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date :x-google-sender-auth:message-id:subject:to:content-type :content-transfer-encoding; bh=z0VJ6soWXqYiloyjg4hlhRoeXsFA1xtf7vS2b3izcv4=; b=Y1Jr7pN4QIQUl/rhVZevv3Wqzy2neKmUHYIA9qo9O5gydo/oQuV186OsDT/h8fBN4L Bbm74W3Hi9oBhdCbPa9gzBOWOvWtcKuLEMYgZDRIVZWL8/F8zGqTLMFzrbgCqSiYTDBE iv3QhON38UEOMrTDhunnrq9LctLdHivW1gACJziGGY+jQRo+mzLeiJqahOOY+PxIHfW4 3A+RAu4Q9tGP6lgdgMQIm2qNPnXSCQDlgJtpOjnrA+1enRq8lMey+epTmxXxplafZwKs sAuvrCZdvTD2HxL0mMJxmWxGEsr+6B4K0CLrGqF9/vksoGrRTDOrjgsHZfUCqUee49My ccSQ== MIME-Version: 1.0 Sender: dawid.weiss@gmail.com In-Reply-To: References: <4FA6E3F0.8080507@occamsmachete.com> From: Dawid Weiss Date: Mon, 7 May 2012 09:01:43 +0200 X-Google-Sender-Auth: uIdvtZ3uxCFdtQE7bIosJ94Ic7c Message-ID: Subject: Re: kmeans not returning k clusters To: user@mahout.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable > - it doesn't have the final pass of in-memory clustering so it really jus= t > gives you an indifferent quality clustering with a huge number of weighte= d > clusters. =C2=A0With the final pass, it will give you a high quality clus= tering > with your specified number of clusters. I think the "huge number of weighted clusters" can be actually beneficial in certain applications. Are you going to leave this in as an option when integrating with Mahout, Ted? Still didn't have time to look at the code yet ;( Dawid