From mahout-user-return-3060-apmail-lucene-mahout-user-archive=lucene.apache.org@lucene.apache.org Tue Apr 06 22:11:10 2010 Return-Path: Delivered-To: apmail-lucene-mahout-user-archive@minotaur.apache.org Received: (qmail 42080 invoked from network); 6 Apr 2010 22:11:10 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 6 Apr 2010 22:11:10 -0000 Received: (qmail 56019 invoked by uid 500); 6 Apr 2010 22:11:10 -0000 Delivered-To: apmail-lucene-mahout-user-archive@lucene.apache.org Received: (qmail 55974 invoked by uid 500); 6 Apr 2010 22:11:10 -0000 Mailing-List: contact mahout-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mahout-user@lucene.apache.org Delivered-To: mailing list mahout-user@lucene.apache.org Received: (qmail 55966 invoked by uid 99); 6 Apr 2010 22:11:10 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Apr 2010 22:11:10 +0000 X-ASF-Spam-Status: No, hits=0.4 required=10.0 tests=AWL,RCVD_IN_DNSWL_NONE,SPF_HELO_PASS,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [74.208.4.195] (HELO mout.perfora.net) (74.208.4.195) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Apr 2010 22:11:02 +0000 Received: from jeff-eastmans-macbook-pro.local (c-71-198-0-148.hsd1.ca.comcast.net [71.198.0.148]) by mrelay.perfora.net (node=mrus2) with ESMTP (Nemesis) id 0Mhhff-1OBhQC41Yp-00N595; Tue, 06 Apr 2010 18:10:41 -0400 Message-ID: <4BBBB15F.3050108@windwardsolutions.com> Date: Tue, 06 Apr 2010 15:10:39 -0700 From: Jeff Eastman User-Agent: Thunderbird 2.0.0.24 (Macintosh/20100228) MIME-Version: 1.0 To: mahout-user@lucene.apache.org Subject: MAHOUT-236 Cluster Evaluation Tools? References: <4BBB6BF6.7050807@windwardsolutions.com> <4BBB7416.5080902@windwardsolutions.com> In-Reply-To: <4BBB7416.5080902@windwardsolutions.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Provags-ID: V01U2FsdGVkX18X/tAiRV5mFLeSLW1b9V3SC1ly0o2Zro4J+N6 WNIBWvyHzFYcceXBcAoB0frvng28LNSV+tC8CNmAUc7NFEzGLW 9YR8AHQBZgjuaZvkJOcSZdYm4GYklLMsg3J26w6bCQ= Is anybody working on MAHOUT-236? To me it looks like the next logical step beyond generalizing the cluster dumper: improving on its summaries Jeff Eastman wrote: > Completing the ClusterDumper jira will allow for visual inspection of > the Dirichlet models and extracting some useful information thereof; > arguably not too useful with 1793-element vectors but this is also > true of kmeans clusters with 1793-element center vectors. With no > terminating conditions, selecting the particular iteration to inspect > is also an issue unique to Dirichlet. MAHOUT-236 has been around for a > while and, as Jake notes below, is really needed. >