Return-Path: Delivered-To: apmail-lucene-hadoop-user-archive@locus.apache.org Received: (qmail 21175 invoked from network); 26 Oct 2006 23:14:16 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 26 Oct 2006 23:14:16 -0000 Received: (qmail 68300 invoked by uid 500); 26 Oct 2006 23:14:26 -0000 Delivered-To: apmail-lucene-hadoop-user-archive@lucene.apache.org Received: (qmail 68277 invoked by uid 500); 26 Oct 2006 23:14:26 -0000 Mailing-List: contact hadoop-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-user@lucene.apache.org Delivered-To: mailing list hadoop-user@lucene.apache.org Received: (qmail 68266 invoked by uid 99); 26 Oct 2006 23:14:26 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Oct 2006 16:14:26 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: local policy) Received: from [212.122.43.147] (HELO sioux.101tec.com) (212.122.43.147) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 Oct 2006 16:14:12 -0700 Received: from [192.168.200.40] (unknown [80.64.182.137]) by sioux.101tec.com (Postfix) with ESMTP id B3552103CE for ; Fri, 27 Oct 2006 01:13:50 +0200 (CEST) Mime-Version: 1.0 (Apple Message framework v752.3) In-Reply-To: <02E1BD58-F09C-42ED-B108-FDB0F785CCB5@athena.com> References: <02E1BD58-F09C-42ED-B108-FDB0F785CCB5@athena.com> Content-Type: multipart/alternative; boundary=Apple-Mail-6-1061113727 Message-Id: <513DE344-7F1F-484C-A5DE-6F8B0A8946FE@101tec.com> From: Stefan Groschupf Subject: Re: Statistical clustering MapReduce example? Date: Fri, 27 Oct 2006 01:13:43 +0200 To: hadoop-user@lucene.apache.org X-Mailer: Apple Mail (2.752.3) X-Virus-Checked: Checked by ClamAV on apache.org --Apple-Mail-6-1061113727 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Hi David, a student once wrote a paper about map reduce and clustering in my company during a internal ship. I will send it to you off list since the list does not support attachments. However if someone wants to have a copy as well, let me know. Cheers, Stefan Am 23.10.2006 um 22:41 schrieb David Pollak: > Howdy, > > I'm looking to cluster documents by word frequency (and maybe > position). Does anyone know of MapReduce examples that demonstrate > statistical clustering? > > Thanks, > > David > > > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 101tec Inc. search tech for web 2.1 Menlo Park, California http://www.101tec.com --Apple-Mail-6-1061113727--