Return-Path: Delivered-To: apmail-jakarta-lucene-dev-archive@www.apache.org Received: (qmail 21099 invoked from network); 26 Jan 2005 17:17:36 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 26 Jan 2005 17:17:36 -0000 Received: (qmail 90704 invoked by uid 500); 26 Jan 2005 17:17:32 -0000 Delivered-To: apmail-jakarta-lucene-dev-archive@jakarta.apache.org Received: (qmail 90687 invoked by uid 500); 26 Jan 2005 17:17:32 -0000 Mailing-List: contact lucene-dev-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Developers List" Reply-To: "Lucene Developers List" Delivered-To: mailing list lucene-dev@jakarta.apache.org Received: (qmail 90661 invoked by uid 99); 26 Jan 2005 17:17:32 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (hermes.apache.org: local policy) Received: from natsmtp00.rzone.de (HELO natsmtp00.rzone.de) (81.169.145.165) by apache.org (qpsmtpd/0.28) with ESMTP; Wed, 26 Jan 2005 09:17:32 -0800 Received: from shred (A09c5.a.pppool.de [213.6.9.197]) by post.webmailer.de (8.13.1/8.13.1) with ESMTP id j0QHHRge020881 for ; Wed, 26 Jan 2005 18:17:28 +0100 (MET) From: "integer [daniel prawdzik]" To: Subject: -> Grouping Search Results by Clustering Snippets: Date: Wed, 26 Jan 2005 18:17:24 +0100 Message-ID: <001001c503ca$e8b01c60$c50906d5@shred> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable X-Priority: 3 (Normal) X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook, Build 10.0.2616 Importance: Normal X-MimeOLE: Produced By Microsoft MimeOLE V5.50.4927.1200 X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Grouping Search Results by Clustering Snippets: The presentation of search engines are typically long unsorted lists of results. To find the page you=92re looking for, is often time-consuming and unsatisfying.=20 Showing the results in groups by similar topics is a quite more suitable solution to give an user a quick overview over the results. This can be done by a technology called cluster analysis. Actually I=92m working on my diploma master thesis about this topic. In my understanding, it=92s too nice to be born for the archive, so I want to implement this feature in an opensource software. The coding of this programm already gone pretty far, I=92ve got some tests done and the results are impresive and might still get better [you can see some results on http://www.trist.de/CV/Text-Mining/ -> sorry, only in german] To make a long story short:=20 I=92m wondering, if this is an attractive feature for the lucene community? regards, integer --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-dev-help@jakarta.apache.org