From: Erik Hatcher
Subject: Re: similarity of two texts
Date: Tue, 1 Jun 2004 06:01:23 -0400
To: "Lucene Users List"

On May 31, 2004, at 2:17 PM, Stefan Groschupf wrote:
> Lucene can't help you.

What about using term vectors though? I've been able to do rudimentary
document similarity calculations using the new term vector support in
Lucene 1.4. Search the 'net for more info on term vectors and the
formulas needed (elementary vector angle calculation, actually).
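
For what it's worth, here is a minimal Python sketch of that vector angle (cosine) calculation. It does not use Lucene at all: the function names (term_vector, cosine_similarity, text_similarity) are made up for illustration, and a simple regex tokenizer stands in for whatever analyzer you index with. With Lucene you would read the term frequencies from the stored term vectors rather than re-tokenizing the text.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Build a term -> frequency map from lowercased word tokens.

    Assumption: a plain regex tokenizer, not a Lucene analyzer.
    """
    return Counter(re.findall(r"\w+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two term-frequency vectors."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(freq * b[term] for term, freq in a.items() if term in b)
    norm_a = math.sqrt(sum(f * f for f in a.values()))
    norm_b = math.sqrt(sum(f * f for f in b.values()))
    if norm_a == 0 or norm_b == 0:
        # An empty text has no direction; treat it as not similar.
        return 0.0
    return dot / (norm_a * norm_b)


def text_similarity(text1, text2):
    """Hypothetical helper: similarity of two raw texts, from 0.0 to 1.0."""
    return cosine_similarity(term_vector(text1), term_vector(text2))
```

Identical texts score close to 1.0 and texts with no terms in common score 0.0. Raw term frequencies are the simplest weighting; whether to weight terms differently is a separate choice this sketch leaves open.
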
Erik

> On 31.05.2004 at 20:10, uddam chukmol wrote:
>
>> Hi,
>>
>> I'm a newbie to Lucene and heard that it helps in the information
>> retrieval process. However, my problem is not really related to
>> information retrieval but to the comparison of two texts. I think
>> Lucene may help resolve it.
>>
>> I would like to have a clue on how to compare two given texts and
>> finally say how similar they are.
>>
>> Has anyone had this kind of experience? I will be very grateful to
>> hear your ideas and your recommendations.
>>
>> Thanks beforehand!
>>
>> Uddam CHUKMOL
>
> ---------------------------------------------------------------
> open technology: http://www.media-style.com
> open source: http://www.weta-group.net
> open discussion: http://www.text-mining.org
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org