Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 19457 invoked from network); 27 Feb 2010 20:30:58 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 27 Feb 2010 20:30:58 -0000 Received: (qmail 56248 invoked by uid 500); 27 Feb 2010 20:30:56 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 56222 invoked by uid 500); 27 Feb 2010 20:30:56 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 56214 invoked by uid 99); 27 Feb 2010 20:30:56 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 27 Feb 2010 20:30:56 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of lists@nabble.com designates 216.139.236.158 as permitted sender) Received: from [216.139.236.158] (HELO kuber.nabble.com) (216.139.236.158) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 27 Feb 2010 20:30:47 +0000 Received: from isper.nabble.com ([192.168.236.156]) by kuber.nabble.com with esmtp (Exim 4.63) (envelope-from ) id 1NlTIw-0004ES-K4 for java-user@lucene.apache.org; Sat, 27 Feb 2010 12:30:26 -0800 Message-ID: <27730729.post@talk.nabble.com> Date: Sat, 27 Feb 2010 12:30:26 -0800 (PST) From: PlusPlus To: java-user@lucene.apache.org Subject: Changing TF method MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Nabble-From: r.shahidinejad@gmail.com X-Virus-Checked: Checked by ClamAV on apache.org Hi, I want to change the Lucene's similarity in a way that I can add Fuzzy memberships to the terms of a document. Thus, TF value of a term in one document is not always 1, it can add 0.7 to the value of the TF ( (In my application, each term is contained in a document at most once). This membership value is available before index time. On the other hand, each occurrence of a word will not be considered as 1 documentfrequency for the IDF formula. I was wondering if I can change the TF and IDF values of the terms like this. So far, I know that I can change the impact of TF values on the scoring, but not this thing that I'm looking for. Best, Reza -- View this message in context: http://old.nabble.com/Changing-TF-method-tp27730729p27730729.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org