From: "Julien Nioche"
To: "Lucene Users List"
Subject: Re: Similarity - position in Field[] effects scoring - how to change?
Date: Tue, 23 Mar 2004 17:03:34 +0100

Joachim,

Why don't you use the explain method of IndexSearcher?
http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/IndexSearcher.html

It is the best way to find out why your two documents score differently. I suspect the lengthNorm method, which is applied at indexing time.

Julien

----- Original Message -----
From: "Joachim Schreiber"
To:
Sent: Tuesday, March 23, 2004 4:05 PM
Subject: Similarity - position in Field[] effects scoring - how to change?

> Hello,
>
> I have run into the following problem. Perhaps somebody can help me.
>
> I have an index with several ids in the same field,
> something like
>
> 00000000
> 45678565
> 87854546
>
> Situation: I have different documents with the entry 00000000 in the same
> index.
>
> document 1)
>
> 324235678565
> 324dssd5678565
> 45678324565
> 00000000
> 8785454324326
>
> document 2)
>
> 324235678565
> 00000000
> 45678324565
> 8785454324326
>
> When I search for " s:00000000 " I receive both docs, but document 1 gets a
> better score than document 2.
> The position of 00000000 in doc 1 is Field[4] and in doc 2 it's Field[2],
> so this seems to affect scoring.
>
> How can I disable this behaviour, so that doc 1 gets the same score as doc 2?
> Which method do I have to override in DefaultSimilarity?
> Does anybody have an idea? Any help is appreciated.
>
> Thanks
>
> yo
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
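
For reference, here is a minimal sketch of the length normalization suspected in the reply. It assumes the default formula of Lucene releases from that period, where DefaultSimilarity.lengthNorm(String fieldName, int numTokens) returns 1/sqrt(numTokens). The code does not call Lucene: the names LengthNormSketch, defaultLengthNorm and flatLengthNorm are hypothetical, and the token counts 5 and 4 are simply the number of ids listed for each document in the question.

```java
// Stand-alone model of the suspected cause; no Lucene classes are used.
final class LengthNormSketch {

    // Assumed default behaviour: fields with more tokens get a smaller norm.
    static float defaultLengthNorm(int numTokens) {
        return (float) (1.0 / Math.sqrt(numTokens));
    }

    // Hypothetical replacement: ignore field length entirely, so every
    // document receives the same norm regardless of how many ids it holds.
    static float flatLengthNorm(int numTokens) {
        return 1.0f;
    }

    public static void main(String[] args) {
        int doc1Tokens = 5; // ids listed for document 1 in the question
        int doc2Tokens = 4; // ids listed for document 2 in the question

        System.out.println("default norm, doc 1: " + defaultLengthNorm(doc1Tokens));
        System.out.println("default norm, doc 2: " + defaultLengthNorm(doc2Tokens));
        System.out.println("flat norm, doc 1:    " + flatLengthNorm(doc1Tokens));
        System.out.println("flat norm, doc 2:    " + flatLengthNorm(doc2Tokens));
    }
}
```

In a real index the equivalent change would probably be a DefaultSimilarity subclass whose lengthNorm returns 1.0f, installed before the documents are indexed. Because the norm is stored at indexing time, existing documents would have to be reindexed for the change to take effect. The output of explain remains the way to confirm that the norm is really the factor that differs between the two documents.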