Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 86633 invoked from network); 28 Nov 2004 00:34:59 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 28 Nov 2004 00:34:59 -0000 Received: (qmail 8533 invoked by uid 500); 28 Nov 2004 00:34:49 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 8480 invoked by uid 500); 28 Nov 2004 00:34:48 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 8456 invoked by uid 99); 28 Nov 2004 00:34:48 -0000 X-ASF-Spam-Status: No, hits=0.2 required=10.0 tests=HTML_30_40,HTML_MESSAGE,MIME_QP_LONG_LINE X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: local policy) Received: from mailbe04.mc.Vanderbilt.Edu (HELO mailbe04.mc.vanderbilt.edu) (160.129.50.174) by apache.org (qpsmtpd/0.28) with ESMTP; Sat, 27 Nov 2004 16:34:46 -0800 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C4D4E2.0CB9FEC4" X-MimeOLE: Produced By Microsoft Exchange V6.5.7226.0 Subject: RE: Are similarity scores computed when using sort? Date: Sat, 27 Nov 2004 18:33:07 -0600 Message-ID: X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Are similarity scores computed when using sort? Thread-Index: AcTU1YlL6aQQue96TjuwHXYo+PIkIQADEtUi From: "Aphinyanaphongs, Yindalon" To: "Lucene Users List" X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N ------_=_NextPart_001_01C4D4E2.0CB9FEC4 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Erik! Thanks for the response. I'll take a look and see to customizing a = solution. Yin ________________________________ From: Erik Hatcher [mailto:erik@ehatchersolutions.com] Sent: Sat 11/27/2004 5:04 PM To: Lucene Users List Subject: Re: Are similarity scores computed when using sort? On Nov 27, 2004, at 1:01 PM, Aphinyanaphongs, Yindalon wrote: > Thanks for the quick response. I don't necessarily need to calculated > the similarity score. It was my understanding that the inverted index > had a list of all the hits for each term so technically, no document > returned from the inverted index should have a score of 0. It's certainly true that the inverted index has a list of all documents that contain each term. However the word "hit" is by definition a document with score > 0 given a query, not just a term. Queries can be quite sophisticated. For example, I was very easily able to create an XOR query by using a custom similarity. > Would you know in what java class the call is made to Similarity and > where that code may be commented out, or would it be preferable to > write my own similarity sub-class instead? Similarity use is spread out in several places. Get a good IDE(A!) and surf the code and you'll easily be able to see where methods of Similarity are being used. Erik --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org ------_=_NextPart_001_01C4D4E2.0CB9FEC4--