Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 15846 invoked from network); 24 Nov 2004 00:31:25 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 24 Nov 2004 00:31:25 -0000 Received: (qmail 30615 invoked by uid 500); 24 Nov 2004 00:31:18 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 30591 invoked by uid 500); 24 Nov 2004 00:31:18 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 30575 invoked by uid 99); 24 Nov 2004 00:31:18 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=RCVD_BY_IP,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: domain of ken.mccracken@gmail.com designates 64.233.170.198 as permitted sender) Received: from [64.233.170.198] (HELO rproxy.gmail.com) (64.233.170.198) by apache.org (qpsmtpd/0.28) with ESMTP; Tue, 23 Nov 2004 16:31:16 -0800 Received: by rproxy.gmail.com with SMTP id a36so31719rnf for ; Tue, 23 Nov 2004 16:31:14 -0800 (PST) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:references; b=PZ2aKQWZx9mCtqPt9PFc/m//D+EQY5YgLA6gvE+KvGn3iF9vVch0MPhrBqtt2IIaXJgWPRI++pK4C/UA8LBsFDPpAcc1XtOuuaXQoh826fM7L+HJAWikgmB/G0RPN/L+zcMBLRFM6i9j7XxWp1wr/iNmvKRQ7i05tLPaDNNlG5M= Received: by 10.38.150.72 with SMTP id x72mr105103rnd; Tue, 23 Nov 2004 16:31:14 -0800 (PST) Received: by 10.38.75.22 with HTTP; Tue, 23 Nov 2004 16:31:14 -0800 (PST) Message-ID: <2b2518a404112316313c264077@mail.gmail.com> Date: Tue, 23 Nov 2004 16:31:14 -0800 From: Ken McCracken Reply-To: Ken McCracken To: Lucene Users List Subject: Re: lucene Scorers In-Reply-To: <200411131207.05478.paul.elschot@xs4all.nl> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit References: <200411131207.05478.paul.elschot@xs4all.nl> X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Hi, Thanks the pointers in your replies. Would it be possible to include some sort of accrual scorer interface somewhere in the Lucene Query APIs? This could be passed into a query similar to MaxDisjunctionQuery; and combine the sum, max, tieBreaker, etc., according to the implementor's discretion, to compute the overall score for a document. -Ken On Sat, 13 Nov 2004 12:07:05 +0100, Paul Elschot wrote: > On Friday 12 November 2004 22:56, Chuck Williams wrote: > > > > I had a similar need and wrote MaxDisjunctionQuery and > > MaxDisjunctionScorer. Unfortunately these are not available as a patch > > but I've included the original message below that has the code (modulo > > line breaks added by simple text email format). > > > > This code is functional -- I use it in my app. It is optimized for its > > stated use, which involves a small number of clauses. You'd want to > > improve the incremental sorting (e.g., using the bucket technique of > > BooleanQuery) if you need it for large numbers of clauses. > > When you're interested, you can also have a look here for > yet another DisjunctionScorer: > http://issues.apache.org/bugzilla/show_bug.cgi?id=31785 > > It has the advantage that it implements skipTo() so that it can > be used as a subscorer of ConjunctionScorer, ie. it can be > faster in situations like this: > > aa AND (bb OR cc) > > where bb and cc are treated by the DisjunctionScorer. > When aa is a filter this can also be used to implement > a filtering query. > > > > > > Re. Paul's suggested steps below, I did not integrate this with query > > parser as I didn't need that functionality (since I'm generating the > > multi-field expansions for which max is a much better scoring choice > > than sum). > > > > Chuck > > > > Included message: > > > > -----Original Message----- > > From: Chuck Williams [mailto:chuck@manawiz.com] > > Sent: Monday, October 11, 2004 9:55 PM > > To: lucene-dev@jakarta.apache.org > > Subject: Contribution: better multi-field searching > > > > The files included below (MaxDisjunctionQuery.java and > > MaxDisjunctionScorer.java) provide a new mechanism for searching across > > multiple fields. > > The maximum indeed works well, also when the fields differ a lot length. > > Regards, > Paul > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org > For additional commands, e-mail: lucene-user-help@jakarta.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org