Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 67613 invoked from network); 28 Aug 2008 12:57:34 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 28 Aug 2008 12:57:34 -0000 Received: (qmail 18476 invoked by uid 500); 28 Aug 2008 12:57:26 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 18449 invoked by uid 500); 28 Aug 2008 12:57:26 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 18438 invoked by uid 99); 28 Aug 2008 12:57:26 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Aug 2008 05:57:26 -0700 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [208.97.132.119] (HELO spunkymail-a2.g.dreamhost.com) (208.97.132.119) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 28 Aug 2008 12:56:28 +0000 Received: from [192.168.0.3] (adsl-074-229-189-244.sip.rmo.bellsouth.net [74.229.189.244]) by spunkymail-a2.g.dreamhost.com (Postfix) with ESMTP id 7761687017 for ; Thu, 28 Aug 2008 05:57:25 -0700 (PDT) Message-Id: <79A7FD5E-947D-4D48-AEBB-9C676D201248@apache.org> From: Grant Ingersoll To: java-user@lucene.apache.org In-Reply-To: <4BCE10CFEBFE2E4F8FD2B77E2E8E504205F82FE1DD@mse19be2.mse19.exchange.ms> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v926) Subject: Re: Clarity: Is there a Query boosting 50-50 over 1000-1 ? Date: Thu, 28 Aug 2008 08:56:56 -0400 References: <4BCE10CFEBFE2E4F8FD2B77E2E8E504205F82FDF5E@mse19be2.mse19.exchange.ms> <4BCE10CFEBFE2E4F8FD2B77E2E8E504205F82FE1DD@mse19be2.mse19.exchange.ms> X-Mailer: Apple Mail (2.926) X-Virus-Checked: Checked by ClamAV on apache.org On Aug 27, 2008, at 7:34 PM, Shi Hui Liu wrote: > Hi, > > I think I should clarify my question a little bit. I'm using > BooleanQuery to combine TermQuery(A) and TermQuery(B). But I'm not > satisfied with its scoring algorigthm. Is there other queries can > boost up the documents with 50 of A and 50 of B on top of documents > with 1000 of A and 1 of B? Is your query A + B meant to be A OR B or A AND B? That is, are both terms required? You notation suggests they are, but the description suggests you are getting documents that have only A in them, which suggests "OR". Have you looked at the explains? What about the scoring aren't you happy with? It's not perfect (there is no such thing) but it works pretty well in most cases, and works great if you spend a little time figuring out the right length normalization factors. > And I'm looking at the source code and found lots of classes are not > public and some important methods are protected. What's the reason? > Why make them public and let users to customize the Query easily? Because there not meant to be overridden, but of course we are open to specific suggestions on things that should be made public and often do this when someone shows a valid reason. Cheers, Grant --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org