From: student_t
To: java-user@lucene.apache.org
Subject: Please help to interpret Lucene Boost results
Date: Fri, 26 Sep 2008 12:55:43 -0700 (PDT)
Message-ID: <19695313.post@talk.nabble.com>

I am baffled by the results of the following queries. Could it have something to do with the boosting factor?
All of these queries were run in the same environment against the same crawled index/data.

A. query1 = +(content:(Pepsi)) resulted in 228 hits.
B. query2 = +(content:(Pepsi) ) +(host:(ca)^10 ) resulted in 398 hits.
C. query3 = +(host:(ca)^10 ) resulted in 212 hits.

Two questions (strictly speaking, just one):

1. query1, which matches any document whose content contains Pepsi, yielded 228 hits. How can the more restrictive query2 (all docs that contain Pepsi and have a host of ca) yield more hits (398)?
2. Since there are only 212 hits for Canadian domains, how can query2 return 398 hits?

Thanks for any pointers!

Cheers,
student_t
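
P.S. To make the expectation behind both questions concrete, here is a minimal sketch of the set arithmetic I am assuming. It is plain Python and does not touch Lucene; the function name `hit_bounds` and the variable names are my own, made up for illustration, and the only inputs are the three hit counts reported above.

```python
def hit_bounds(hits_a, hits_b):
    """Return (and_range, or_range): the possible hit counts when two
    clauses with the given individual hit counts are combined."""
    # If both clauses are required, the result is an intersection,
    # which can never be larger than the smaller of the two sets.
    and_range = (0, min(hits_a, hits_b))
    # If either clause may match, the result is a union: at least the
    # larger set, at most the sum of both (when nothing overlaps).
    or_range = (max(hits_a, hits_b), hits_a + hits_b)
    return and_range, or_range


pepsi_hits = 228     # query1: +(content:(Pepsi))
ca_hits = 212        # query3: +(host:(ca)^10 )
combined_hits = 398  # query2: both clauses together

and_range, or_range = hit_bounds(pepsi_hits, ca_hits)
print("possible hits if both clauses are required:", and_range)
print("possible hits if either clause may match:", or_range)
print("398 consistent with required clauses:",
      and_range[0] <= combined_hits <= and_range[1])
print("398 consistent with either-clause matching:",
      or_range[0] <= combined_hits <= or_range[1])
```

If both clauses were truly required, query2 could not return more than 212 hits. 398 does fall within the range that a union of the two clauses would allow (228 to 440), but whether that is what is actually happening here is only a guess on my part.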