lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From student_t <cc...@cscinfo.com>
Subject Please help to interpret Lucene Boost results
Date Fri, 26 Sep 2008 19:55:43 GMT

I am baffled by the results of the following queries. Can it be something to
do with the boosting factor? All of these queries are performed in the same
environment with the same crawled index/data. 

A. query1 = +(content:(Pepsi))                              resulted in 228
hits. 
B. query2 = +(content:(Pepsi) ) +(host:(ca)^10 )     resulted in 398 hits. 
C. query3 = +(host:(ca)^10 )                                resulted in 212
hits. 

Two questions (strictly just one): 
1. query1 of any content contains Pepsi yielded 228 hits, how could a more
limiting query2 (give me all docs that have Pepsi in it with a domain of ca)
yield more hits (398)? 
2. Since there are 212 hits of Canadian domains, how can query2 return 398
hits? 

Thanks for any pointers! 
Cheers, 
student_t 


-- 
View this message in context: http://www.nabble.com/Please-help-to-interpret-Lucene-Boost-results-tp19695313p19695313.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message