lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From prabin meitei <prabin.mei...@gmail.com>
Subject Re: document with different index time boost returns same score
Date Sat, 19 Dec 2009 06:46:39 GMT
Thanks to all for the replies. I checked with luke and documents with
different index time boosting (not much different) has same fieldNorm. I
think that is causing the search hits to have same score.
As Andrzej suggested i checked the rounding error caused by encoding. The
result really surprises me.
like 7.9 rounding to 7.0, 8.0 rounding to 8.0 and even 9.4 rounding to 8.0
etc..

Prabin


On Sat, Dec 19, 2009 at 3:09 AM, Andrzej Bialecki <ab@getopt.org> wrote:

> On 2009-12-18 21:47, Tom Hill wrote:
>
>> The docBoost, IIRC, is stored in a single byte, which combines the doc
>> boost, the field boost, and the length norm.
>> (
>>
>> http://lucene.apache.org/java/2_4_1/api/core/org/apache/lucene/search/Similarity.html#formula_norm
>> )
>>
>> Are the lengths of your documents the same? If not, this could be
>> affecting
>> your scoring.
>>
>> You can run luke (http://code.google.com/p/luke/) , and look at the
>> values
>> for fieldNorm. It's on the documents tab.
>>
>
> There's a little widget in Luke to set the value of norm - just display the
> dialog, it shows you the rounding error that this encoding causes (and what
> input values effectively come out the same, once encoded).
>
>
> --
> Best regards,
> Andrzej Bialecki     <><
>  ___. ___ ___ ___ _ _   __________________________________
> [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> ___|||__||  \|  ||  |  Embedded Unix, System Integration
> http://www.sigram.com  Contact: info at sigram dot com
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message