lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ahmet Arslan <iori...@yahoo.com>
Subject Re: Strange relevance scoring
Date Tue, 08 Apr 2014 09:36:41 GMT
Hi Nielsen,

There is no special attention paid to first word. You are probably hitting length normalisation. 
Lucene/Solr punishes long documents, favours short documents. 
(5 times appearing one) longer?



On Tuesday, April 8, 2014 12:03 PM, John Nielsen <jn@mcb.dk> wrote:
Hi,

We are seeing a strange phenomenon with our Solr setup which I have been
unable to answer.

My Google-fu is clearly not up to the task, so I am trying here.

It appears that if i do a freetext search for a single word, say "modellering"
on a text field, the scoring is massively boosted if the first word of the
text field is a hit.

For instance if there is only one occurrence of the word "modellering" in
the text field and that occurrence is the first word of the text, then that
document gets a higher relevancy than if the word "modelling" occurs 5
times in the text and the first word of the text is any other word.

Is this normal behavior? Is special attention paid to the first word in a
text field? I would think that the latter case would get the highest score.


-- 
Med venlig hilsen / Best regards

*John Nielsen*
Programmer



*MCB A/S*
Enghaven 15
DK-7500 Holstebro

Kundeservice: +45 9610 2824
post@mcb.dk
www.mcb.dk


Mime
View raw message