lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From elisabeth benoit <elisaelisael...@gmail.com>
Subject deactivate coord scoring factor in pf2 pf3
Date Thu, 28 Apr 2016 15:31:47 GMT
Hello all,

I am using Solr 4.10.1. I use edismax, with pf2 to boost documents starting
with. I use a start with token (bzzzz) automatically added at index time,
and added in request at query time.

I have a problem at this point.

request is *q=bzzzz saint denis rer*

the start with field is name_sw

first document *name_sw: Saint-Denis-Université*
second document *name_sw: RER Saint-Denis*

So one will have the pf2 starts with boost and not the other. The problem
is that it has an effect on the scoring of pf2 for all other words.

In other words, my problem is the proximity between "saint" and "denis" is
not scored the same value for those two documents.

>From what I get this is because of the coord scoring factor used for pf2.

In explain output, for first document

0.52612317 Matches Punished by 0.6666667 (not all query terms matched)
   0.78918475 Sum of the following:
     0.39459237 names_sw:"bzzzz saint"^0.21

     0.39459237 Dismax (take winner of below)
       0.39459237 names_sw:"saint denis"^0.21

       0.37580228 catchall:"saint den"^0.2


*So here, matches punished by 0.66*, which corresponds to coord(2/3)

and final score pf2 for proximity between saint and denis

0.263061593153079 names_sw:"saint denis"^0.21


In explain output, for second document


 0.13153079 Matches Punished by 0.33333334 (not all query terms matched)
   0.39459237 Dismax (take winner of below)
     0.39459237 names_sw:"saint denis"^0.21

     0.37580228 catchall:"saint den"^0.2


*So here matches punished by 0.33*, which corresponds to coord(1/3)

and final score pf2 for proximity between saint and denis

0.1315307926306158 names_sw:"saint denis"^0.21


I would like to deactivate coord for pf2 pf3. Does anyone know how I
could do this?


Best regards,

Elisabeth

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message