lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paul Elschot <paul.elsc...@xs4all.nl>
Subject Re: PhraseQuery issues - differences with SpanNearQuery
Date Fri, 05 Sep 2008 18:49:04 GMT
Op Friday 05 September 2008 16:57:34 schreef Mark Miller:
> Paul Elschot wrote:
> > Op Thursday 04 September 2008 20:39:13 schreef Mark Miller:
> >> Sounds like its more in line with what you are looking for. If I
> >> remember correctly, the phrase query factors in the edit distance
> >> in scoring, but the NearSpanQuery will just use the combined idf
> >> for each of the terms in it, so distance shouldnt matter with
> >> spans (I'm sure Paul will correct me if I am wrong).
> >
> > SpanScorer will use the similarity slop factor for each matching
> > span size to adjust the effective frequency.
> > The span size is the difference in position between the first
> > and last matching term, and idf is not used for scoring Spans.
> > The reason why idf is not used could be that there is no basic
> > score value associated with inner spans; only top level spans
> > are scored by SpanScorer.
> > For more details, please consult the SpanScorer code.
> >
> > Regards,
> > Paul Elschot
>
> Right, my fault, its the query normalization in the weight which uses
> idf (by pulling from each clause in the span). So its kind of
> factored into the score, but not in the way I implied. Sorry, my bad
> on the info.

Well, I had missed the phrase idf over all the SpanQuery terms
as used from the SpanWeight.

Regards,
Paul Elschot

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message