lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Renaud Delbru <renaud.del...@deri.org>
Subject Re: Sorting posting lists before intersection
Date Mon, 13 Oct 2008 15:52:24 GMT
Hi,

Paul Elschot wrote:
> This could be done, but since not all scorers will be TermScorers it
> will be necessary to add a method to Scorer (or perhaps even to its
> DocIdSetIterator superclass):
>
>    public abstract int estimatedDocFreq();
>
> and implement this for all existing instances. TermScorer could
> implement it without estimating.
> For AND/OR/NOT such an estimation is straightforward but for
> proximity queries it would be more of a guess.
>   
I agree. Indeed, for proximity queries, it is more tricky. Maybe taking 
the frequency of the rarest term in a PhraseQuery / SpanQuery could be a 
not so bad predictor in general.

Regards.
-- 
Renaud Delbru

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message