lucene-solr-dev mailing list archives

From Yonik Seeley <yo...@lucidimagination.com>
Subject Re: [PMX:FAKE_SENDER] Re: large OR-boolean query
Date Thu, 24 Sep 2009 14:07:56 GMT
On Thu, Sep 24, 2009 at 9:26 AM, Luo, Jeff <jluo@cas.org> wrote:
> I think the searching is the bottle neck. Solr/Lucene is slow when the
> maxBooleanClauses is bigger enough.

OK, then I'd go with the custom query.  You can reduce the message size
and get gains in query parsing speed too:

{!parallel_or}1,2,3,4,5,6,7,8,9,...2048

Sort all of the terms before creating the lower-level disjunctions -
this can speed up the term seeking (for example, seeking to 1000, then
1001, then 1002 in the same thread will be faster than seeking to
1000, then 2000, then 1).
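
To make the sorting suggestion concrete, here is a rough sketch in plain
Java with no Solr or Lucene calls. The class SortedTermChunks and its
split method are hypothetical names, and the idea of handing each slice
to its own worker thread is an assumption about how {!parallel_or} might
be built. Terms are sorted as strings, on the assumption that the term
dictionary is ordered by term text rather than by numeric value.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper for illustration only; not part of Solr or Lucene.
final class SortedTermChunks {
    private SortedTermChunks() {}

    /**
     * Parses a comma-separated term list, sorts the terms, and splits them
     * into at most `chunks` contiguous slices. Each slice holds neighbouring
     * terms, so a worker that seeks through one slice only moves forward.
     */
    static List<List<String>> split(String commaSeparated, int chunks) {
        if (chunks < 1) {
            throw new IllegalArgumentException("chunks must be >= 1");
        }
        List<String> terms = new ArrayList<>();
        for (String raw : commaSeparated.split(",")) {
            String term = raw.trim();
            if (!term.isEmpty()) {
                terms.add(term);
            }
        }
        // Natural String order (lexicographic), assumed to match index order.
        Collections.sort(terms);

        List<List<String>> slices = new ArrayList<>();
        if (terms.isEmpty()) {
            return slices;
        }
        // Ceiling division so that no more than `chunks` slices are produced.
        int sliceSize = (terms.size() + chunks - 1) / chunks;
        for (int start = 0; start < terms.size(); start += sliceSize) {
            int end = Math.min(start + sliceSize, terms.size());
            slices.add(new ArrayList<>(terms.subList(start, end)));
        }
        return slices;
    }
}
```

Each returned slice would then become one lower-level disjunction, so
any single thread only ever seeks forward through its own terms.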

If you don't need scoring, then it can be made even faster by using bitsets.
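
As a rough illustration of the bitset idea: without scoring, all that is
needed is the set of documents matching at least one term, so each
term's document IDs can simply be OR-ed into one bit per document. The
sketch below uses java.util.BitSet and a hypothetical in-memory postings
map in place of the real index; BitSetOr and matchAny are made-up names,
not Lucene classes.

```java
import java.util.BitSet;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Solr or Lucene.
final class BitSetOr {
    private BitSetOr() {}

    /**
     * Returns a bitset with one bit set for every document that contains
     * at least one of the given terms. `postings` stands in for the index:
     * it maps a term to the IDs of the documents containing that term.
     */
    static BitSet matchAny(Map<String, int[]> postings, List<String> terms, int maxDoc) {
        BitSet hits = new BitSet(maxDoc);
        for (String term : terms) {
            int[] docs = postings.get(term);
            if (docs == null) {
                continue; // term is not in the index
            }
            for (int doc : docs) {
                hits.set(doc); // no score is computed, only membership
            }
        }
        return hits;
    }
}
```

If the work is split across threads, each thread could fill its own
bitset and the results could be merged at the end with BitSet.or.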

-Yonik
http://www.lucidimagination.com
