Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 68443 invoked from network); 13 Apr 2005 07:03:51 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 13 Apr 2005 07:03:51 -0000 Received: (qmail 54793 invoked by uid 500); 13 Apr 2005 07:03:45 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 54748 invoked by uid 500); 13 Apr 2005 07:03:44 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 54725 invoked by uid 500); 13 Apr 2005 07:03:44 -0000 Delivered-To: apmail-jakarta-lucene-dev@jakarta.apache.org Received: (qmail 54718 invoked by uid 99); 13 Apr 2005 07:03:43 -0000 X-ASF-Spam-Status: No, hits=0.2 required=10.0 tests=NO_REAL_NAME X-Spam-Check-By: apache.org Received: from ajax-1.apache.org (HELO ajax.apache.org) (192.87.106.226) by apache.org (qpsmtpd/0.28) with ESMTP; Wed, 13 Apr 2005 00:03:42 -0700 Received: by ajax.apache.org (Postfix, from userid 99) id 213D52DD; Wed, 13 Apr 2005 09:03:38 +0200 (CEST) From: bugzilla@apache.org To: lucene-dev@jakarta.apache.org Subject: DO NOT REPLY [Bug 34407] - BooleanQuery assumes everything else implements skipTo X-Bugzilla-Reason: AssignedTo Message-Id: <20050413070338.213D52DD@ajax.apache.org> Date: Wed, 13 Apr 2005 09:03:38 +0200 (CEST) X-Virus-Checked: Checked X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG� RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT . ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND� INSERTED IN THE BUG DATABASE. http://issues.apache.org/bugzilla/show_bug.cgi?id=34407 ------- Additional Comments From paul.elschot@xs4all.nl 2005-04-13 09:03 ------- (In reply to comment #4) .. > > My motivation for a RangeQuery is not making it faster for the average case, > it's making it possible in any scenario (any place in a query, any number of > terms, etc). > > We have some search collections with over 100M documents. Now imagine a range > query on a unique id field... I don't think any method utilizing 100M termdoc > enumerators is really feasible (am I understanding correctly?) This is very similar to a date range. Try searching for this on the web: yyyy yyyymm yyyymmdd lucene The results are getting dense in this way, and for performance you might consider caching (intermediate) results in (BitSet) filters. Lucene itself is meant for smaller numbers of results. 100M docs means about 12Mbyte per BitSet filter. When your filters contain fewer docs than 12M and you need many filters you might consider the sparse filters of bug 32921 . However, these filters require skipTo on all their filtered scorers, meaning that they require the development version of BooleanQuery at the moment. Regards, Paul Elschot P.S. Perhaps someone is interested in writing a story about Lucene and the ordered document skippers. It's getting a bit involved. -- Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are the assignee for the bug, or are watching the assignee. --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org