From: Paul Elschot
To: java-dev@lucene.apache.org
Subject: Re: [jira] Updated: (LUCENE-584) Decouple Filter from BitSet
Date: Thu, 7 Sep 2006 18:48:40 +0200

On Thursday 07 September 2006 10:03, eks dev wrote:
...
>
> On another note, the key to really efficient matchers will be a good
> SmartMatcherFactory that picks the best representation for a given
> density/"sortedness".
> The cases I've been able to identify so far:
>
> Very low density - IntList
> Low density - VIntSortedList
> Dense - OpenBitSet/BitSetIterator or such
> Sorted - (imagine a case where you have an opportunity to sort your
> index on a category field; quite often, I guess, as it does not require
> absolute "sortedness": it is enough to sort periodically without caring
> about smaller updates). There, one simple interval list can do the
> magic in just a few bytes of memory, even in high-density cases.
...
> More ideas on this?

What's the point of using a sorted interval list for a category?
With the patch, a TermScorer is a Matcher, so one could use a TermScorer
to filter a category, assuming the category has an indexed term.
For filtering, it might be worthwhile to introduce a TermMatcher
to avoid the scoring done in TermScorer.

Regards,
Paul Elschot
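
For reference, the "simple interval list" from the quoted message could look roughly like the sketch below. It is only an illustration under assumptions: the class name IntervalListMatcher is hypothetical, the class is not part of Lucene or of the LUCENE-584 patch, and the next()/skipTo()/doc() methods are loosely modeled on Lucene scorers of that era, so the actual Matcher interface in the patch may differ.

```java
/**
 * Hypothetical sketch of an interval-list matcher for an index sorted on a
 * category field. Not part of Lucene or of the LUCENE-584 patch.
 */
final class IntervalListMatcher {
    // Flattened inclusive intervals: [start0, end0, start1, end1, ...].
    // Assumed to be sorted and non-overlapping.
    private final int[] bounds;
    private int interval = 0; // index of the interval currently being scanned
    private int doc = -1;     // current document number, -1 before the first match

    IntervalListMatcher(int... bounds) {
        if (bounds.length % 2 != 0) {
            throw new IllegalArgumentException("bounds must hold start/end pairs");
        }
        this.bounds = bounds.clone();
    }

    /** Moves to the next matching document; returns false when exhausted. */
    boolean next() {
        return skipTo(doc + 1);
    }

    /** Moves to the first match beyond the current one that is >= target. */
    boolean skipTo(int target) {
        int t = Math.max(target, doc + 1); // never move backwards
        while (2 * interval < bounds.length) {
            int start = bounds[2 * interval];
            int end = bounds[2 * interval + 1];
            if (t <= end) {
                doc = Math.max(t, start);
                return true;
            }
            interval++; // target lies past this interval, try the next one
        }
        return false;
    }

    /** Current document number; only meaningful after next() or skipTo() returned true. */
    int doc() {
        return doc;
    }
}
```

With intervals (3..5) and (10..10), repeated calls to next() visit documents 3, 4, 5 and 10. Memory use is two ints per interval regardless of how many documents each interval covers, which is the "few bytes even in high-density cases" property mentioned in the quote. A TermScorer or a TermMatcher over an indexed category term would match the same documents without requiring the index to be sorted.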