From java-user-return-22474-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Wed Aug 02 16:24:01 2006 Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 68610 invoked from network); 2 Aug 2006 16:24:00 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 2 Aug 2006 16:24:00 -0000 Received: (qmail 52822 invoked by uid 500); 2 Aug 2006 16:23:53 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 52795 invoked by uid 500); 2 Aug 2006 16:23:53 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 52784 invoked by uid 99); 2 Aug 2006 16:23:53 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Aug 2006 09:23:53 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [194.109.24.23] (HELO smtp-vbr3.xs4all.nl) (194.109.24.23) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Aug 2006 09:23:52 -0700 Received: from k8l.lan (porta.xs4all.nl [80.127.24.69]) by smtp-vbr3.xs4all.nl (8.13.6/8.13.6) with ESMTP id k72GNV3N097705 for ; Wed, 2 Aug 2006 18:23:31 +0200 (CEST) (envelope-from paul.elschot@xs4all.nl) From: Paul Elschot To: java-user@lucene.apache.org Subject: Re: wildcards and spans Date: Wed, 2 Aug 2006 18:23:30 +0200 User-Agent: KMail/1.8.2 References: <359a92830608020829h5a05d381ofe17b6d3d7f1d0be@mail.gmail.com> In-Reply-To: <359a92830608020829h5a05d381ofe17b6d3d7f1d0be@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200608021823.30384.paul.elschot@xs4all.nl> X-Virus-Scanned: by XS4ALL Virus Scanner X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N On Wednesday 02 August 2006 17:29, Erick Erickson wrote: > I'm back, with another flavor of wildcards. What direction would you point a > poor boy who's project lead wants wildcard queries and spans? Here's the > problem.... > > I cannot use any of the classes that throw a "TooManyClauses" exception (e.g. > SpanRegexQuery or SpanNearQuery with, say WildCardQuery). The corpus is big > enough that this is guaranteed to be thrown. So, currently I'm using a > filter for wildcard queries, populating it via WildcardTermEnum and > TermDocs... Works like a champ. But I don't see how to combine this with > spans... You can try BooleanQuery.setMaxClauseCount() to increase the max. nr. of clauses to 100000 or so and see what happens when searching. With enough RAM it should work nicely. You could also use the surround query language. This allows to set the max. nr. of clauses for a whole query instead of per BooleanQuery. Regards, Paul Elschot --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org