Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 64243 invoked from network); 15 Jun 2006 17:44:02 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 15 Jun 2006 17:44:02 -0000 Received: (qmail 19283 invoked by uid 500); 15 Jun 2006 17:43:51 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 19216 invoked by uid 500); 15 Jun 2006 17:43:51 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 19170 invoked by uid 99); 15 Jun 2006 17:43:51 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Jun 2006 10:43:50 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (asf.osuosl.org: local policy) Received: from [169.229.70.167] (HELO rescomp.berkeley.edu) (169.229.70.167) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Jun 2006 10:43:50 -0700 Received: by rescomp.berkeley.edu (Postfix, from userid 1007) id D7CA75B764; Thu, 15 Jun 2006 10:43:29 -0700 (PDT) Received: from localhost (localhost [127.0.0.1]) by rescomp.berkeley.edu (Postfix) with ESMTP id D12D37F403 for ; Thu, 15 Jun 2006 10:43:29 -0700 (PDT) Date: Thu, 15 Jun 2006 10:43:29 -0700 (PDT) From: Chris Hostetter To: java-user@lucene.apache.org Subject: RE: BooleanQuery.TooManyClauses on MultiSearcher In-Reply-To: <20060615173331.E3B96187B4@mail.seseit.com> Message-ID: References: <20060615173331.E3B96187B4@mail.seseit.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N : Incidentally, I'm getting BooleanQuery.TooManyClauses when I search on : "james", but I don't when I search on "James". Surely the number of clauses : isn't dependent on the number of hits?! not the numebr of hits -- just hte number of terms in your index that start with the prefix. : However, I know that "fred" is relatively uncommon in my index and "neil" is : relatively common and yet "fred" is getting the BooleanQuery.TooManyClauses : and "neil" isn't. Does that make sense? : : Should the actual term used in a PrefixQuery effect the number of clauses? yes .. the Term used in the PrefixQUery is just a convinient holder for a fieldname and a term value prefix -- what matters is how many terms in that field start with that prefix. if "james*" causes a problem, but "James*" doesn't then it sounds like your indexing analyzer is case sensative and you have a lot more lowercase values starting with james then upercase values starting with James .. if "fred*" causes a problem but "neil*" doesn't then you probably have a lot more terms that start with "fred" then you do that start with "neil" -- it doesn't matter if "neil@foo" is the value more documents then the total number of docs that contain any value starting with "fred", what matters is how many unique values there are starting with "fred" -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org