Return-Path: Delivered-To: apmail-lucene-general-archive@www.apache.org Received: (qmail 31859 invoked from network); 28 Jul 2009 16:17:14 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 28 Jul 2009 16:17:14 -0000 Received: (qmail 86940 invoked by uid 500); 28 Jul 2009 16:18:31 -0000 Delivered-To: apmail-lucene-general-archive@lucene.apache.org Received: (qmail 86856 invoked by uid 500); 28 Jul 2009 16:18:31 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 86846 invoked by uid 99); 28 Jul 2009 16:18:31 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Jul 2009 16:18:31 +0000 X-ASF-Spam-Status: No, hits=2.5 required=10.0 tests=PLING_QUERY,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [85.25.71.29] (HELO mail.troja.net) (85.25.71.29) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 28 Jul 2009 16:18:22 +0000 Received: from localhost (localhost.localdomain [127.0.0.1]) by mail.troja.net (Postfix) with ESMTP id BCAC145ED64 for ; Tue, 28 Jul 2009 18:18:00 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at mail.troja.net Received: from mail.troja.net ([127.0.0.1]) by localhost (megaira.troja.net [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id NDkL3bwSQjjO for ; Tue, 28 Jul 2009 18:17:53 +0200 (CEST) Received: from VEGA (dslb-088-065-126-173.pools.arcor-ip.net [88.65.126.173]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mail.troja.net (Postfix) with ESMTPSA id 310C645ED63 for ; Tue, 28 Jul 2009 18:17:53 +0200 (CEST) From: "Uwe Schindler" To: References: <24664839.post@talk.nabble.com> <4A6CA0C5.2070209@ice-sa.com> <24670355.post@talk.nabble.com> <24671050.post@talk.nabble.com> <24701672.post@talk.nabble.com> <44FC07357EE24E8DB856430E5829A63C@VEGA> <24702697.post@talk.nabble.com> Subject: RE: Boolean query with 50,000 clauses! Possible? Scalable? Date: Tue, 28 Jul 2009 18:17:49 +0200 Message-ID: <0BAE8DA72BE44EF28F500743766342BC@VEGA> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Office Outlook 11 In-Reply-To: <24702697.post@talk.nabble.com> Thread-Index: AcoPnfUbBy4gyAFQTNuCL1ApxacgNwAAFa0w X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5579 X-Virus-Checked: Checked by ClamAV on apache.org Hi Dado, > However, that you are suggesting is to query the author index (for example > for authors weighing more than 200lbs) and then for each author query the > book index with the author name. The problem is: the author query could > return tens if not hundreds of thousands author names. It would be > unreasonable to loop through each one of them and start a query to the > book > index at each step. You misunderstood me here. I meant an index for authors, if the search results the use should get displayed are authors and the user (may) then click on one author and display all books. In your case I would add this weight value also to the books index: e.g. in field name "author-weight" (and use the new NumericField from Lucene 2.9 for it, this value is then also denormalized). Author names are in "author-name" and so on. And then a simple NumericRangeQuery on this field. I was wondering whether solr/lucene allows for this > sort > of intersections to be done at the server level, not at the client level. > What I would like to see in lucene/solr is to query an index with an a > large > array of values for a specific field (in this case something like > "book.author_name IN ([author0.name, author1.name, ..., authorN.name])" > the > author name array being the result of a "subquery/nested query" that could > return a large number of hits. Subqueries or joins are not possible per se. You should always denormalize everything. > Thanx for your help and consideration, > > Dado > -- > View this message in context: http://www.nabble.com/Boolean-query-with- > 50%2C000-clauses%21-Possible--Scalable--tp24664839p24702697.html > Sent from the Lucene - General mailing list archive at Nabble.com.