Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 8309 invoked from network); 8 Jun 2005 07:36:15 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 8 Jun 2005 07:36:15 -0000 Received: (qmail 63306 invoked by uid 500); 8 Jun 2005 07:36:02 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 63243 invoked by uid 500); 8 Jun 2005 07:36:01 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 63229 invoked by uid 99); 8 Jun 2005 07:36:01 -0000 X-ASF-Spam-Status: No, hits=0.1 required=10.0 tests=FORGED_RCVD_HELO X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: local policy) Received: from smtp-vbr6.xs4all.nl (HELO smtp-vbr6.xs4all.nl) (194.109.24.26) by apache.org (qpsmtpd/0.28) with ESMTP; Wed, 08 Jun 2005 00:35:57 -0700 Received: from k8l.lan (porta.xs4all.nl [80.127.24.69]) by smtp-vbr6.xs4all.nl (8.13.3/8.13.3) with ESMTP id j587ZgOk033912 for ; Wed, 8 Jun 2005 09:35:42 +0200 (CEST) (envelope-from paul.elschot@xs4all.nl) From: Paul Elschot To: java-user@lucene.apache.org Subject: Re: Fastest way to fetch N documents with unique keys within large numbers of indexes.. Date: Wed, 8 Jun 2005 09:35:42 +0200 User-Agent: KMail/1.5.4 References: <42A4FB31.3020509@rojo.com> <200506070922.21027.paul.elschot@xs4all.nl> <42A62B58.3030703@rojo.com> In-Reply-To: <42A62B58.3030703@rojo.com> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200506080935.42290.paul.elschot@xs4all.nl> X-Virus-Scanned: by XS4ALL Virus Scanner X-Virus-Checked: Checked X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N On Wednesday 08 June 2005 01:18, Kevin Burton wrote: > Paul Elschot wrote: > > >For a large number of indexes, it may be necessary to do this over > >multiple indexes by first getting the doc numbers for all indexes, > >then sorting these per index, then retrieving them > >from all indexes, and repeating the whole thing using terms determined > >from the retrieved docs. > > > > > Well this was a BIG win. Just benchmarking it out shows a 10x -> 50x > performance increase. > > Times in milliseconds: > > Before: > > duration: 1127 > duration: 449 > duration: 394 > duration: 564 > > After: > > duration: 182 > duration: 39 > duration: 12 > duration: 11 There is no need for a relational db when you have Lucene :) Thanks for reporting the old and the new times. Regards, Paul Elschot --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org