From: Paul Elschot
To: java-user@lucene.apache.org
Subject: Re: mutiple index question
Date: Fri, 20 May 2005 20:59:57 +0200

On Friday 20 May 2005 16:21, Robert Newson wrote:
> Paul Elschot wrote:
> > On Friday 20 May 2005 13:58, Max Pfingsthorn wrote:
> > > Hi!
> > >
> > > I was wondering if Lucene has any sort of functionality to distribute
> > > indices so that different fields are stored in separate indices but they
> > > still refer to the same document. This would be great for a situation where
> > > there are many large documents which have frequently changing properties.
> > > Content and properties could be indexed separately. Even different sets of
> > > properties could be combined in maybe different MultiSearcher instances to
> > > speed up querying...
> > >
> > > Any ideas on this?
> >
> > Lucene doesn't have this, so you'll have to split your docs over two different
> > Lucene indexes and adapt the search accordingly. Cached filtering helps a
> > lot, but setting up a filter can still be costly.
> >
> > Regards,
> > Paul Elschot
>
> Isn't this achieved by the new ParallelReader in CVS?

Without having had a look at that code: probably not, because a reader
does not know which field is the primary key.

The problem with the two indexes above is that a single document has
different fields in both of them. Usually a search would start in the
"properties" index, retrieve the found primary keys from there, then use
these primary keys to set up a filter in the "content" index, and then
repeatedly search in the content text limiting to this filter.

Regards,
Paul Elschot
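
As a rough illustration of the two-step search described above, here is a minimal Java sketch. It uses plain in-memory maps as stand-ins for the "properties" and "content" indexes and does not call any Lucene API; the class name, method names, property names, and sample documents are all hypothetical and chosen only for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical stand-in for the two-index setup; plain maps, no Lucene API.
final class TwoIndexSearch {

    // "properties" index: primary key -> (property name -> value).
    static final Map<String, Map<String, String>> PROPERTIES = new LinkedHashMap<>();

    // "content" index: primary key -> document text.
    static final Map<String, String> CONTENT = new LinkedHashMap<>();

    static {
        // Sample data, invented for this sketch.
        PROPERTIES.put("doc1", Map.of("status", "published"));
        PROPERTIES.put("doc2", Map.of("status", "draft"));
        PROPERTIES.put("doc3", Map.of("status", "published"));

        CONTENT.put("doc1", "lucene index merging");
        CONTENT.put("doc2", "lucene query parsing");
        CONTENT.put("doc3", "filter caching in search");
    }

    // Step 1: search the properties index and collect the primary keys found.
    static Set<String> keysWithProperty(String name, String value) {
        Set<String> keys = new TreeSet<>();
        for (Map.Entry<String, Map<String, String>> e : PROPERTIES.entrySet()) {
            if (value.equals(e.getValue().get(name))) {
                keys.add(e.getKey());
            }
        }
        return keys;
    }

    // Step 2: search the content index, keeping only hits whose key passes the filter.
    static List<String> searchContent(String term, Set<String> filter) {
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, String> e : CONTENT.entrySet()) {
            boolean matches = Arrays.asList(e.getValue().split("\\s+")).contains(term);
            if (matches && filter.contains(e.getKey())) {
                hits.add(e.getKey());
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Build the filter once, then reuse it for several content searches.
        Set<String> filter = keysWithProperty("status", "published");
        System.out.println(searchContent("lucene", filter));  // [doc1]
        System.out.println(searchContent("caching", filter)); // [doc3]
        System.out.println(searchContent("query", filter));   // []
    }
}
```

The sketch builds the key filter once and reuses it across content searches, since setting up the filter is the step noted earlier in the thread as potentially costly.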