Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 8978 invoked from network); 12 Mar 2008 01:31:59 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 12 Mar 2008 01:31:59 -0000 Received: (qmail 37859 invoked by uid 500); 12 Mar 2008 01:31:50 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 37824 invoked by uid 500); 12 Mar 2008 01:31:50 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 37813 invoked by uid 99); 12 Mar 2008 01:31:50 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Mar 2008 18:31:50 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [203.217.22.128] (HELO file1.syd.nuix.com.au) (203.217.22.128) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Mar 2008 01:31:03 +0000 Received: from host68.syd.nuix.com.au (host68.syd.nuix.com.au [192.168.222.68]) by file1.syd.nuix.com.au (Postfix) with ESMTP id 801C44A81BA for ; Wed, 12 Mar 2008 12:31:08 +1100 (EST) From: Daniel Noll Organization: Nuix Pty Ltd To: java-user@lucene.apache.org Subject: Re: Document ID shuffling under 2.3.x (on merge?) Date: Wed, 12 Mar 2008 12:26:51 +1100 User-Agent: KMail/1.9.6 (enterprise 0.20070907.709405) References: <200803111645.27956.daniel@nuix.com> <200803120839.19906.daniel@nuix.com> <359a92830803111553y60847045o2ab2ec895e02b497@mail.gmail.com> In-Reply-To: <359a92830803111553y60847045o2ab2ec895e02b497@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Message-Id: <200803121226.51420.daniel@nuix.com> X-Virus-Checked: Checked by ClamAV on apache.org On Wednesday 12 March 2008 09:53:58 Erick Erickson wrote: > But to me, it always seems...er...fraught to even *think* about relying > on doc ids. I know you've been around the block with Lucene, but do you > have a compelling reason to use the doc ID and not your own unique ID? =46rom memory it was around a factor of 10 times slower to use a text field= for=20 this; I haven't tested it recently and the case of retrieving the Document= =20 should be slightly faster now that we have FieldSelector, but it certainly= =20 won't be faster as to get the document you need the ID in the first place. =46or single documents it wasn't a problem, the use cases are: 1. Bulk database operations based on the matched documents. 2. Creating a filter BitSet based on a database query. Effectively this is required because Lucene offered no way to update a=20 Document after it was indexed; if it had that feature we would never have=20 needed a database. ;-) Daniel --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org