Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 7438 invoked from network); 6 Aug 2004 12:33:58 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 6 Aug 2004 12:33:58 -0000 Received: (qmail 37091 invoked by uid 500); 6 Aug 2004 12:33:49 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 37052 invoked by uid 500); 6 Aug 2004 12:33:48 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 37035 invoked by uid 99); 6 Aug 2004 12:33:48 -0000 X-ASF-Spam-Status: No, hits=1.3 required=10.0 tests=PORN_URL_SEX,SPF_PASS X-Spam-Check-By: apache.org Received: from [134.96.191.5] (HELO mousepad.xtramind.dfki.de) (134.96.191.5) by apache.org (qpsmtpd/0.27.1) with ESMTP; Fri, 06 Aug 2004 05:33:43 -0700 Received: from localhost (localhost [127.0.0.1]) by mousepad.xtramind.dfki.de (Postfix) with ESMTP id 258F97F3B for ; Fri, 6 Aug 2004 14:33:41 +0200 (MEST) Received: from ihawk.win.xtramind.dfki.de (unknown [192.168.4.39]) by mousepad.xtramind.dfki.de (Postfix) with ESMTP id 64C4B7D1B for ; Fri, 6 Aug 2004 14:33:37 +0200 (MEST) X-MimeOLE: Produced By Microsoft Exchange V6.5.7226.0 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Subject: AW: Lucene internal document number? Date: Fri, 6 Aug 2004 14:33:37 +0200 Message-ID: <90BBF8FAA569414698E019D903E37DA703D92D@ihawk.win.xtramind.dfki.de> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Lucene internal document number? Thread-Index: AcR7qiWuR2bNH98RQ+OBXODQafrBsgABkFNg From: "Karsten Konrad" To: "Lucene Users List" X-Virus-Scanned: by AMaViS with Sophos Sweep X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Hi, >> a have a short question it's regarding lucene internal document numbers: = can you give me an idea where they are written into the index and how=20 they are generated? >> I am not 100% sure about the technical design, only from my experience with Lucene: The numbers depend on when the document was indexed.=20 The "older" the document, the smaller the number. All=20 documents are numbered from 0 to n-1 where n is the=20 number of documents the current reader sees. There are never any "gaps" in this numbering. There is, to my knowledge, no explicit point where these numbers are written in the index. Think of positions in a list - they are not part of the list itself. You have to take into account that these numbers may change for documents after=20 any deletions in the index. Regards, Karsten -- Dr.-Ing. Karsten Konrad Head of Artificial Intelligence Lab Xtramind Technologies GmbH=20 Stuhlsatzenhausweg 3=20 D-66123 Saarbr=FCcken Phone +49 (681) 3 02-51 13=20 Fax +49 (681) 3 02-51 09 karsten.konrad@xtramind.com=20 www.xtramind.com >Besuchen Sie uns ! >DMS | Halle 2 Stand 2705 | 07.- 09. September 2004 | Messe Essen | = www.dmsexpo.de > > -----Urspr=FCngliche Nachricht----- Von: B. Grimm [Eastbeam GmbH] [mailto:bg@eastbeam.com]=20 Gesendet: Freitag, 6. August 2004 13:42 An: lucene-user@jakarta.apache.org Betreff: Lucene internal document number? hi there, i looked around through the source but i dont get it. i also read the=20 faq and i know that numbers are incremental for each index and start by=20 0 and change when optimizing and so one... i looked at the doc writers in lucene, but i dont get the point where=20 numbers are given and written (i assume by using writeVInt() or=20 something like that). it would be very kind if anyone can tell me what line in which file i=20 had to look for. thanks in andvance and kind regards from berlin, germany. bastian --=20 Mit freundlichem Gru=DF, Bastian Grimm --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org