Return-Path: Delivered-To: apmail-lucene-java-dev-archive@www.apache.org Received: (qmail 95107 invoked from network); 23 Mar 2007 05:30:11 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 23 Mar 2007 05:30:11 -0000 Received: (qmail 35794 invoked by uid 500); 23 Mar 2007 05:30:17 -0000 Delivered-To: apmail-lucene-java-dev-archive@lucene.apache.org Received: (qmail 35739 invoked by uid 500); 23 Mar 2007 05:30:16 -0000 Mailing-List: contact java-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-dev@lucene.apache.org Delivered-To: mailing list java-dev@lucene.apache.org Received: (qmail 35728 invoked by uid 99); 23 Mar 2007 05:30:16 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: neutral (hermes.apache.org: local policy) Received: from [169.229.70.167] (HELO rescomp.berkeley.edu) (169.229.70.167) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Mar 2007 22:30:16 -0700 Received: by rescomp.berkeley.edu (Postfix, from userid 1007) id 3027D5B771; Thu, 22 Mar 2007 22:29:18 -0700 (PDT) Received: from localhost (localhost [127.0.0.1]) by rescomp.berkeley.edu (Postfix) with ESMTP id 2917B7F403 for ; Thu, 22 Mar 2007 22:29:18 -0700 (PDT) Date: Thu, 22 Mar 2007 22:29:18 -0700 (PDT) From: Chris Hostetter To: java-dev@lucene.apache.org Subject: Re: [jira] Updated: (LUCENE-843) improve how IndexWriter uses RAM to buffer added documents In-Reply-To: <6C32DEB6-1826-4295-9F18-35C38AD434EB@rectangular.com> Message-ID: References: <16648050.1174583194037.JavaMail.jira@brutus> <7849281.1174583312338.JavaMail.jira@brutus> <392521EA2692A2418DF48C331E61E32506AC6B@professorville.windows.esseff.org> <1174594078.7571.1180891687@webmail.messagingengine.com> <392521EA2692A2418DF48C331E61E32506AC6E@professorville.windows.esseff.org> <1174601936.25270.1180912467@webmail.messagingengine.com> <6C32DEB6-1826-4295-9F18-35C38AD434EB@rectangular.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Checked: Checked by ClamAV on apache.org : > Actually is #2 a hard requirement? : : A lot of Lucene users depend on having document number correspond to : age, I think. ISTR Hatcher at least recommending techniques that : require it. "Corrispond to age" may be missleading as it implies that the actual docid has meaning ... it's more that the relative order of addition is preserved regardless of deletions/merging A trivial example of using this is getting the N newest documents matching a search using a HitCollector, it's just a bounded queue that only remembers the last N things you put in it. An more complicated example is duplicate unique field detection: iterating over a TermDoc and knowing that the doc with the higheest docId is the last one added, so the earlier ones can be ignored/deleted. (as i recall, Solr takes advantage of this.) -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org For additional commands, e-mail: java-dev-help@lucene.apache.org