Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 3811 invoked from network); 9 Jun 2006 00:04:02 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 9 Jun 2006 00:04:02 -0000 Received: (qmail 86163 invoked by uid 500); 9 Jun 2006 00:03:58 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 85493 invoked by uid 500); 9 Jun 2006 00:03:56 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 85482 invoked by uid 99); 9 Jun 2006 00:03:56 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 08 Jun 2006 17:03:56 -0700 X-ASF-Spam-Status: No, hits=1.4 required=10.0 tests=DNS_FROM_RFC_ABUSE,DNS_FROM_RFC_WHOIS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [206.190.38.59] (HELO web50305.mail.yahoo.com) (206.190.38.59) by apache.org (qpsmtpd/0.29) with SMTP; Thu, 08 Jun 2006 17:03:55 -0700 Received: (qmail 23413 invoked by uid 60001); 9 Jun 2006 00:03:33 -0000 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:Received:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=Up3s9tidyUUQ7BQS1T07xAD1GSJcZygGgYyUgtx7y1o4JhlCFqsj4XxHvHTNQo4YcA0yVRHPr/jv/lVWgibgrq3d7/H05ilLMNFZ8C8wr8oB5HLfBZzXVMI71GFKiYMQypxEPM3baXpTtStcbecU7HEmHRecFnQfWESxgfJzWhw= ; Message-ID: <20060609000333.23411.qmail@web50305.mail.yahoo.com> Received: from [72.229.167.99] by web50305.mail.yahoo.com via HTTP; Thu, 08 Jun 2006 17:03:33 PDT Date: Thu, 8 Jun 2006 17:03:33 -0700 (PDT) From: Otis Gospodnetic Reply-To: Otis Gospodnetic Subject: Re: IndexWriter.addIndexes & optimization To: java-user@lucene.apache.org In-Reply-To: <4487319D.10006@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N When writing a unit test that comapres RAMDirectory and FSDirectory performance for Lucene in Action I had a very hard time showing that RAMDirectory really is faster. :) Just set your maxBufferedDocs to as high a number as your RAM/heap will let you, and pick a mergeFactor that is high, but doesn't get you in trouble with open files. Otis ----- Original Message ---- From: Dan Armbrust To: java-user@lucene.apache.org Sent: Wednesday, June 7, 2006 4:05:49 PM Subject: Re: IndexWriter.addIndexes & optimization Benjamin Stein wrote: > > I could probably store the little RAMDirectories to disk as many > FSDirectories, and then addIndexes() of *all* the FSDirectories at the end > instead of every time. That would probably be smart. > > Glad I asked myself! > That was what I was going to suggest - you may also want to benchmark to see if the RAMDirectory is buying you anything. With the data that I am indexing on my hardware, I found it to be faster to index to a regular FSDirectory that it is to use the RAMDirectory. Especially if you tweak the performance knobs on the indexer so it does its own caching before it writes to the Directory. I do batches of documents to FSDirectories - and then merge all of the FSDirectories into a new master index at the end - so I never have to optimize during the indexing process. Dan -- **************************** Daniel Armbrust Biomedical Informatics Mayo Clinic Rochester daniel.armbrust(at)mayo.edu http://informatics.mayo.edu/ --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org