Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 91219 invoked from network); 20 Dec 2002 19:25:48 -0000 Received: from exchange.sun.com (HELO nagoya.betaversion.org) (192.18.33.10) by daedalus.apache.org with SMTP; 20 Dec 2002 19:25:48 -0000 Received: (qmail 700 invoked by uid 97); 20 Dec 2002 19:26:59 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@jakarta.apache.org Received: (qmail 675 invoked by uid 97); 20 Dec 2002 19:26:58 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 659 invoked by uid 98); 20 Dec 2002 19:26:57 -0000 X-Antivirus: nagoya (v4218 created Aug 14 2002) Message-ID: <3E036ED2.8020203@lucene.com> Date: Fri, 20 Dec 2002 11:26:10 -0800 From: Doug Cutting User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.2) Gecko/20021202 X-Accept-Language: en-us, en MIME-Version: 1.0 To: Lucene Users List Subject: Re: Lucene Benchmarks and Information References: In-Reply-To: Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N petite_abeille wrote: > On Friday, Dec 20, 2002, at 19:58 Europe/Zurich, Scott Ganyo wrote: > >> FYI: The best thing I've found for both increasing speed and reducing >> file handles is to use an IndexWriter on a RamDirectory for indexing >> and then use FileWriter.addIndexes() to write the result to disk. >> This is subject to the amount of memory you have available, of course... > > I do follow this strategy and it seems to work nicely... however it > might be not enough :-( Another factor to consider is how homogeneous > are your indexes: the more documents are alike the better. I tried a > wide spectrum of combination and I settled on using one index per class > of documents. This seems to work best for me as far as file handles go. > YMMV. Each indexed field adds a file per segment. So, if you want to conserve file handles, use fewer indexed fields per index, and/or use multiple indexes, as you have done. Doug -- To unsubscribe, e-mail: For additional commands, e-mail: