Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 13824 invoked from network); 30 Jun 2003 17:48:25 -0000 Received: from exchange.sun.com (192.18.33.10) by daedalus.apache.org with SMTP; 30 Jun 2003 17:48:25 -0000 Received: (qmail 11368 invoked by uid 97); 30 Jun 2003 17:50:52 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@nagoya.betaversion.org Received: (qmail 11360 invoked from network); 30 Jun 2003 17:50:51 -0000 Received: from daedalus.apache.org (HELO apache.org) (208.185.179.12) by nagoya.betaversion.org with SMTP; 30 Jun 2003 17:50:51 -0000 Received: (qmail 13548 invoked by uid 500); 30 Jun 2003 17:48:22 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 13535 invoked from network); 30 Jun 2003 17:48:21 -0000 Received: from rumms.uni-mannheim.de (134.155.50.52) by daedalus.apache.org with SMTP; 30 Jun 2003 17:48:21 -0000 Received: from localhost.localdomain (p50841FA7.dip.t-dialin.net [80.132.31.167]) (authenticated bits=0) by rumms.uni-mannheim.de (8.12.9/8.12.9) with ESMTP id h5UHmLeG011714 for ; Mon, 30 Jun 2003 19:48:23 +0200 (MEST) Message-ID: X-Priority: Normal X-Mailer: Columba v0.11.0-cvs From: tstich@uni-mannheim.de Date: Mon, 30 Jun 03 19:54:19 CEST To: lucene-user@jakarta.apache.org Subject: =?ISO-8859-1?Q?Using_Lucene_in_an_multiple_index/large_io_scenario?= Mime-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-1" Content-Transfer-Encoding: 8bit X-Virus-Scanned: by amavisd-new X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N Hello, i am ProjectManager from the columba.sourceforge.net java mailclient-project and we integrated Lucene as the search-backend half a year ago. It is now working for small scale mailtraffic but with increasing mailtraffic Lucene throws OutOfMemory and TooManyFilesOpen-Exceptions. I am now wondering if Lucene is capable of doing the job for us (like Otis Gospodnetic suggested) and would appreciate any help and knowledge you can share on this topic. I think the problem arises from following issues: - Lucene is designed to create an index once in a while and not to update an index frequently. We need it to add and delete documents very often *and* search the index eventualy after every operation. Has anyone experiences running Lucene in such an environment or do you think it is impossible? - Do you have an suggestion on how to use Lucene in such an environment because it is not very nice code if you have to create a new IndexReader/Writer after every operation? - We introduced a RAMIndex that is merged to the FileIndex after N operations to reduce the load and to not merge documents that are removed directly after they are added (with filters on the mailboxes that is happening very often). Any ideas if that was wise or if there is a better solution? - Does Lucene have problems with many indices in the same virtual machine? We have an index for every mailfolder and get TooManyFilesOpen-Excpetions when having >10 indices open. Maybe we should try to have only a single index that holds all messages? If you like to look at sourcecode, how we implememted all this look at http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/columba/columba/src/mail/core/org/columba/mail/folder/search/LuceneSearchEngine.java?rev=1.7&content-type=text/vnd.viewcvs-markup Its not nice to just give you the plain code and not the relevant snippets, but these are more general design issues that i think are better explained in words than in code. I would really like to see Lucene integrated in Columba, but i had to learn that it is no easy task, maybe an impossible one. Based on the responses i willl decide if we continue to work with Lucene or sadly have to drop it. Thanks in advance Timo Stich tstich@users.sourceforge.net --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org