From: "Johannes Christen"
Delivered-To: mailing list java-user@lucene.apache.org
Subject: Berkeley DB JEDirectory Performance
Date: Thu, 6 Jul 2006 17:06:45 +0200

Hi all,

I just want to share my experience with the Berkeley DB JEDirectory implementation from the contrib area. I spent two days evaluating and testing it and found that it works, but it has very poor performance and very high disk requirements for a medium-sized document volume.

I indexed about 78,000 documents (DPA news items) with both FSDirectory and JEDirectory. Here are the results:

Disk usage (index size):
  FSDirectory: 322 MB
  JEDirectory: 4650 MB

Indexing performance:
  FSDirectory: 84 minutes
  JEDirectory: 402 minutes

Searching:
  Initially opening the JEDirectory took about 45 minutes. Searching itself was OK, but still about 1.5 times slower than with the FSDirectory.

I hope that helps people who are considering the Berkeley DB directory implementation for their application. It may do a good job if you want to use transactions in small environments, but if the number of documents gets large I wouldn't recommend the JEDirectory implementation.

Bye for now,

Jo Christen
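
To put the figures reported in this message in relative terms, here is a minimal Java sketch. The class and method names are hypothetical and are not part of Lucene or the JEDirectory contrib code; the sketch only does arithmetic on the reported numbers (322 MB vs. 4650 MB, 84 vs. 402 minutes) and does not reproduce the benchmark itself.

```java
import java.util.Locale;

// Hypothetical helper, not part of Lucene or Berkeley DB:
// it only compares the figures reported in this message.
public class DirectoryComparison {

    // Returns how many times larger (or slower) the measured value is than the baseline.
    public static double ratio(double measured, double baseline) {
        if (baseline <= 0) {
            throw new IllegalArgumentException("baseline must be positive");
        }
        return measured / baseline;
    }

    public static void main(String[] args) {
        // Index size in MB: JEDirectory relative to FSDirectory.
        double sizeRatio = ratio(4650, 322);
        // Indexing time in minutes: JEDirectory relative to FSDirectory.
        double timeRatio = ratio(402, 84);

        System.out.println(String.format(Locale.ROOT,
                "Index size: %.1fx that of FSDirectory", sizeRatio));
        System.out.println(String.format(Locale.ROOT,
                "Indexing time: %.1fx that of FSDirectory", timeRatio));
    }
}
```

On these numbers the JEDirectory index is roughly 14 times larger and indexing takes about 4.8 times as long, alongside the reported 1.5 times slower searching.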