Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 73171 invoked from network); 23 Jul 2004 15:29:33 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 23 Jul 2004 15:29:33 -0000 Received: (qmail 29250 invoked by uid 500); 23 Jul 2004 15:29:25 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 29220 invoked by uid 500); 23 Jul 2004 15:29:25 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 29206 invoked by uid 99); 23 Jul 2004 15:29:24 -0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=HTML_40_50,HTML_MESSAGE X-Spam-Check-By: apache.org Received: from [64.65.202.3] (HELO contextmedia.com) (64.65.202.3) by apache.org (qpsmtpd/0.27.1) with ESMTP; Fri, 23 Jul 2004 08:29:21 -0700 Received: from ppeddi ([192.168.1.48]) by contextmedia.com ; Fri, 23 Jul 2004 11:27:28 -0400 Message-ID: <035101c470c9$92d6bae0$aa87a8c0@ppeddi> From: "Praveen Peddi" To: "lucenelist" Subject: merge factor and minMergeDocs Date: Fri, 23 Jul 2004 11:27:32 -0400 MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_034E_01C470A8.0BB6C300" X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2800.1409 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1409 X-Rcpt-To: X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N ------=_NextPart_000_034E_01C470A8.0BB6C300 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Is anything changed lucene 1.4 regarding mergefactor? I recently ported to lucene 1.4 final and my indexing time doesnot = change with change in the merge factor. Increasing minMergeDocs is = improving my indexing as expected but changing mergefactor is making no = difference. If this is the case, I can always go with the default merge factor of 10 = so I won't run into too many files open problem. But just vary = minMergeDocs to tune the indexing perf. Currently I tested with 25K objects and the indexing time is almost the = same with mergefactor 10 and mergefactor of 100 (kept minMergeDocs = =3D100 in both cases). I am confident that my indexing time used to vary = with change in the merge factor before (with lucene 1.3 RC3 I think). Praveen **************************************************************=20 Praveen Peddi Sr Software Engg, Context Media, Inc.=20 email:ppeddi@contextmedia.com=20 Tel: 401.854.3475=20 Fax: 401.861.3596=20 web: http://www.contextmedia.com=20 **************************************************************=20 Context Media- "The Leader in Enterprise Content Integration"=20 ------=_NextPart_000_034E_01C470A8.0BB6C300--