Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 93847 invoked from network); 25 Aug 2004 21:31:17 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 25 Aug 2004 21:31:17 -0000 Received: (qmail 4413 invoked by uid 500); 25 Aug 2004 21:31:11 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 3876 invoked by uid 500); 25 Aug 2004 21:31:08 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 3863 invoked by uid 99); 25 Aug 2004 21:31:07 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [192.18.98.34] (HELO brmea-mail-3.sun.com) (192.18.98.34) by apache.org (qpsmtpd/0.27.1) with ESMTP; Wed, 25 Aug 2004 14:31:06 -0700 Received: from phys-d3-ha21sca-1 ([129.145.155.163]) by brmea-mail-3.sun.com (8.12.10/8.12.9) with ESMTP id i7PLV5il014045 for ; Wed, 25 Aug 2004 15:31:05 -0600 (MDT) Received: from conversion-daemon.ha21sca-mail1.sfbay.sun.com by ha21sca-mail1.sfbay.sun.com (iPlanet Messaging Server 5.2 HotFix 1.24 (built Dec 19 2003)) id <0I3000I01UD7MY@ha21sca-mail1.sfbay.sun.com> (original mail from Hetan.Shah@Sun.COM) for lucene-user@jakarta.apache.org; Wed, 25 Aug 2004 14:31:05 -0700 (PDT) Received: from [127.0.0.1] (d-sjc06-25-188.SFBay.Sun.COM [129.145.25.188]) by ha21sca-mail1.sfbay.sun.com (iPlanet Messaging Server 5.2 HotFix 1.24 (built Dec 19 2003)) with ESMTP id <0I3000A80UFJC1@ha21sca-mail1.sfbay.sun.com> for lucene-user@jakarta.apache.org; Wed, 25 Aug 2004 14:31:03 -0700 (PDT) Date: Wed, 25 Aug 2004 14:30:52 -0700 From: Hetan Shah Subject: Time to index documents To: Lucene Users List Message-id: <412D050C.9080309@sun.com> MIME-version: 1.0 Content-type: text/plain; charset=us-ascii; format=flowed Content-transfer-encoding: 7BIT X-Accept-Language: en-us, en User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.7) Gecko/20040616 X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Hello all, Is there a way to reduce the indexing time taken when the indexer is indexing about 30,000 + files. It is roughly taking around 6-7 hours to do this. I am using IndexHTML class to create the index out of HTML files. Another issue that I see is every once in a while I get the following output on the screen. adding ../31/1104852.html Parse Aborted: Encountered "\"" at line 7, column 1. Was expecting one of: ... "=" ... ... Any suggestions on preventing this from happening? Thanks in advance. -H --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org