Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 90229 invoked from network); 16 Dec 2004 18:38:20 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 16 Dec 2004 18:38:20 -0000 Received: (qmail 85644 invoked by uid 500); 16 Dec 2004 18:38:14 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 85618 invoked by uid 500); 16 Dec 2004 18:38:13 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 85603 invoked by uid 99); 16 Dec 2004 18:38:13 -0000 X-ASF-Spam-Status: No, hits=0.4 required=10.0 tests=DNS_FROM_RFC_ABUSE X-Spam-Check-By: apache.org Received-SPF: pass (hermes.apache.org: local policy) Received: from web12701.mail.yahoo.com (HELO web12701.mail.yahoo.com) (216.136.173.238) by apache.org (qpsmtpd/0.28) with SMTP; Thu, 16 Dec 2004 10:36:26 -0800 Received: (qmail 59437 invoked by uid 60001); 16 Dec 2004 18:35:26 -0000 Comment: DomainKeys? See http://antispam.yahoo.com/domainkeys DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; b=SVMzTWhxqa/m8IWu88lG7PuujjS6f3Ul1JaWXFVJg9tIN2nlMmjWqipJpkuWW4F4orV7VVoxXlt8LL56Gd3NDhSHAlKwLkMFdqdWCKwC6vzag3n/hvETZ3LsWa5/VIpCWXL6DlwarHIsi5tHcooQASXUC9vhY4cOeIvLDE6dwsU= ; Message-ID: <20041216183525.59432.qmail@web12701.mail.yahoo.com> Received: from [216.194.17.194] by web12701.mail.yahoo.com via HTTP; Thu, 16 Dec 2004 10:35:25 PST Date: Thu, 16 Dec 2004 10:35:25 -0800 (PST) From: Otis Gospodnetic Subject: Re: Disk space needed for indexing??? To: Lucene Users List In-Reply-To: <1103221082.41c1d15aae8cd@webmail.iu.edu> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N The exact disk space usage depends on the number of fields in the index and on how many of them store the original text. You should also keep in mind that the call to IndexWriter's optimize() will result in your index directory size doubling while the optimization is in progress, so if you want to optimize you will need extra free disk space. Otis --- yingjin@indiana.edu wrote: > > > Hi, everyone, > > Does anyone have any idea how much disk space will be needed for > generating the final index with ~1.5G size, for example? > > I have ~3.5G disk space and is able to generate index with 1G size. > However, after I add more records, it will run out of disk space. > Does > Lucene suppose to take so much disk space for indexing? Is there any > way > that I can improve the code to let it take less space? > > > Thanks, > Ying > > --------------------------------------------------------------------- > To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org > For additional commands, e-mail: lucene-user-help@jakarta.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org