Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 77618 invoked from network); 27 Oct 2005 18:18:31 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 27 Oct 2005 18:18:31 -0000 Received: (qmail 14886 invoked by uid 500); 27 Oct 2005 18:18:26 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 14869 invoked by uid 500); 27 Oct 2005 18:18:26 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 14858 invoked by uid 99); 27 Oct 2005 18:18:26 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Oct 2005 11:18:26 -0700 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [69.55.225.129] (HELO ehatchersolutions.com) (69.55.225.129) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Oct 2005 11:18:22 -0700 Received: by ehatchersolutions.com (Postfix, from userid 504) id A8B7313E2006; Thu, 27 Oct 2005 14:18:03 -0400 (EDT) Received: from [128.143.167.108] (d-128-167-108.bootp.Virginia.EDU [128.143.167.108]) by ehatchersolutions.com (Postfix) with ESMTP id 8BE0913E2005 for ; Thu, 27 Oct 2005 14:17:57 -0400 (EDT) Mime-Version: 1.0 (Apple Message framework v734) In-Reply-To: <20051027142158.29921.qmail@web60014.mail.yahoo.com> References: <20051027142158.29921.qmail@web60014.mail.yahoo.com> Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Message-Id: Content-Transfer-Encoding: 7bit From: Erik Hatcher Subject: Re: Index File Size Date: Thu, 27 Oct 2005 14:17:53 -0400 To: java-user@lucene.apache.org X-Mailer: Apple Mail (2.734) X-Spam-Checker-Version: SpamAssassin 3.0.1 (2004-10-22) on javelina X-Spam-Level: X-Virus-Checked: Checked by ClamAV on apache.org X-Old-Spam-Status: No, score=-2.6 required=5.0 tests=AWL,BAYES_00 autolearn=ham version=3.0.1 X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N On 27 Oct 2005, at 10:21, Chandramohan wrote: > In general, will index size be equal to the size of > the document? Also, does Lucene employ any index > compression schemes? I am a relatively new user of > Lucene and I just love it! It depends on how you create Field's. The general rule of thumb I've heard is that if you don't store fields, but just tokenize and index them, an index is about 35% the size of the original documents. Though most applications need some stored fields. Erik --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org