From: Doug Cutting
Date: Mon, 28 Jul 2003 14:26:33 -0700
To: Lucene Users List <lucene-user@jakarta.apache.org>
Subject: Re: Indexing very large sets (10 million docs)

Ryan Clifton wrote:
> You seem to be implying that it is possible to optimize very large
> indexes. My index has a couple million records, but more importantly
> it's about 40 GB in size. I have tried many times to optimize it, and
> this always results in hitting the Linux file size limit. Is there a
> way to get around this? I have the merge factor and max merge docs
> set, but the optimization process seems to ignore those fields.

On Red Hat 8.0 I have built indexes whose total size is 49 GB and whose
largest file (the .prx file) is 28 GB. I haven't yet tried to build
anything larger, so I don't know exactly where the limit is.

Doug

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
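
For context on the exchange above: in the Lucene API of this era (circa
1.2/1.3), the settings Ryan mentions are public fields on IndexWriter.
They bound how large segments can grow during incremental indexing, but
optimize() merges every segment into a single one regardless, which
would explain why the settings appear to be ignored and why a 40 GB
index can still produce a single file that trips the file-size limit on
kernels or filesystems without large-file support. Below is a minimal
sketch under those assumptions; the index path is a placeholder, and
the exact field names should be checked against the Lucene version in
use.

import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.index.IndexWriter;

public class LargeIndexSketch {
    public static void main(String[] args) throws Exception {
        // "/path/to/index" is a placeholder; true = create a new index.
        IndexWriter writer =
            new IndexWriter("/path/to/index", new StandardAnalyzer(), true);

        // These public fields shape incremental merging: mergeFactor
        // controls how many segments are merged at once, and
        // maxMergeDocs caps the size of segments those merges produce.
        writer.mergeFactor = 10;
        writer.maxMergeDocs = 1000000;

        Document doc = new Document();
        doc.add(Field.Text("contents", "example text"));
        writer.addDocument(doc);

        // optimize() collapses all segments into one, regardless of
        // maxMergeDocs, so the resulting files can still exceed an
        // OS or filesystem file-size limit.
        writer.optimize();
        writer.close();
    }
}

If a fully optimized index is not strictly required, one workaround is
simply to skip optimize() and search the multi-segment index, trading
some search speed for smaller individual files.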