lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <>
Subject Re: OutOfMemory errors while indexing large documents
Date Mon, 25 Jul 2005 13:13:21 GMT
Could you be more specific about where the OutOfMemory error is  
happening?  Do you have a complete stack trace?

As for maxFieldLength - in my use of Lucene, it is necessary to index  
the entire document and not just the first 10,000 or so terms - I set  
maxFieldLength to Integer.MAX_VALUE.


On Jul 25, 2005, at 7:30 AM, Harini Raghavan wrote:

> Hi All,
> I am using lucene to index large documents(HTML pages). The  
> application is running on JBoss and MySQL on UNIX. The indexing is  
> throwing OutOfMemory errors beyond a certain point. I am not sure  
> why this is happening. I am using the default IndexWriter  
> properties, but the lucene documentation mentions about setting the  
> max field length on the IndexWriter to some optimum value for large  
> documents. Is anyone aware of any optimum settings for  
> maxFieldLength, mergeFactor, minMergeDoc and maxMergeDoc?
> Thanks,
> Harini
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message