lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel Taurat" <>
Subject Re: Out of memory in lucene 1.4.1 when re-indexing large number of documents
Date Fri, 10 Sep 2004 13:57:34 GMT
The Parser is pdfBox. pdf is about 25% of the over all indexing volume  
on the productive system. I also have word-docs and loads of hmtl 
resources to be indexed.
In my testing environment I merely have 5 pdf docs and still those 
permanent object hanging around, though.

Ben Litchfield wrote:

>>I can say that gc is not collecting these objects since I  forced gc
>>runs when indexing every now and then (when parsing pdf-type objects,
>>that is): No effect.
> <>
> What PDF parser are you using? Is the problem within the parser and not
> lucene? Are you releasing all resources?
> Ben
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message