lucene-java-user mailing list archives

From Grant Ingersoll
Subject Re: indexing performance issue
Date Thu, 30 Nov 2006 18:56:03 GMT

On Nov 30, 2006, at 10:54 AM, spinergywmy wrote:

> Hi Grant,
>    Thanks for the tips. I will take your advice and look into the
> link that you sent me.
>    In my scenario, every time a user uploads a single file, I need to
> index that particular file. The previous version of PDFBox integrated
> with log4j.jar, and I believed log4j.jar was what slowed indexing down
> and used up a lot of memory. However, the latest version of PDFBox
> doesn't need log4j.jar, and I thought that would speed up indexing,
> but it did not.

I would isolate PDFBox and do some performance testing on it, then  
submit your questions on the PDFBox forums, as they will know better  
about PDFBox performance.
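
One way to isolate the two stages is to time text extraction and indexing separately. The following is only a minimal sketch: StageTimer, extractText and indexText are hypothetical names, and the two placeholder methods stand in for your real PDFBox extraction call and your real Lucene indexing call, neither of which is shown here.

```java
import java.util.function.Supplier;

// Hypothetical harness (not part of PDFBox or Lucene): times each stage on its own
// so you can see whether extraction or indexing dominates.
final class StageTimer {

    /** Holds the value a stage produced and how long the stage took. */
    static final class Timed<T> {
        final T value;
        final long nanos;

        Timed(T value, long nanos) {
            this.value = value;
            this.nanos = nanos;
        }
    }

    /** Runs one stage and records its elapsed wall-clock time in nanoseconds. */
    static <T> Timed<T> time(Supplier<T> stage) {
        long start = System.nanoTime();
        T value = stage.get();
        return new Timed<>(value, System.nanoTime() - start);
    }

    // Placeholder: replace the body with your actual PDFBox text extraction.
    static String extractText() {
        return "sample text standing in for PDF content";
    }

    // Placeholder: replace the body with your actual Lucene indexing of the text.
    // Here it only counts whitespace-separated tokens.
    static int indexText(String text) {
        return text.split("\\s+").length;
    }

    public static void main(String[] args) {
        Timed<String> extract = time(() -> extractText());
        Timed<Integer> index = time(() -> indexText(extract.value));

        System.out.printf("extract: %d ms, index: %d ms%n",
                extract.nanos / 1_000_000, index.nanos / 1_000_000);
    }
}
```

If the extraction time dominates on your real files, that points at PDFBox rather than Lucene, and the numbers give you something concrete to post on the PDFBox forums.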

Good luck,

Grant Ingersoll
Center for Natural Language Processing
