lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Erick Erickson" <erickerick...@gmail.com>
Subject Re: Indexing Performance issue
Date Fri, 10 Nov 2006 12:48:14 GMT
Have you measured to see how much of your time is spent indexing and how
much is just parsing the file? You need to do this before having a clue what
you need to make faster....

Erick

On 11/10/06, Daniel Naber <lucenelist2005@danielnaber.de> wrote:
>
> On Friday 10 November 2006 12:18, spinergywmy wrote:
>
> > I having this indexing the pdf file performance issue. It took me more
> > than 10 sec to index a pdf file about 200kb. Is it because I only have a
> > segment file? How can I make the indexing performance better?
>
> PDFBox (which I assume you are using) can be quite slow converting large
> PDF files to text. This has nothing to do with Lucene.
>
> Regards
> Daniel
>
> --
> http://www.danielnaber.de
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message