lucene-java-user mailing list archives

From "Jake Mannix" <jake.man...@gmail.com>
Subject Re: Indexing Speed: 2.3 vs 2.2 (real world numbers)
Date Mon, 04 Feb 2008 05:26:19 GMT
Note that, in particular, we use StandardTokenizer as part of our analyzer
chain, so our test picks up the switch from the JavaCC version to the
JFlex-based code, which I'm betting is a substantial part of that speedup.
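
For reference, the arithmetic behind the figures quoted below can be sketched
as follows. This is only an illustrative calculation: the class and method
names are made up for this example, and the only inputs are the document
count and the two timings from the original post.

```java
// Hypothetical helper, not part of Lucene: it just redoes the arithmetic
// from the numbers reported in this thread.
class IndexingSpeedup {

    // Ratio of the old indexing time to the new one (same unit for both).
    static double speedup(double oldMinutes, double newMinutes) {
        return oldMinutes / newMinutes;
    }

    // Average indexing throughput in documents per second.
    static double docsPerSecond(long docs, double minutes) {
        return docs / (minutes * 60.0);
    }

    public static void main(String[] args) {
        long docs = 2_170_000L;           // 2.17 million documents
        double lucene22 = 4.83 * 60.0;    // 4.83 hours, in minutes
        double lucene23 = 26.0;           // 26 minutes

        System.out.printf("speedup: %.1fx%n", speedup(lucene22, lucene23));
        System.out.printf("2.2: %.0f docs/sec%n", docsPerSecond(docs, lucene22));
        System.out.printf("2.3: %.0f docs/sec%n", docsPerSecond(docs, lucene23));
    }
}
```

That works out to a speedup of a little over 11x, or roughly 125 docs/sec
on 2.2 versus roughly 1,390 docs/sec on 2.3, averaged over the whole run.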

  -jake

On Feb 3, 2008 2:11 PM, Briggs <acidbriggs@gmail.com> wrote:

> Damn, really?  I haven't had the opportunity to test this yet.  Has
> anyone else seen this kind of improvement?
>
>
>
> On Feb 3, 2008 2:57 PM, Jake Mannix <jake.mannix@gmail.com> wrote:
> > Hello all,
> >   I know you Lucene devs did a lot of work on indexing performance in
> > 2.3, and I just tested it out last Thursday, so I thought I'd let you
> > know how it fared:
> >
> >   On a 2.17 million document index, a recent test gave these indexing
> > times:
> >
> >     * Lucene 2.2: 4.83 hours
> >     * Lucene 2.3: 26 minutes
> >
> >   About a factor of 11 speedup.  Holy smokes!  Great work folks.
> >
> >
> >   -jake
> >
>
>
>
> --
> "Conscious decisions by conscious minds are what make reality real"
