lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Otis Gospodnetic <>
Subject Observations: profiling indexing process
Date Wed, 20 Nov 2002 06:59:32 GMT

I decided to run a little Lucene app that does some indexing under a
profiler. (I used JMP,, a rather simple

The app uses StandardAnalyzer.
I've noticed that a lot of time is spent in StandardTokenizer and
various JavaCC-generated methods.
I am wondering if anyone tried replacing StandardTokenizer.jj with
something more efficient?

Also,StopFilter is using a Hashtable to store the list of stop words. 
Has anyone tried using HashMap instead?


Do you Yahoo!?
Yahoo! Web Hosting - Let the expert host your site

To unsubscribe, e-mail:   <>
For additional commands, e-mail: <>

View raw message