lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doug Cutting <cutt...@apache.org>
Subject Re: Performance of hit highlighting and finding term positions for
Date Wed, 31 Mar 2004 21:54:23 GMT
markharw00d@yahoo.co.uk wrote:
> As a note of warning: I did find StandardTokenizer to be the major culprit in my tokenizing
benchmarks (avg 75ms for 16k sized docs).
> I have found I can live without StandardTokenizer in my apps.

FYI, the message with Mark's timings can be found at:

http://nagoya.apache.org/eyebrowse/ReadMsg?listName=lucene-dev@jakarta.apache.org&msgId=1413989

According to these, if your documents average 16k, then a 10-hit result 
page would require just 66ms to generate highlights using SimpleAnalyzer.

Doug

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message