lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kevin A. Burton" <bur...@newsmonster.org>
Subject Re: Performance of hit highlighting and finding term positions for
Date Thu, 01 Apr 2004 02:43:21 GMT
Doug Cutting wrote:

> http://nagoya.apache.org/eyebrowse/ReadMsg?listName=lucene-dev@jakarta.apache.org&msgId=1413989

>
>
> According to these, if your documents average 16k, then a 10-hit 
> result page would require just 66ms to generate highlights using 
> SimpleAnalyzer.

The whole search takes only 300ms... this means that if I highlight 5 
docs I've doubled my search time.

Note that Google has a whole subsection of their cluster dedicated to 
keyword in context extraction.

Kevin

-- 

Please reply using PGP.

    http://peerfear.org/pubkey.asc    
    
    NewsMonster - http://www.newsmonster.org/
    
Kevin A. Burton, Location - San Francisco, CA, Cell - 415.595.9965
       AIM/YIM - sfburtonator,  Web - http://peerfear.org/
GPG fingerprint: 5FB2 F3E2 760E 70A8 6174 D393 E84D 8D04 99F1 4412
  IRC - freenode.net #infoanarchy | #p2p-hackers | #newsmonster


Mime
View raw message