lucene-dev mailing list archives

From Marvin Humphrey <mar...@rectangular.com>
Subject Re: caching term information?
Date Sat, 20 May 2006 15:20:41 GMT

On May 20, 2006, at 12:01 AM, Robert Engels wrote:

> Maybe don't cache the term pages, then, just cache the frequently
> requested terms themselves.

That sounds like a winner.  Search term frequencies follow a power
law distribution, so a small fraction of the terms accounts for most
of the lookups.  Cache the top 20% or so in an LRU cache and you'll
cut down on disk seeks and linear scanning significantly.
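
A minimal sketch of what such a cache could look like, assuming term
text as the key and some per-term info object as the value.  The class
name TermLruCache and the capacity parameter are hypothetical and not
part of Lucene; the only real API used is java.util.LinkedHashMap in
access-order mode, which evicts the least recently used entry through
removeEldestEntry.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Hypothetical LRU cache for per-term information (illustration only,
 * not a Lucene class). V stands in for whatever term info is cached.
 */
final class TermLruCache<V> extends LinkedHashMap<String, V> {
    private static final long serialVersionUID = 1L;

    // Maximum number of terms to keep; an assumed tuning knob.
    private final int capacity;

    TermLruCache(int capacity) {
        // accessOrder = true: iteration order runs from least recently
        // accessed to most recently accessed, and get() counts as an access.
        super(16, 0.75f, true);
        this.capacity = capacity;
    }

    @Override
    protected boolean removeEldestEntry(Map.Entry<String, V> eldest) {
        // Called after each put(); returning true drops the least
        // recently used entry once the cache exceeds its capacity.
        return size() > capacity;
    }
}
```

On a miss the caller would still do the usual seek and scan, then put()
the result.  LinkedHashMap is not synchronized, and in access-order
mode even get() modifies the map, so shared use across threads would
need something like Collections.synchronizedMap around it.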

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/



