lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Marvin Humphrey <>
Subject Re: caching term information?
Date Sat, 20 May 2006 15:20:41 GMT

On May 20, 2006, at 12:01 AM, Robert Engels wrote:

> Maybe don't cache the term pages, then, just cache the frequently  
> requested
> terms themselves.

That sounds like a winner.  Search term frequencies follow a power  
law distribution.  Cache the top 20% or so in an LRU and you'll cut  
down on disk seeks and linear scanning significantly.

Marvin Humphrey
Rectangular Research

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message