lucene-java-user mailing list archives

From "Nader S. Henein" <>
Subject RE: commercial websites powered by Lucene?
Date Tue, 24 Jun 2003 11:39:29 GMT
About 100 documents every twenty minutes, but it fluctuates depending on
how much traffic is on the site.

-----Original Message-----
From: news [] On Behalf Of Chris Miller
Sent: Tuesday, June 24, 2003 3:28 PM
Subject: Re: commercial websites powered by Lucene?

Hmm, good point about the cost of copying indices in a distributed
environment, although that is unlikely to affect us in the foreseeable
future. But, noted!

Do you have any rough statistics on how many documents you index/day, or
how many every 20 minutes?

This discussion is fantastic by the way, lots of great experience and
comments coming out here. Thanks, it's really appreciated.

"Nader S. Henein" <> wrote in message
> We thought of that in the beginning, but then we became more
> comfortable with multiple indices for simple backup purposes. Our
> indices are now in excess of 100 MB, and transferring that much
> data between three machines sitting in the same data center is
> passable. But once you start thinking of distributed webservers in
> different hosting facilities, copying 100 MB every 20 minutes, or
> even every hour, becomes financially expensive.
> Our webservers are single-processor Sun Ultra Sparc III 400 MHz
> machines with two gigs of memory, and I've never seen the CPU usage
> go over 0.8 at peak time with the indexer running. Try it out first,
> and take your time to gather your own numbers so you can really get
> a feel for which setup fits you best.
> Nader
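
To put rough numbers on the copying cost mentioned in the quoted message, here is a minimal back-of-the-envelope sketch. It assumes the whole index (about 100 MB, the figure given in the thread) is recopied in full to each remote server on a fixed schedule, with no compression or incremental transfer. The class, method, and parameter names are hypothetical and are not part of Lucene.

```java
// Back-of-the-envelope estimate of index replication traffic.
// Assumption: the full index is recopied to every target at a fixed interval.
// All names here are hypothetical; nothing in this class uses the Lucene API.
final class IndexTransferEstimate {

    /** Megabytes sent per day when a full index copy goes to each target. */
    static long dailyMegabytes(long indexMb, int intervalMinutes, int targets) {
        long copiesPerDay = (24L * 60L) / intervalMinutes; // whole copies per day
        return copiesPerDay * indexMb * targets;
    }

    public static void main(String[] args) {
        // 100 MB index, copied every 20 minutes to one remote server
        System.out.println(dailyMegabytes(100, 20, 1));
        // Same index, copied once an hour to one remote server
        System.out.println(dailyMegabytes(100, 60, 1));
        // Every 20 minutes to two remote servers
        System.out.println(dailyMegabytes(100, 20, 2));
    }
}
```

Under these assumptions, a 20-minute schedule moves about 7.2 GB per day to each remote server, against 2.4 GB for an hourly one. What that costs depends on the hosting contract, so the advice to gather your own numbers still applies.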
