Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@apache.org Received: (qmail 25397 invoked from network); 24 Jun 2003 11:33:20 -0000 Received: from exchange.sun.com (192.18.33.10) by daedalus.apache.org with SMTP; 24 Jun 2003 11:33:20 -0000 Received: (qmail 4076 invoked by uid 97); 24 Jun 2003 11:35:40 -0000 Delivered-To: qmlist-jakarta-archive-lucene-user@nagoya.betaversion.org Received: (qmail 4069 invoked from network); 24 Jun 2003 11:35:40 -0000 Received: from daedalus.apache.org (HELO apache.org) (208.185.179.12) by nagoya.betaversion.org with SMTP; 24 Jun 2003 11:35:40 -0000 Received: (qmail 25082 invoked by uid 500); 24 Jun 2003 11:33:17 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 24990 invoked from network); 24 Jun 2003 11:33:15 -0000 Received: from unknown (HELO mail-out.dubaiinternetcity.net) (213.132.36.135) by daedalus.apache.org with SMTP; 24 Jun 2003 11:33:15 -0000 Received: from naderit ([213.132.40.2]) by dicisp003b.dic.sys (iPlanet Messaging Server 5.2 Patch 1 (built Aug 19 2002)) with ESMTP id <0HGZ007MGHFE0O@dicisp003b.dic.sys> for lucene-user@jakarta.apache.org; Tue, 24 Jun 2003 15:33:14 +0400 (GMT) Date: Tue, 24 Jun 2003 15:39:29 +0400 From: "Nader S. Henein" Subject: RE: commercial websites powered by Lucene? In-reply-to: To: 'Lucene Users List' Reply-to: nsh@bayt.net Message-id: <002701c33a45$45e44c90$1801a8c0@naderit> Organization: Bayt.com MIME-version: 1.0 X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2800.1106 X-Mailer: Microsoft Outlook, Build 10.0.4024 Content-type: text/plain; charset=US-ASCII Content-transfer-encoding: 7BIT Importance: Normal X-Priority: 3 (Normal) X-MSMail-priority: Normal X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N X-Spam-Rating: daedalus.apache.org 1.6.2 0/1000/N About 100 documents every twenty minutes, but it fluctuates depending on how much traffic is on the site -----Original Message----- From: news [mailto:news@main.gmane.org] On Behalf Of Chris Miller Sent: Tuesday, June 24, 2003 3:28 PM To: lucene-user@jakarta.apache.org Subject: Re: commercial websites powered by Lucene? Hmm, good point with the cost of copying indicies in a distributed environment, although that is unlikely to affect us in the foreseeable future. But, noted! Do you have any rough statistics on how many documents you index/day, or how many every 20 minutes? This discussion is fantastic by the way, lots of great experience and comments coming out here. Thanks, it's really appreciated. "Nader S. Henein" wrote in message news:002401c33a42$6a350ce0$1801a8c0@naderit... > We thought of that in the beginning and then we became more > comfortable with multiple indices for simple backup purposes, and now > our indices are in excess of 100megs, and transferring that kind of > data between three machines sitting in the same data center is > passable, but once you start thinking of distributed webservers in > different hosting facilities, copying 100Megs every 20 minutes, or > even every hour becomes financially expensive. > > Our webservers are on Single Processor Sun Ultra Sparc III 400 Mhz > with two gegs of memory, and I've never seen the CPU usage go over 0.8 > at peek time with the indexer running. Try it out first, take your > time to gather your own numbers so you can really get a feel of what > set up fits you best. > > Nader --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org