Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 10450 invoked from network); 11 Aug 2006 22:10:50 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 11 Aug 2006 22:10:50 -0000 Received: (qmail 25066 invoked by uid 500); 11 Aug 2006 22:10:45 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 25039 invoked by uid 500); 11 Aug 2006 22:10:44 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 25021 invoked by uid 99); 11 Aug 2006 22:10:43 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Aug 2006 15:10:43 -0700 X-ASF-Spam-Status: No, hits=0.5 required=10.0 tests=DNS_FROM_RFC_ABUSE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: domain of markrmiller@gmail.com designates 66.249.82.224 as permitted sender) Received: from [66.249.82.224] (HELO wx-out-0506.google.com) (66.249.82.224) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Aug 2006 15:10:41 -0700 Received: by wx-out-0506.google.com with SMTP id s15so761097wxc for ; Fri, 11 Aug 2006 15:10:20 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:user-agent:mime-version:to:subject:references:in-reply-to:content-type:content-transfer-encoding; b=h4VIufC1xwQI1NS20YAE7xcMEHC3cufIajP6oV0BdpygtQzIaXJokQHi7phPIBDShbOwDXo72GaPLc8olgYHokHsExekzr4tJb8cj9c6oIIwZa3j59EJYMCmzwmkc49eH4lf74E98HI6iAP+BGy60zjvt205mNyx8KgeuVLrksw= Received: by 10.70.32.13 with SMTP id f13mr5910992wxf; Fri, 11 Aug 2006 15:10:19 -0700 (PDT) Received: from ?192.168.1.102? ( [216.66.115.97]) by mx.gmail.com with ESMTP id 44sm3597620wri.2006.08.11.15.10.18; Fri, 11 Aug 2006 15:10:19 -0700 (PDT) Message-ID: <44DD0057.1030606@gmail.com> Date: Fri, 11 Aug 2006 18:10:31 -0400 From: Mark Miller User-Agent: Thunderbird 1.5.0.5 (Windows/20060719) MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: 30 milllion+ docs on a single server References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N I've made a nice little archive application with lucene. I made it to handle our largest need: 2.5 million docs or so on a single server. Now the powers that be say: lets use it for a 30+ million document archive on a single server! (each doc size maybe 10k max...as small as a 1 or 2k) Please tell me why we are in trouble...please tell me why we are not. I have tested up to 2 million docs without much trouble but 30 million...the average search will include a sort on a field as well...can I search 30+ million docs with a sort? Man am I worried about that. Maybe the server will have 8 procs and 12 billion gigs of RAM. Mabye. Even still, Tomcat seems to be able to launch with a max of 1.5 or 1.6 gig of Ram in Windows. What do you think? 30 million+ sounds like too much of a load to me for a single server. Not that they care what I think...I only wrote the thing (man I hate my job, offer me a new one :) )...please...comments? Cheers, Miserable Mark --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org