Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 60885 invoked from network); 14 Aug 2010 11:09:55 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 14 Aug 2010 11:09:55 -0000 Received: (qmail 8962 invoked by uid 500); 14 Aug 2010 11:09:53 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 8747 invoked by uid 500); 14 Aug 2010 11:09:50 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 8739 invoked by uid 99); 14 Aug 2010 11:09:49 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 Aug 2010 11:09:49 +0000 X-ASF-Spam-Status: No, hits=3.5 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL,URI_HEX X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of erickerickson@gmail.com designates 209.85.216.48 as permitted sender) Received: from [209.85.216.48] (HELO mail-qw0-f48.google.com) (209.85.216.48) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 Aug 2010 11:09:43 +0000 Received: by qwk3 with SMTP id 3so3086280qwk.35 for ; Sat, 14 Aug 2010 04:09:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=D6SZ0Y8DV2uTXdw4xNtT6skayGOzapJRu/wLN/pnxII=; b=RYSa/YHJyUCC7WOph0cjLwEaT2Y/fj3j6VNEcqCOo3kUPtwkJBDr0HUTHyG9MLxmSa e6N9IUXCkFVKeXPmHyPS2efq8MfkL2T3ThocbEuFrU/jJ9b2QpGHD7x3V3sosDWUBY+a SNsZLVMHBKbUaiAwfVgFjpZYYtDPlrwlHeLu0= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=N4jiUGL8qeRulacDFp3WwMDBzmFduJDh6Bugfm3JfXNT8tja1i5fGLrcSuk6akLB6x gQv06SBqbYYwy3SfHeB/4S1a+4rWpsi6qEDAvKOBHR0rV+k/yv7KhGIUTGBJitWH79WI Xw6LrWJwY/zSd5fIIDalblEHfocCcEsXxRHZE= MIME-Version: 1.0 Received: by 10.224.31.138 with SMTP id y10mr1813051qac.289.1281784162332; Sat, 14 Aug 2010 04:09:22 -0700 (PDT) Received: by 10.229.219.11 with HTTP; Sat, 14 Aug 2010 04:09:22 -0700 (PDT) In-Reply-To: <1281749081591-1142517.post@n3.nabble.com> References: <1281749081591-1142517.post@n3.nabble.com> Date: Sat, 14 Aug 2010 07:09:22 -0400 Message-ID: Subject: Re: scalability limit in terms of numbers of large documents From: Erick Erickson To: java-user@lucene.apache.org Content-Type: multipart/alternative; boundary=000feaf9578dacb0ba048dc6a12e --000feaf9578dacb0ba048dc6a12e Content-Type: text/plain; charset=ISO-8859-1 As asked, that's really an unanswerable question. The math is pretty easy in terms of running out of document IDs, but "searched quickly" depends on too many variables. I suspect, though, that long before you ran out of document IDs, you'd need to shard your index, Have you looked at SOLR? Best Erick On Fri, Aug 13, 2010 at 9:24 PM, andynuss wrote: > > Hi, > > Lets say that I am indexing large book documents broken into chapters. A > typical book that you buy at amazon. What would be the approximate limit > to > the number of books that can be indexed slowly and searched quickly. The > search unit would be a chapter, so assume that a book is divided into 15-50 > chapters. Any ideas? > > Andy > -- > View this message in context: > http://lucene.472066.n3.nabble.com/scalability-limit-in-terms-of-numbers-of-large-documents-tp1142517p1142517.html > Sent from the Lucene - Java Users mailing list archive at Nabble.com. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > --000feaf9578dacb0ba048dc6a12e--