Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 6576 invoked from network); 28 Jul 2006 06:03:09 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 28 Jul 2006 06:03:09 -0000 Received: (qmail 58670 invoked by uid 500); 28 Jul 2006 06:03:05 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 57776 invoked by uid 500); 28 Jul 2006 06:03:03 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 57765 invoked by uid 99); 28 Jul 2006 06:03:03 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Jul 2006 23:03:03 -0700 X-ASF-Spam-Status: No, hits=0.5 required=10.0 tests=DNS_FROM_RFC_ABUSE,HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: domain of jeff.rodenburg@gmail.com designates 64.233.182.191 as permitted sender) Received: from [64.233.182.191] (HELO nf-out-0910.google.com) (64.233.182.191) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 27 Jul 2006 23:03:02 -0700 Received: by nf-out-0910.google.com with SMTP id o63so79553nfa for ; Thu, 27 Jul 2006 23:02:41 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:references; b=AOaHmiuVU+/HbeT3uYoXF0Xt3MWoLMxLc+HZUjuFhwK7UHDUfB9SrFW6HF0c0k2OCU+LTbS9ZS50j3FQh7FMhtJG/HBzHIOFbN7exC5zarkaLIRgflI6DfPP77rJdpdJpKvD84KL9uwtEYS1GX8vELzQQQCgnRRgHnFsxRertXY= Received: by 10.78.132.12 with SMTP id f12mr804539hud; Thu, 27 Jul 2006 23:02:40 -0700 (PDT) Received: by 10.78.119.3 with HTTP; Thu, 27 Jul 2006 23:02:40 -0700 (PDT) Message-ID: <50f433360607272302g6a835fbag7a1feb1b8d43227d@mail.gmail.com> Date: Thu, 27 Jul 2006 23:02:40 -0700 From: "Jeff Rodenburg" To: java-user@lucene.apache.org Subject: Re: Distributed Search In-Reply-To: <44C93410.6090207@gmail.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_3968_29601276.1154066560840" References: <20060727210100.82026.qmail@web86209.mail.ird.yahoo.com> <44C93410.6090207@gmail.com> X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N ------=_Part_3968_29601276.1154066560840 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi Mark - Having gone down this path for the past year, I echo comments from others that scalability/availability/failover is a lot of work. We migrated away from a custom system based on Lucene running on Windows to Solr running on Linux. It took us 6 months to get our system to a solid five-nines in availability. Having done this previously, I can advise one not to underestimate the effort involved with this. We would have taken the simple route had it been available. We shifted to Solr because of the operational elements that allows us to achieve clustering and failover capability within the Linux/Apache/Tomcat (our flavor) mix. It just works better for us than our home-brew. -- j On 7/27/06, Mark Miller wrote: > > I know there has been a lot of discussion on distributed search...I am > looking for a cross platform solution, which seems to kill solr's > approach...Everyone seems to have implemented this, but only as > proprietary code...it would seem that just using the RMI searcher would > allow a simple solution? Is this the case? Could you easily provide > clustering and fail over using a variety of indexes and searching them > all with RMI searcher? Is it all really that complicated? I have read > that Lucene tops out at about 10m docs for a single server...I want to > hit 100m. I have a beautiful app that allows realtime updating/searching > (updates are rare but should be instant)...and I just want it to scale > up to 100m docs or so . Is that going to be an really advanced project > no matter how I slice it? I have done a lot of custom work with the > lucene stuff so it would seem difficult to adapt it to Nutch (but what > do I know Nutch) ... I have seen a lot of talk but not much on a simple > RMI searcher solution...any idea? > > > - Mark > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org > For additional commands, e-mail: java-user-help@lucene.apache.org > > ------=_Part_3968_29601276.1154066560840--