Return-Path: Delivered-To: apmail-lucene-hadoop-user-archive@locus.apache.org Received: (qmail 8233 invoked from network); 24 Aug 2007 23:32:44 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 24 Aug 2007 23:32:44 -0000 Received: (qmail 70606 invoked by uid 500); 24 Aug 2007 23:32:41 -0000 Delivered-To: apmail-lucene-hadoop-user-archive@lucene.apache.org Received: (qmail 70107 invoked by uid 500); 24 Aug 2007 23:32:39 -0000 Mailing-List: contact hadoop-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-user@lucene.apache.org Delivered-To: mailing list hadoop-user@lucene.apache.org Received: (qmail 70098 invoked by uid 99); 24 Aug 2007 23:32:39 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 24 Aug 2007 16:32:39 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [216.109.112.28] (HELO mrout2-b.corp.dcn.yahoo.com) (216.109.112.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 24 Aug 2007 23:33:14 +0000 Received: from [10.72.122.47] (shinefelt-lm.corp.yahoo.com [10.72.122.47]) by mrout2-b.corp.dcn.yahoo.com (8.13.6/8.13.6/y.out) with ESMTP id l7ONVrkV080336 for ; Fri, 24 Aug 2007 16:31:53 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; s=serpent; d=yahoo-inc.com; c=nofws; q=dns; h=mime-version:in-reply-to:references:content-type:message-id: content-transfer-encoding:from:subject:date:to:x-mailer; b=INDxe73rv47kCmg7BzoZqRqvJ+7CYXBrj9vThBMQ6VNo1OzpnML9f2zflyVvRdUm Mime-Version: 1.0 (Apple Message framework v752.3) In-Reply-To: References: Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Message-Id: Content-Transfer-Encoding: 7bit From: Eric Baldeschwieler Subject: Re: Poly-reduce? Date: Fri, 24 Aug 2007 16:31:35 -0700 To: X-Mailer: Apple Mail (2.752.3) X-Virus-Checked: Checked by ClamAV on apache.org especially at scale! And we are testing on >1000 node clusters with long jobs. We see lots of failures per job. On Aug 24, 2007, at 4:20 PM, Ted Dunning wrote: > > > > On 8/24/07 12:11 PM, "Doug Cutting" wrote: > > > > >> Using the same logic, streaming reduce outputs to > >> the next map and reduce steps (before the first reduce is complete) > >> should also provide speedup. > > > > Perhaps, but the bookeeping required in the jobtracker might be > onerous. > > The failure modes are more complex, complicating recovery. > > Frankly, I find Doug's arguments about reliability fairly compelling. > > Map-reduce^n is not the same, nor is it entirely analogous to pipe- > style > programming. It feels the same, but there are very important > differences > that I wasn't thinking of when I made this suggestion. The most > important > is the issue of reliability. In a large cluster, failure is a > continuous > process, not an isolated event. As such, the problems of having to > roll > back an entire program due to node failure are not something that > can be > treated as unusual. That makes Doug's comments about risk more on- > point > than the potential gains. It is very easy to imagine scenarios > where the > possibility of program roll-back results in very large average run- > times > while the chained reduce results in only incremental savings. This > isn't a > good thing to bet on. > >