Return-Path: Delivered-To: apmail-lucene-hadoop-user-archive@locus.apache.org Received: (qmail 70500 invoked from network); 31 Jul 2007 15:01:57 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 31 Jul 2007 15:01:57 -0000 Received: (qmail 40864 invoked by uid 500); 31 Jul 2007 15:00:38 -0000 Delivered-To: apmail-lucene-hadoop-user-archive@lucene.apache.org Received: (qmail 40624 invoked by uid 500); 31 Jul 2007 15:00:37 -0000 Mailing-List: contact hadoop-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-user@lucene.apache.org Delivered-To: mailing list hadoop-user@lucene.apache.org Received: (qmail 39607 invoked by uid 99); 31 Jul 2007 15:00:34 -0000 Received: from Unknown (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 31 Jul 2007 08:00:33 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [146.169.1.194] (HELO finch.doc.ic.ac.uk) (146.169.1.194) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 31 Jul 2007 14:36:44 +0000 Received: from tern.doc.ic.ac.uk ([146.169.1.119] helo=localhost) by finch.doc.ic.ac.uk with esmtp (Exim 4.63) (envelope-from ) id 1IFspj-0002hB-3b for hadoop-user@lucene.apache.org; Tue, 31 Jul 2007 15:36:23 +0100 Received: from host-83-146-62-153.bulldogdsl.com (host-83-146-62-153.bulldogdsl.com [83.146.62.153]) by webmail.doc.ic.ac.uk (Horde MIME library) with HTTP; Tue, 31 Jul 2007 15:36:23 +0100 Message-ID: <20070731153623.6419jivmucckk8wk@tern.doc.ic.ac.uk> Date: Tue, 31 Jul 2007 15:36:23 +0100 From: ojh06@doc.ic.ac.uk To: hadoop-user@lucene.apache.org Subject: Re: Error reporting from map function References: <20070730214209.keqxk01140kococg@tern.doc.ic.ac.uk> <20070730214414.GB5688@killa.net> <68d348060707310416p73ab0818pc4efdaa2b9c0dc8f@mail.gmail.com> In-Reply-To: <68d348060707310416p73ab0818pc4efdaa2b9c0dc8f@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; DelSp="Yes"; format="flowed" Content-Disposition: inline Content-Transfer-Encoding: 7bit User-Agent: Internet Messaging Program (IMP) H3 (4.1.3) X-ICDoC-Trusted: YES X-BrightMail-Verdict: X-BrightMail-Tracker: X-Spam-Level: X-Virus-Checked: Checked by ClamAV on apache.org X-Old-Spam-Flag: NO Well, I don't think it will be too much of a problem for me. I'll only be running this one type of job. The problem I have is that I can only throw IOExceptions out of the Mapper function. So if a job fails for some other reason, other than my numerical calculation error I have no way of knowing. I'd like to retry if its a communication problem, but terminate if its a calculation problem within my function. I'm getting the feeling this isn't possible? Quoting Jeroen Verhagen : > Hi, > > On 7/30/07, Anthony D. Urso wrote: >> Call JobConf.setMaxMapAttempts(0) in the job conf, then throw an exception >> when your mapper fails. This should kill the entire job instantly, since >> the job tracker will allow no mapper failures. > > Wouldn't this cause all other running and future jobs to stop > attempting to recover from an error? Or do all jobs have copies of the > original job conf? > > -- > > regards, > > Jeroen >