Date: Tue, 21 Jul 2009 13:05:56 +0530
Subject: Re: Too many fetch failures
From: Jothi Padmanabhan
In-Reply-To: <9c39bdeb0907210003i718f9f47saf94868debff45dd@mail.gmail.com>

This error occurs when several reducers are unable to fetch the given map output (attempt_200907202331_0001_m_000001_0 in your example). I am guessing that there is a configuration issue in your setup: the reducers are not able to contact the TaskTracker or transfer map outputs from it. The TT log on the node where the map ran could shed some light on the error. Could you verify that all the nodes in your cluster are able to connect to each other? You could also manually log in to the reducer node, try pulling the map output yourself, and see what error you get.

Cheers
Jothi

On 7/21/09 12:33 PM, "George Pang" wrote:

> Hi users,
>
> I got this "Too many fetch failures" in the following error message:
>
> 09/07/20 23:33:39 INFO mapred.JobClient: map 100% reduce 16%
> 09/07/20 23:46:22 INFO mapred.JobClient: Task Id :
> attempt_200907202331_0001_m_000001_0, Status : FAILED
> Too many fetch-failures
> 09/07/20 23:46:37 INFO mapred.JobClient: Job complete: job_200907202331_0001
>
> Don't know why it always stops at reduce 16% then assumes. It takes a long
> time even to run a small task.
>
> I saw people asking the same question on the mailing list before, but I didn't
> get the help I need.
>
> Hadoop version: 0.18.3
> Ubuntu version: 8.04
>
> Thank you in advance!
>
> George
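
The connectivity check suggested in the reply could be sketched roughly as follows. This is only a minimal illustration, not part of Hadoop: it assumes the TaskTrackers serve map output over HTTP on port 50060 (the usual default for mapred.task.tracker.http.address; adjust it if your configuration differs), and the helper names can_connect and check_nodes are made up for this example.

```python
import socket

# Assumption: 50060 is the TaskTracker HTTP port (the usual default for
# mapred.task.tracker.http.address); change it if your config differs.
TASKTRACKER_HTTP_PORT = 50060


def can_connect(host, port=TASKTRACKER_HTTP_PORT, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and hostname lookup failures.
        return False


def check_nodes(hosts, port=TASKTRACKER_HTTP_PORT, timeout=5.0):
    """Map each hostname to whether its port is reachable from this machine."""
    return {host: can_connect(host, port, timeout) for host in hosts}
```

Calling check_nodes with the hostnames listed in conf/slaves, once from every node in the cluster, should show which pairs of machines cannot reach each other. A False for a node that is known to be up often points at hostname resolution (for example /etc/hosts entries) or a firewall, though the TaskTracker log remains the more direct source for the actual error.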