Return-Path: Delivered-To: apmail-hadoop-hdfs-dev-archive@minotaur.apache.org Received: (qmail 45214 invoked from network); 10 Jul 2009 10:07:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 10 Jul 2009 10:07:27 -0000 Received: (qmail 40744 invoked by uid 500); 10 Jul 2009 10:07:37 -0000 Delivered-To: apmail-hadoop-hdfs-dev-archive@hadoop.apache.org Received: (qmail 40675 invoked by uid 500); 10 Jul 2009 10:07:37 -0000 Mailing-List: contact hdfs-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-dev@hadoop.apache.org Delivered-To: mailing list hdfs-dev@hadoop.apache.org Received: (qmail 40665 invoked by uid 99); 10 Jul 2009 10:07:37 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Jul 2009 10:07:37 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Jul 2009 10:07:34 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id D678F234C044 for ; Fri, 10 Jul 2009 03:07:14 -0700 (PDT) Message-ID: <189432432.1247220434877.JavaMail.jira@brutus> Date: Fri, 10 Jul 2009 03:07:14 -0700 (PDT) From: "vijayan.B (JIRA)" To: hdfs-dev@hadoop.apache.org Subject: [jira] Created: (HDFS-485) error : too many fetch failures MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org error : too many fetch failures ------------------------------- Key: HDFS-485 URL: https://issues.apache.org/jira/browse/HDFS-485 Project: Hadoop HDFS Issue Type: Bug Components: data-node Environment: fedora - 8 , virtual environment using Xen Reporter: vijayan.B Fix For: 0.20.1 i have a hadoop cluster configured with 1 physcical machine as name node and 7 data nodes(2 physcical+5 virtual). When a sort job (hadoop-core-examples) is submitted it completes with the following error:can anyone tell me why and how to solve this issue. hadoop version:0.18.3 O/P from terminal******************************** [root@hadoop1 hadoop-0.18.3]# bin/hadoop jar hadoop-0.18.3-examples.jar sort input13 output1 Running on 7 nodes to sort from hdfs://hadoop1:8022/user/root/input13 into hdfs://hadoop1:8022/user/root/output1 with 12 reduces. Job started: Fri Jul 10 14:00:19 IST 2009 09/07/10 14:00:19 INFO mapred.FileInputFormat: Total input paths to process : 1 09/07/10 14:00:19 INFO mapred.FileInputFormat: Total input paths to process : 1 09/07/10 14:00:19 INFO mapred.JobClient: Running job: job_200907101344_0002 09/07/10 14:00:20 INFO mapred.JobClient: map 0% reduce 0% 09/07/10 14:00:24 INFO mapred.JobClient: map 6% reduce 0% 09/07/10 14:00:25 INFO mapred.JobClient: map 12% reduce 0% 09/07/10 14:00:28 INFO mapred.JobClient: map 31% reduce 0% 09/07/10 14:00:29 INFO mapred.JobClient: map 50% reduce 0% 09/07/10 14:00:33 INFO mapred.JobClient: map 66% reduce 0% 09/07/10 14:00:34 INFO mapred.JobClient: map 72% reduce 0% 09/07/10 14:00:35 INFO mapred.JobClient: map 75% reduce 0% 09/07/10 14:00:37 INFO mapred.JobClient: map 75% reduce 1% 09/07/10 14:00:38 INFO mapred.JobClient: map 78% reduce 5% 09/07/10 14:00:39 INFO mapred.JobClient: map 89% reduce 10% 09/07/10 14:00:40 INFO mapred.JobClient: map 89% reduce 11% 09/07/10 14:00:41 INFO mapred.JobClient: map 90% reduce 11% 09/07/10 14:00:42 INFO mapred.JobClient: map 99% reduce 14% 09/07/10 14:00:43 INFO mapred.JobClient: map 99% reduce 16% 09/07/10 14:00:44 INFO mapred.JobClient: map 99% reduce 18% 09/07/10 14:00:45 INFO mapred.JobClient: map 99% reduce 19% 09/07/10 14:00:47 INFO mapred.JobClient: map 99% reduce 22% 09/07/10 14:00:48 INFO mapred.JobClient: map 100% reduce 22% 09/07/10 14:00:50 INFO mapred.JobClient: map 100% reduce 24% 09/07/10 14:00:52 INFO mapred.JobClient: map 100% reduce 25% 09/07/10 14:00:53 INFO mapred.JobClient: map 100% reduce 26% 09/07/10 14:00:54 INFO mapred.JobClient: map 100% reduce 27% 09/07/10 14:00:58 INFO mapred.JobClient: map 100% reduce 33% 09/07/10 14:01:00 INFO mapred.JobClient: map 100% reduce 34% 09/07/10 14:01:03 INFO mapred.JobClient: map 100% reduce 39% 09/07/10 14:03:29 INFO mapred.JobClient: map 100% reduce 40% 09/07/10 14:03:42 INFO mapred.JobClient: map 100% reduce 41% 09/07/10 14:03:50 INFO mapred.JobClient: map 100% reduce 47% 09/07/10 14:03:51 INFO mapred.JobClient: map 100% reduce 50% 09/07/10 14:03:52 INFO mapred.JobClient: map 100% reduce 56% 09/07/10 14:03:57 INFO mapred.JobClient: map 100% reduce 57% 09/07/10 14:04:00 INFO mapred.JobClient: map 100% reduce 58% 09/07/10 14:04:05 INFO mapred.JobClient: map 100% reduce 59% 09/07/10 14:04:07 INFO mapred.JobClient: map 100% reduce 60% 09/07/10 14:04:09 INFO mapred.JobClient: map 100% reduce 66% 09/07/10 14:04:10 INFO mapred.JobClient: map 100% reduce 67% 09/07/10 14:04:12 INFO mapred.JobClient: map 100% reduce 68% 09/07/10 14:04:13 INFO mapred.JobClient: map 100% reduce 69% 09/07/10 14:04:14 INFO mapred.JobClient: map 100% reduce 70% 09/07/10 14:04:20 INFO mapred.JobClient: map 100% reduce 79% 09/07/10 14:04:21 INFO mapred.JobClient: map 100% reduce 80% 09/07/10 14:04:22 INFO mapred.JobClient: map 100% reduce 81% 09/07/10 14:04:23 INFO mapred.JobClient: map 100% reduce 82% 09/07/10 14:04:33 INFO mapred.JobClient: map 100% reduce 87% 09/07/10 14:04:42 INFO mapred.JobClient: Task Id : attempt_200907101344_0002_m_000013_0, Status : FAILED Too many fetch-failures 09/07/10 14:04:44 INFO mapred.JobClient: map 93% reduce 87% 09/07/10 14:04:50 INFO mapred.JobClient: map 100% reduce 87% 09/07/10 14:05:06 INFO mapred.JobClient: map 100% reduce 93% 09/07/10 14:05:36 INFO mapred.JobClient: map 100% reduce 94% 09/07/10 14:06:17 INFO mapred.JobClient: Job complete: job_200907101344_0002 09/07/10 14:06:17 INFO mapred.JobClient: Counters: 17 09/07/10 14:06:17 INFO mapred.JobClient: File Systems 09/07/10 14:06:17 INFO mapred.JobClient: HDFS bytes read=1077612760 09/07/10 14:06:17 INFO mapred.JobClient: HDFS bytes written=1077285377 09/07/10 14:06:17 INFO mapred.JobClient: Local bytes read=1083539214 09/07/10 14:06:17 INFO mapred.JobClient: Local bytes written=2167083496 09/07/10 14:06:17 INFO mapred.JobClient: Job Counters 09/07/10 14:06:17 INFO mapred.JobClient: Launched reduce tasks=18 09/07/10 14:06:17 INFO mapred.JobClient: Rack-local map tasks=2 09/07/10 14:06:17 INFO mapred.JobClient: Launched map tasks=18 09/07/10 14:06:17 INFO mapred.JobClient: Data-local map tasks=15 09/07/10 14:06:17 INFO mapred.JobClient: Map-Reduce Framework 09/07/10 14:06:17 INFO mapred.JobClient: Reduce input groups=102341 09/07/10 14:06:17 INFO mapred.JobClient: Combine output records=0 09/07/10 14:06:17 INFO mapred.JobClient: Map input records=102341 09/07/10 14:06:17 INFO mapred.JobClient: Reduce output records=102341 09/07/10 14:06:17 INFO mapred.JobClient: Map output bytes=1074564177 09/07/10 14:06:17 INFO mapred.JobClient: Map input bytes=1077284885 09/07/10 14:06:17 INFO mapred.JobClient: Combine input records=0 09/07/10 14:06:17 INFO mapred.JobClient: Map output records=102341 09/07/10 14:06:17 INFO mapred.JobClient: Reduce input records=102341 Job ended: Fri Jul 10 14:06:17 IST 2009 The job took 357 seconds. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.