Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 85368 invoked from network); 31 Mar 2010 11:30:25 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 31 Mar 2010 11:30:25 -0000 Received: (qmail 66625 invoked by uid 500); 31 Mar 2010 11:30:22 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 66543 invoked by uid 500); 31 Mar 2010 11:30:22 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 66535 invoked by uid 99); 31 Mar 2010 11:30:22 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 31 Mar 2010 11:30:22 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of pallavi.palleti@corp.aol.com designates 64.12.206.40 as permitted sender) Received: from [64.12.206.40] (HELO imr-ma02.mx.aol.com) (64.12.206.40) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 31 Mar 2010 11:30:14 +0000 Received: from AOLMTCMEH01.ad.office.aol.com (aolmtcmeh01.office.aol.com [10.178.121.20]) by imr-ma02.mx.aol.com (8.14.1/8.14.1) with ESMTP id o2VBTkOX007392 for ; Wed, 31 Mar 2010 07:29:46 -0400 Received: from AOLMTCMEI01.ad.office.aol.com ([10.178.3.18]) by AOLMTCMEH01.ad.office.aol.com with Microsoft SMTPSVC(6.0.3790.3959); Wed, 31 Mar 2010 07:29:46 -0400 Received: from eu042874.in.office.aol.com ([10.178.3.10]) by AOLMTCMEI01.ad.office.aol.com over TLS secured channel with Microsoft SMTPSVC(6.0.3790.3959); Wed, 31 Mar 2010 07:29:45 -0400 Message-ID: <4BB33232.8050400@corp.aol.com> Date: Wed, 31 Mar 2010 16:59:54 +0530 From: Pallavi Palleti User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.8) Gecko/20100301 Fedora/3.0.3-1.fc12 Lightning/1.0b1 Thunderbird/3.0.3 MIME-Version: 1.0 To: common-user@hadoop.apache.org Subject: Re: Query over DFSClient References: <4BB1E10A.7060609@corp.aol.com> In-Reply-To: <4BB1E10A.7060609@corp.aol.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-OriginalArrivalTime: 31 Mar 2010 11:29:45.0919 (UTC) FILETIME=[771FD4F0:01CAD0C5] X-Virus-Checked: Checked by ClamAV on apache.org Hi, I am looking into hadoop-20 source code for below issue. From DFSClient, I could see that once the datanodes given by namenode are not reachable, it is setting "lastException" variable to error message saying "recovery from primary datanode is failed N times, aborting.."(line No:2546 in processDataNodeError). However, I couldn't figure out where this exception is thrown. I could see the throw statement in isClosed() but not finding the exact sequence after Streamer exits with lastException set to isClosed() method call. It would be great if some one could shed some light on this. I am essentially looking whether DFSClient approaches namenode in the case of failure of all datanodes that namenode has given for a given data block previously. Thanks Pallavi On 03/30/2010 05:01 PM, Pallavi Palleti wrote: > Hi, > > Could some one kindly let me know if the DFSClient takes care of > datanode failures and attempt to write to another datanode if primary > datanode (and replicated datanodes) fail. I looked into the souce code > of DFSClient and figured out that it attempts to write to one of the > datanodes in pipeline and fails if it failed to write to at least one > of them. However, I am not sure as I haven't explored fully. If so, is > there a way of querying namenode to provide different datanodes in the > case of failure. I am sure the Mapper would be doing similar > thing(attempting to fetch different datanode from namenode) if it > fails to write to datanodes. Kindly let me know. > > Thanks > Pallavi >