Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 1328 invoked from network); 20 Aug 2010 21:28:16 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 Aug 2010 21:28:16 -0000 Received: (qmail 19418 invoked by uid 500); 20 Aug 2010 21:28:14 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 19229 invoked by uid 500); 20 Aug 2010 21:28:13 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 19215 invoked by uid 99); 20 Aug 2010 21:28:13 -0000 Received: from Unknown (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 Aug 2010 21:28:13 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of qwertymaniac@gmail.com designates 74.125.82.176 as permitted sender) Received: from [74.125.82.176] (HELO mail-wy0-f176.google.com) (74.125.82.176) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 Aug 2010 21:27:51 +0000 Received: by wyb35 with SMTP id 35so5083211wyb.35 for ; Fri, 20 Aug 2010 14:27:31 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=1UT+atABKNaRqhxB8m00P2g5wHadurW1OmLPUwb9ENQ=; b=nLFjPBkG6W/DOG59XKprX703WpACiNS6s2RBKxwJhmDZ6ViJe9YSZKPVUTiGXgNjwg xqHmxBs7hLlC/5lQC8mH4DChaX3fKXZCd2j7rXBA94vk1zd+amuIhESGzXqunfHejVDN 6x+ScOATsuCP4eWgNZoHWyz29vWOdEuDs0rBI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=isZKpo/00brpwm0uNWh2cKLD+93leW/QUrOgYRsxS1VJDqyMPVHUYho7tVKx2YpOMi YIPBHX9ENBXX8OS/FKmxhQkpi+B1quCcQSMVdpuZMSGQV3+8qJyQSNK0VI+4EjpQYhKB d5x0BTvUpJerA6E9mIXrMrgxY4wgw/4SxM7BM= Received: by 10.216.231.230 with SMTP id l80mr1808822weq.53.1282339651214; Fri, 20 Aug 2010 14:27:31 -0700 (PDT) MIME-Version: 1.0 Received: by 10.216.236.197 with HTTP; Fri, 20 Aug 2010 14:27:11 -0700 (PDT) In-Reply-To: <721161.30127.qm@web45113.mail.sp1.yahoo.com> References: <721161.30127.qm@web45113.mail.sp1.yahoo.com> From: Harsh J Date: Sat, 21 Aug 2010 02:57:11 +0530 Message-ID: Subject: Re: what will happen if a backup name node folder becomes unaccessible? To: common-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Whee, lets try it out: Start with both paths available. ... Starts fine. Store some files. ... Works. rm -r the second path. ... Ouch. Store some more files. ... Still Works. [Cuz the SNN hasn't sent us stuff back yet] Wait for checkpoint to hit. And ... Boom! 2010-08-21 02:42:00,385 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Roll Edit Log from 127.0.0.1 2010-08-21 02:42:00,385 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Number of transactions: 37 Total time for transactions(ms): 6Number of transactions batched in Syncs: 0 Number of syncs: 26 SyncTimes(ms): 307 277 2010-08-21 02:42:00,439 FATAL org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Fatal Error : All storage directories are inaccessible. 2010-08-21 02:42:00,440 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: SHUTDOWN_MSG: /************************************************************ SHUTDOWN_MSG: Shutting down NameNode at localhost/127.0.0.1 ************************************************************/ So yes, as Edward says - never let this happen! On Sat, Aug 21, 2010 at 2:26 AM, jiang licht wrote: > Using nfs folder to back up dfs meta information as follows, > > > =A0=A0=A0 =A0=A0=A0 dfs.name.dir > =A0=A0=A0 =A0=A0=A0 /hadoop/dfs/name,/hadoop-backup/dfs/name > =A0=A0=A0 > > where /hadoop-backup is on a backup machine and mounted on the master nod= e. > > I have a question: if somehow, the backup folder becomes unavailable, wil= l it freeze master node? That is, will write operation simply hang up on th= is condition on the master node? Or will master node log the problem and co= ntinues to work? > > Thanks, > > Michael > > > --=20 Harsh J www.harshj.com