Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 86820 invoked from network); 22 Feb 2008 17:16:11 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 22 Feb 2008 17:16:11 -0000 Received: (qmail 40534 invoked by uid 500); 22 Feb 2008 17:16:04 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 40508 invoked by uid 500); 22 Feb 2008 17:16:04 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 40499 invoked by uid 99); 22 Feb 2008 17:16:04 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 22 Feb 2008 09:16:04 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 22 Feb 2008 17:15:30 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 003AC234C040 for ; Fri, 22 Feb 2008 09:15:21 -0800 (PST) Message-ID: <1780466077.1203700520999.JavaMail.jira@brutus> Date: Fri, 22 Feb 2008 09:15:20 -0800 (PST) From: "Raghu Angadi (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-2873) Namenode fails to re-start after cluster shutdown - DFSClient: Could not obtain blocks even all datanodes were up & live In-Reply-To: <1810556335.1203677179345.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-2873?page=3Dcom.atlassia= n.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D125= 71474#action_12571474 ]=20 Raghu Angadi commented on HADOOP-2873: -------------------------------------- FSNameSystem.saveFilesUnderConstruction() and FSImage.loadFilesUnderConstru= ction() don't seem to match. FSImage.loadFilesUnderConstruction() assumes there is only one file per lea= se. > Namenode fails to re-start after cluster shutdown - DFSClient: Could not = obtain blocks even all datanodes were up & live > -------------------------------------------------------------------------= ----------------------------------------------- > > Key: HADOOP-2873 > URL: https://issues.apache.org/jira/browse/HADOOP-2873 > Project: Hadoop Core > Issue Type: Bug > Components: dfs > Affects Versions: 0.17.0 > Reporter: Andr=C3=A9 Martin > > Namenode fails to re-start with the following exception: > 2008-02-21 14:20:48,831 INFO org.apache.hadoop.dfs.NameNode: STARTUP_MSG= : > /************************************************************ > STARTUP_MSG: Starting NameNode > STARTUP_MSG: host =3D se09/141.76.xxx.xxx > STARTUP_MSG: args =3D [] > STARTUP_MSG: version =3D 2008-02-19_11-01-48 > STARTUP_MSG: build =3D http://svn.apache.org/repos/asf/hadoop/core/tru= nk -r 628999; compiled by 'hudson' on Tue Feb 19 11:09:05 UTC 2008 > ************************************************************/ > 2008-02-21 14:20:49,367 INFO org.apache.hadoop.metrics.jvm.JvmMetrics: I= nitializing RPC Metrics with serverName=3DNameNode, port=3D8000 > 2008-02-21 14:20:49,374 INFO org.apache.hadoop.dfs.NameNode: Namenode up= at: se09.inf.tu-dresden.de/141.76.xxx.xxx:8000 > 2008-02-21 14:20:49,378 INFO org.apache.hadoop.metrics.jvm.JvmMetrics: I= nitializing JVM Metrics with processName=3DNameNode, sessionId=3Dnull > 2008-02-21 14:20:49,381 INFO org.apache.hadoop.dfs.NameNodeMetrics: Init= ializing NameNodeMeterics using context object:org.apache.hadoop.metrics.sp= i.NullContext > 2008-02-21 14:20:49,501 INFO org.apache.hadoop.fs.FSNamesystem: fsOwner= =3Damartin,students > 2008-02-21 14:20:49,501 INFO org.apache.hadoop.fs.FSNamesystem: supergro= up=3Dsupergroup > 2008-02-21 14:20:49,501 INFO org.apache.hadoop.fs.FSNamesystem: isPermis= sionEnabled=3Dtrue > 2008-02-21 14:20:49,788 INFO org.apache.hadoop.ipc.Server: Stopping serv= er on 8000 > 2008-02-21 14:20:49,790 ERROR org.apache.hadoop.dfs.NameNode: java.io.IO= Exception: Created 13 leases but found 4 > at org.apache.hadoop.dfs.FSImage.loadFilesUnderConstruction(FSImage.= java:935) > at org.apache.hadoop.dfs.FSImage.loadFSImage(FSImage.java:749) > at org.apache.hadoop.dfs.FSImage.loadFSImage(FSImage.java:634) > at org.apache.hadoop.dfs.FSImage.recoverTransitionRead(FSImage.java:= 223) > at org.apache.hadoop.dfs.FSDirectory.loadFSImage(FSDirectory.java:79= ) > at org.apache.hadoop.dfs.FSNamesystem.initialize(FSNamesystem.java:2= 61) > at org.apache.hadoop.dfs.FSNamesystem.(FSNamesystem.java:242) > at org.apache.hadoop.dfs.NameNode.initialize(NameNode.java:131) > at org.apache.hadoop.dfs.NameNode.(NameNode.java:176) > at org.apache.hadoop.dfs.NameNode.(NameNode.java:162) > at org.apache.hadoop.dfs.NameNode.createNameNode(NameNode.java:851) > at org.apache.hadoop.dfs.NameNode.main(NameNode.java:860) > 2008-02-21 14:20:49,791 INFO org.apache.hadoop.dfs.NameNode: SHUTDOWN_MS= G: > /************************************************************ > SHUTDOWN_MSG: Shutting down NameNode at se09/141.76.xxx.xxx > ************************************************************/=20 > Cluster restart was needed since the DFS client produced the following er= ror message even all datanodes were up: > 08/02/21 14:04:35 INFO fs.DFSClient: Could not obtain block blk_-4008950= 704646490788 from any node: java.io.IOException: No live nodes contain cur= rent block --=20 This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.