Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F3CFC42B6 for ; Sun, 12 Jun 2011 14:48:05 +0000 (UTC) Received: (qmail 51779 invoked by uid 500); 12 Jun 2011 14:48:03 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 51714 invoked by uid 500); 12 Jun 2011 14:48:03 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 51706 invoked by uid 99); 12 Jun 2011 14:48:03 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 12 Jun 2011 14:48:03 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [208.247.100.55] (HELO searshc.com) (208.247.100.55) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 12 Jun 2011 14:47:54 +0000 Received: from ([157.241.173.134]) by Usdmzsvpsmtpct2.searshc.com with ESMTP id 5502345.114701235; Sun, 12 Jun 2011 10:47:30 -0400 Received: from ustrysvpexch23.kih.kmart.com ([157.241.187.245]) by ustrysvpsmtp03.kih.kmart.com with Microsoft SMTPSVC(6.0.3790.4675); Sun, 12 Jun 2011 10:47:30 -0400 X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Subject: RE: Help - can't start namenode after disk full error Date: Sun, 12 Jun 2011 10:47:29 -0400 Message-ID: <34E9B7D077A04640BDFB43DF226509C20432D573@ustrysvpexch23.kih.kmart.com> In-Reply-To: <34E9B7D077A04640BDFB43DF226509C20432D572@ustrysvpexch23.kih.kmart.com> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Help - can't start namenode after disk full error Thread-Index: AcwpDTKkIHh8m14URQyDxldVWSOBeAAAWKxwAAA6ISA= References: <34E9B7D077A04640BDFB43DF226509C20432D572@ustrysvpexch23.kih.kmart.com> From: "Zhong, Andy" To: X-OriginalArrivalTime: 12 Jun 2011 14:47:30.0121 (UTC) FILETIME=[A7ABE390:01CC290F] Only apply it to /dfs/name/current/edits file... -----Original Message----- From: Zhong, Andy [mailto:Sheng.Zhong@searshc.com]=20 Sent: Sunday, June 12, 2011 9:43 AM To: common-user@hadoop.apache.org Subject: RE: Help - can't start namenode after disk full error Could you backup your edits file, try $ printf "\xff\xff\xff\xee\xff" > edits, and start HDFS? It should work. - Andy=20 -----Original Message----- From: Ryan LeCompte [mailto:lecompte@gmail.com] Sent: Sunday, June 12, 2011 9:29 AM To: common-user@hadoop.apache.org Subject: Help - can't start namenode after disk full error Hey guys, Really trying to get our namenode back up and running after a full disk error last night. I've freed up a lot of space, however the NameNode still fails to startup: 2011-06-12 10:26:09,042 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Registered FSNamesystemStatusMBean 2011-06-12 10:26:09,083 INFO org.apache.hadoop.hdfs.server.common.Storage: Number of files =3D 614919 2011-06-12 10:26:22,293 INFO org.apache.hadoop.hdfs.server.common.Storage: Number of files under construction =3D 17 2011-06-12 10:26:22,300 INFO org.apache.hadoop.hdfs.server.common.Storage: Image file of size 102029859 loaded in 13 seconds. 2011-06-12 10:26:22,510 ERROR org.apache.hadoop.hdfs.server.namenode.NameNode: java.lang.NumberFormatException: For input string: "" at java.lang.NumberFormatException.forInputString(NumberFormatException.jav a:48) at java.lang.Integer.parseInt(Integer.java:468) at java.lang.Short.parseShort(Short.java:120) at java.lang.Short.parseShort(Short.java:78) at org.apache.hadoop.hdfs.server.namenode.FSEditLog.readShort(FSEditLog.jav a:1269) at org.apache.hadoop.hdfs.server.namenode.FSEditLog.loadFSEdits(FSEditLog.j ava:550) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSEdits(FSImage.java: 992) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSImage(FSImage.java: 812) at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSI mage.java:364) at org.apache.hadoop.hdfs.server.namenode.FSDirectory.loadFSImage(FSDirecto ry.java:87) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.initialize(FSNamesys tem.java:311) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.(FSNamesystem. java:292) at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java :201) at org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:279 ) at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode. java:956) at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:965) We currently have our config setup as follows: dfs.name.dir /data1/hadoop/dfs/name,/data2/hadoop/dfs/name,/data3/hadoop/dfs/n ame,/data4/hadoop/dfs/name I've looked in each of those directories for an image/edits.new file, but only the "edits" files exist. Can anyone please guide me on the next step here to get this back up and running? Thanks! Ryan This message, including any attachments, is the property of Sears Holdings Corporation and/or one of its subsidiaries. It is confidential and may contain proprietary or legally privileged information. If you are not the intended recipient, please delete it without reading the contents. Thank you.