Return-Path: X-Original-To: apmail-hadoop-common-dev-archive@www.apache.org Delivered-To: apmail-hadoop-common-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 612C4D6D7 for ; Mon, 30 Jul 2012 17:30:39 +0000 (UTC) Received: (qmail 7817 invoked by uid 500); 30 Jul 2012 17:30:37 -0000 Delivered-To: apmail-hadoop-common-dev-archive@hadoop.apache.org Received: (qmail 7758 invoked by uid 500); 30 Jul 2012 17:30:37 -0000 Mailing-List: contact common-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-dev@hadoop.apache.org Delivered-To: mailing list common-dev@hadoop.apache.org Received: (qmail 7749 invoked by uid 99); 30 Jul 2012 17:30:37 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 Jul 2012 17:30:37 +0000 X-ASF-Spam-Status: No, hits=1.8 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,HS_INDEX_PARAM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of mouradk78@googlemail.com designates 209.85.212.170 as permitted sender) Received: from [209.85.212.170] (HELO mail-wi0-f170.google.com) (209.85.212.170) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 Jul 2012 17:30:32 +0000 Received: by wibhq12 with SMTP id hq12so2498511wib.5 for ; Mon, 30 Jul 2012 10:30:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=20120113; h=date:from:to:message-id:subject:x-mailer:mime-version:content-type; bh=JeJkQMqlsCwimFvgcPbAbNFFzfbSaOCbnxWE3grlnrY=; b=b5+GhbfN0QJw/WVukH6vEcW+PpHKn1Td6xSBxy4dWE1XZqChIR5+rlov0mcqU3iFel xDitLpGY9ZwbCTPhCw96O0DPobmZ2BOmfuKlN+Gu7FkN+F4fwpKeF3taMc6e60Q+n4Oo jc2WSgq147Epu6df1Ah1znhb/6lWAyyBTNW/HbQ/y3nwXMT78NlwzmBLrtq7prmrLt7d DCeEV6qNfZaHo3g2dcZb+vQxFQiHdd8HvzOAufvVPWNNactf/aI8f72rQqncn8b867Km QhYnqN7qhJjvB6NFnVpug/ez7VKsrgBePOcadywo3GGVdZPhAspPM7TZqmyRjYg7jngE rd+A== Received: by 10.180.78.99 with SMTP id a3mr44213647wix.15.1343669410701; Mon, 30 Jul 2012 10:30:10 -0700 (PDT) Received: from mouradk.local ([31.24.220.3]) by mx.google.com with ESMTPS id bc2sm25675080wib.0.2012.07.30.10.30.10 (version=TLSv1/SSLv3 cipher=OTHER); Mon, 30 Jul 2012 10:30:10 -0700 (PDT) Date: Mon, 30 Jul 2012 18:30:09 +0100 From: mouradk To: common-dev@hadoop.apache.org Message-ID: Subject: Fix a corrupt edits file? X-Mailer: sparrow 1.5 (build 1043.2) MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="5016c4a1_6a5ee64_d59e" X-Virus-Checked: Checked by ClamAV on apache.org --5016c4a1_6a5ee64_d59e Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit Content-Disposition: inline Hello all, I have just had a problem with a NameNode restart and someone on the mailing list kindly suggested that the edits file was corrupted. I have made a backup copy of the file and checked my /namesecondary/previous.checkpoint but the edits file there is empty 4kb with ????? inside. This suggest to me that I cannot recover from the secondaryNameNode? How do you fix this problem? Thanks for your help. Original error log: TARTUP_MSG: build =https://svn.apache.org/repos/asf/hadoop/common/branches/branch-0.20 -r 911707; compiled by 'chrisdo' on Fri Feb 19 08:07:34 UTC 2010 ************************************************************/ 2012-07-30 16:02:23,649 INFO org.apache.hadoop.ipc.metrics.RpcMetrics: Initializing RPC Metrics with hostName=NameNode, port=50001 2012-07-30 16:02:23,656 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: Namenode up at: localhost/127.0.0.1:50001 2012-07-30 16:02:23,659 INFO org.apache.hadoop.metrics.jvm.JvmMetrics: Initializing JVM Metrics with processName=NameNode, sessionId=null 2012-07-30 16:02:23,660 INFO org.apache.hadoop.hdfs.server.namenode.metrics.NameNodeMetrics: Initializing NameNodeMeterics using context object:org.apache.hadoop.metrics.spi.NullContext 2012-07-30 16:02:23,714 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: fsOwner=hadoop,hadoop 2012-07-30 16:02:23,714 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: supergroup=supergroup 2012-07-30 16:02:23,714 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: isPermissionEnabled=false 2012-07-30 16:02:23,721 INFO org.apache.hadoop.hdfs.server.namenode.metrics.FSNamesystemMetrics: Initializing FSNamesystemMetrics using context object:org.apache.hadoop.metrics.spi.NullContext 2012-07-30 16:02:23,723 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Registered FSNamesystemStatusMBean 2012-07-30 16:02:23,756 INFO org.apache.hadoop.hdfs.server.common.Storage: Number of files = 533 2012-07-30 16:02:23,833 INFO org.apache.hadoop.hdfs.server.common.Storage: Number of files under construction = 2 2012-07-30 16:02:23,835 INFO org.apache.hadoop.hdfs.server.common.Storage: Image file of size 55400 loaded in 0 seconds. 2012-07-30 16:02:23,844 ERROR org.apache.hadoop.hdfs.server.namenode.NameNode: java.lang.NumberFormatException: For input string: "1343506" at java.lang.NumberFormatException.forInputString(NumberFormatException.java:48) at java.lang.Long.parseLong(Long.java:419) at java.lang.Long.parseLong(Long.java:468) at org.apache.hadoop.hdfs.server.namenode.FSEditLog.readLong(FSEditLog.java:1273) at org.apache.hadoop.hdfs.server.namenode.FSEditLog.loadFSEdits(FSEditLog.java:775) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSEdits(FSImage.java:992) at org.apache.hadoop.hdfs.server.namenode.FSImage.loadFSImage(FSImage.java:812) at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:364) at org.apache.hadoop.hdfs.server.namenode.FSDirectory.loadFSImage(FSDirectory.java:87) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.initialize(FSNamesystem.java:311) at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.(FSNamesystem.java:292) at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:201) at org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:279) at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:956) at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:965) 2012-07-30 16:02:23,845 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: SHUTDOWN_MSG: Mouradk Mouradk Sent with Sparrow (http://www.sparrowmailapp.com/?sig) --5016c4a1_6a5ee64_d59e--