Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 151ADC7C5 for ; Thu, 17 May 2012 10:15:33 +0000 (UTC) Received: (qmail 78525 invoked by uid 500); 17 May 2012 10:15:32 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 78394 invoked by uid 500); 17 May 2012 10:15:32 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 78378 invoked by uid 99); 17 May 2012 10:15:32 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 17 May 2012 10:15:32 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 17 May 2012 10:15:29 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id A7F8F95EB for ; Thu, 17 May 2012 10:15:08 +0000 (UTC) Date: Thu, 17 May 2012 10:15:08 +0000 (UTC) From: "Uma Maheswara Rao G (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <53091637.8154.1337249708722.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <2005507606.37875.1336460685391.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (HDFS-3423) BookKeeperJournalManager: NN startup is failing, when tries to recoverUnfinalizedSegments() a bad inProgress_ ZNodes MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-3423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uma Maheswara Rao G updated HDFS-3423: -------------------------------------- Issue Type: Sub-task (was: Bug) Parent: HDFS-3399 > BookKeeperJournalManager: NN startup is failing, when tries to recoverUnfinalizedSegments() a bad inProgress_ ZNodes > -------------------------------------------------------------------------------------------------------------------- > > Key: HDFS-3423 > URL: https://issues.apache.org/jira/browse/HDFS-3423 > Project: Hadoop HDFS > Issue Type: Sub-task > Reporter: Rakesh R > Assignee: Ivan Kelly > Attachments: HDFS-3423.diff > > > Say, the InProgress_000X node is corrupted due to not writing the data(version, ledgerId, firstTxId) to this inProgress_000X znode. Namenode startup has the logic to recover all the unfinalized segments, here will try to read the segment and getting shutdown. > {noformat} > EditLogLedgerMetadata.java: > static EditLogLedgerMetadata read(ZooKeeper zkc, String path) > throws IOException, KeeperException.NoNodeException { > byte[] data = zkc.getData(path, false, null); > String[] parts = new String(data).split(";"); > if (parts.length == 3) > ....reading inprogress metadata > else if (parts.length == 4) > ....reading inprogress metadata > else > throw new IOException("Invalid ledger entry, " > + new String(data)); > } > {noformat} > Scenario:- Leaving bad inProgress_000X node ? > Assume BKJM has created the inProgress_000X zNode and ZK is not available when trying to add the metadata. Now, inProgress_000X ends up with partial information. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira