Date: Sun, 6 May 2012 20:45:48 +0000 (UTC)
From: "Matt Foley (JIRA)"
To: hdfs-issues@hadoop.apache.org
Message-ID: <135160711.32387.1336337148113.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <1552916955.31820.1333011150954.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Updated] (HDFS-3161) 20 Append: Excluded DN replica from recovery should be removed from DN.

     [ https://issues.apache.org/jira/browse/HDFS-3161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Foley updated HDFS-3161:
-----------------------------

    Target Version/s: 1.1.0  (was: 1.0.3)
       Fix Version/s:     (was: 1.0.3)

No patch submitted in 1.0.3 timeframe. If still intend to fix, please work in 1.1 context (branch-1). Thank you.

> 20 Append: Excluded DN replica from recovery should be removed from DN.
> -----------------------------------------------------------------------
>
>                 Key: HDFS-3161
>                 URL: https://issues.apache.org/jira/browse/HDFS-3161
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 1.0.0
>            Reporter: suja s
>            Priority: Critical
>
> 1) DN1->DN2->DN3 are in the pipeline.
> 2) The client is killed abruptly.
> 3) One DN restarts, say DN3.
> 4) In DN3, info.wasRecoveredOnStartup() will be true.
> 5) NN recovery is triggered; DN3 is skipped from recovery because of the above check.
> 6) Now DN1 and DN2 have the block with generation stamp 2, while DN3 has the older generation stamp, say 1, and DN3 still has this block entry in ongoingCreates.
> 7) As part of recovery, the file is closed with only two live replicas (from DN1 and DN2).
> 8) So the NN issues a replication command. Now DN3 also has the replica with the newer generation stamp.
> 9) Now DN3 contains 2 replicas on disk, and one entry in ongoingCreates referring to the blocksBeingWritten directory.
> When we call append/leaseRecovery, it may again skip this node for that recovery, as the blockId entry is still present in ongoingCreates with startup recovery true.
> This may repeat on every recovery.
> The stale replica will not be cleaned up until we restart the cluster. The actual replica will be transferred to this node only through the replication process.
> Also, those replicated blocks will be needlessly invalidated after subsequent recoveries.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
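
The cycle in the quoted description (steps 4 through 9, plus the follow-up append/lease recovery) can be illustrated with a small toy model. This is only a sketch under stated assumptions: StaleReplicaSketch, OngoingCreate, DataNode, newPipeline, recover and replicate are hypothetical names invented for illustration and are not Hadoop classes or methods. The only names taken from the report are ongoingCreates and the wasRecoveredOnStartup() check, which the recoveredOnStartup flag stands in for.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy model only: these are NOT Hadoop classes, and all names are hypothetical.
final class StaleReplicaSketch {

    /** Stand-in for an entry in a DataNode's ongoingCreates map. */
    static final class OngoingCreate {
        final long generationStamp;
        final boolean recoveredOnStartup; // models info.wasRecoveredOnStartup()

        OngoingCreate(long generationStamp, boolean recoveredOnStartup) {
            this.generationStamp = generationStamp;
            this.recoveredOnStartup = recoveredOnStartup;
        }
    }

    static final class DataNode {
        final String name;
        final Map<Long, OngoingCreate> ongoingCreates = new HashMap<>();
        final Map<Long, Long> finalizedStamps = new HashMap<>(); // blockId -> stamp

        DataNode(String name) {
            this.name = name;
        }

        /** Counts the in-progress entry and the finalized replica separately. */
        int replicaCount(long blockId) {
            return (ongoingCreates.containsKey(blockId) ? 1 : 0)
                 + (finalizedStamps.containsKey(blockId) ? 1 : 0);
        }
    }

    /** Steps 1-4: three nodes hold the block at stamp 1; DN3 has restarted. */
    static List<DataNode> newPipeline(long blockId) {
        List<DataNode> nodes = new ArrayList<>();
        for (String name : new String[] {"DN1", "DN2", "DN3"}) {
            DataNode dn = new DataNode(name);
            boolean restarted = name.equals("DN3");
            dn.ongoingCreates.put(blockId, new OngoingCreate(1L, restarted));
            nodes.add(dn);
        }
        return nodes;
    }

    /** Steps 5-7: recovery skips any node whose entry was recovered on startup. */
    static List<String> recover(long blockId, long newStamp, List<DataNode> nodes) {
        List<String> participants = new ArrayList<>();
        for (DataNode dn : nodes) {
            OngoingCreate entry = dn.ongoingCreates.get(blockId);
            if (entry != null && entry.recoveredOnStartup) {
                continue; // skipped, and the stale entry is never removed
            }
            dn.ongoingCreates.remove(blockId);
            dn.finalizedStamps.put(blockId, newStamp);
            participants.add(dn.name);
        }
        return participants;
    }

    /** Step 8: replication copies the finalized replica to the target node. */
    static void replicate(long blockId, long stamp, DataNode target) {
        target.finalizedStamps.put(blockId, stamp);
    }

    public static void main(String[] args) {
        long blockId = 42L;
        List<DataNode> nodes = newPipeline(blockId);
        DataNode dn3 = nodes.get(2);
        System.out.println("first recovery: " + recover(blockId, 2L, nodes));
        replicate(blockId, 2L, dn3);
        System.out.println("DN3 replicas: " + dn3.replicaCount(blockId));
        System.out.println("second recovery: " + recover(blockId, 3L, nodes));
        System.out.println("DN3 finalized stamp: " + dn3.finalizedStamps.get(blockId));
    }
}
```

In this model the stale entry on DN3 survives every call to recover, so DN3 is skipped each time and its replicated copy falls behind the new generation stamp again. Removing the skipped node's entry inside recover, as the issue title suggests, would break the cycle in the model; whether that is the right fix in branch-1 is for the patch discussion to decide.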