Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B3AB6D8B6 for ; Mon, 3 Dec 2012 02:44:02 +0000 (UTC) Received: (qmail 89375 invoked by uid 500); 3 Dec 2012 02:44:01 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 89308 invoked by uid 500); 3 Dec 2012 02:44:00 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 89226 invoked by uid 99); 3 Dec 2012 02:43:58 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 03 Dec 2012 02:43:58 +0000 Date: Mon, 3 Dec 2012 02:43:58 +0000 (UTC) From: "liaowenrui (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <1226216248.51766.1354502638293.JavaMail.jiratomcat@arcas> In-Reply-To: <2096728107.39369.1354186378630.JavaMail.jiratomcat@arcas> Subject: [jira] [Commented] (HDFS-4238) [HA] Standby namenode should not do purging of shared storage edits. MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-4238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13508439#comment-13508439 ] liaowenrui commented on HDFS-4238: ---------------------------------- the standby NN never purged any files from the shared storage - only the active did that lwr:this isn't accurate.standby nn purged editlog from the share storage in StandbyCheckpointer. i will changed it to only the active did that for this issue > [HA] Standby namenode should not do purging of shared storage edits. > -------------------------------------------------------------------- > > Key: HDFS-4238 > URL: https://issues.apache.org/jira/browse/HDFS-4238 > Project: Hadoop HDFS > Issue Type: Bug > Components: ha > Affects Versions: 3.0.0, 2.0.2-alpha > Reporter: Vinay > > This happened in our cluster, > >> Standby NN was keep doing checkpoint every one hour and uploading to Active NN was continuously failing due to some kerberos issue and nobody noticed this, since Active was servicing properly. > >> Active NN was up for long time with fsimage having very least transaction. > >> Standby NN has saved the checkpoint in its name dir and purged the txns > 1000000 from shared storage ( includes edits which are not present in Active NN's fsimage) > >> After some time Active NN is restarted and StandBy NN switched to Active. > Now current Standby not able to load any edits from shared storage, as expected edits are not present in shared storage. Its keep running idle. > So {{editLog.purgeLogsOlderThan(purgeLogsFrom);}} always should be called from Active NameNode. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira