Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8456911834 for ; Mon, 30 Jun 2014 12:29:25 +0000 (UTC) Received: (qmail 42600 invoked by uid 500); 30 Jun 2014 12:29:25 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 42544 invoked by uid 500); 30 Jun 2014 12:29:25 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 42530 invoked by uid 99); 30 Jun 2014 12:29:25 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 Jun 2014 12:29:25 +0000 Date: Mon, 30 Jun 2014 12:29:25 +0000 (UTC) From: "Vinayakumar B (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HDFS-4120) Add a new "-skipSharedEditsCheck" option for BootstrapStandby MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14047608#comment-14047608 ] Vinayakumar B commented on HDFS-4120: ------------------------------------- Hi rakesh, Thanks for adding a test. Following are comments about the same. 1. The new test added actually passes even without "-skipSharedEditsCheck" option. QJM doesn't need this option. May be this we can try with BKJM? 2. After stopping the standby NN, need to clear out old files before calling bootstrap again and then can look for the checkpoints. > Add a new "-skipSharedEditsCheck" option for BootstrapStandby > ------------------------------------------------------------- > > Key: HDFS-4120 > URL: https://issues.apache.org/jira/browse/HDFS-4120 > Project: Hadoop HDFS > Issue Type: Improvement > Components: ha, namenode > Affects Versions: 3.0.0, 2.0.2-alpha > Reporter: Liang Xie > Assignee: Liang Xie > Priority: Minor > Attachments: HDFS-4120.patch, HDFS-4120.txt > > > Per https://issues.apache.org/jira/browse/HDFS-3752?focusedCommentId=13449466&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13449466 , let's introduce a new option, it should be very safe, but really useful for some corner case. e.g. when SNN losts local storage, we need to reset SNN, but in current trunk, it'll always get a FATAL msg and could never be successful. Another workaroud for this case, is full-sync the "current" directory from ANN, but it'll be cost more disk-space & net bandwidth, IMHO. -- This message was sent by Atlassian JIRA (v6.2#6252)