Date: Wed, 29 Jun 2011 01:00:41 +0000 (UTC)
From: "Todd Lipcon (JIRA)"
To: hdfs-issues@hadoop.apache.org
Message-ID: <501136005.748.1309309241583.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <623674188.52725.1306731827373.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Commented] (HDFS-2010) Clean up and test behavior under failed edit streams

[
https://issues.apache.org/jira/browse/HDFS-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056924#comment-13056924 ]

Todd Lipcon commented on HDFS-2010:
-----------------------------------

- the log before exit should be LOG.fatal
- rather than assign {{runtime}} in the constructor, why not just assign it where the variable is defined?
- rather than using a mockito Answer, you can use normal mockito {{verify()}} code, right?

> Clean up and test behavior under failed edit streams
> ----------------------------------------------------
>
>                 Key: HDFS-2010
>                 URL: https://issues.apache.org/jira/browse/HDFS-2010
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: name-node
>    Affects Versions: Edit log branch (HDFS-1073)
>            Reporter: Todd Lipcon
>            Assignee: Aaron T. Myers
>             Fix For: Edit log branch (HDFS-1073)
>
>         Attachments: hdfs-2010.0.patch, hdfs-2010.1.patch
>
>
> Right now there is very little test coverage of situations where one or more of the edits directories fails. In trunk, the behavior when all of the edits directories are dead is that the NN prints a fatal-level log message and calls Runtime.exit(-1).
> I don't think this is really the behavior we want. Needs a bit of thought, but I think something like the following would make more sense:
> - any calls currently waiting on logSync should end up throwing an exception
> - NN should probably enter safe mode
> - ops can restore edits directories and then ask the NN to restore storage, at which point it could exit safemode
> - alternatively, ops could ask the NN to do saveNamespace and then shut it down
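
For illustration only, the behavior proposed in the issue description might be modeled roughly as below. This is a toy sketch, not Hadoop code and not the attached patch: the class EditLogFailureSketch and its methods reportDirFailure, logSync, restoreStorage, and isInSafeMode are all hypothetical names. It assumes that losing the last edits directory should make logSync throw and put the NN into safe mode instead of exiting the process, and that restoring storage lets it leave safe mode again.

```java
import java.io.IOException;

// Hypothetical toy model, not Hadoop code: tracks how many edits directories
// are healthy and what happens when the last one fails.
class EditLogFailureSketch {
    private int healthyDirs;
    private boolean safeMode = false;

    EditLogFailureSketch(int healthyDirs) {
        this.healthyDirs = healthyDirs;
    }

    // Record the failure of one edits directory; enter safe mode once none are left.
    void reportDirFailure() {
        if (healthyDirs > 0) {
            healthyDirs--;
        }
        if (healthyDirs == 0) {
            safeMode = true;
        }
    }

    // Stand-in for logSync: callers get an exception instead of the process exiting.
    void logSync() throws IOException {
        if (healthyDirs == 0) {
            throw new IOException("no edits directories available");
        }
    }

    // Stand-in for the operator's "restore storage" request after repairing directories.
    void restoreStorage(int restoredDirs) {
        if (restoredDirs > 0) {
            healthyDirs += restoredDirs;
            safeMode = false;
        }
    }

    boolean isInSafeMode() {
        return safeMode;
    }

    public static void main(String[] args) throws IOException {
        EditLogFailureSketch log = new EditLogFailureSketch(2);
        log.reportDirFailure();
        log.logSync(); // one healthy directory remains, so this succeeds
        log.reportDirFailure();
        try {
            log.logSync();
        } catch (IOException e) {
            System.out.println("logSync failed: " + e.getMessage());
        }
        System.out.println("safe mode: " + log.isInSafeMode());
        log.restoreStorage(1);
        System.out.println("safe mode after restore: " + log.isInSafeMode());
    }
}
```

The sketch leaves out the saveNamespace-then-shutdown alternative and any concurrency between callers waiting on logSync; both would need real design work in the NN.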