Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 05AD492E7 for ; Wed, 8 Feb 2012 19:27:22 +0000 (UTC) Received: (qmail 49814 invoked by uid 500); 8 Feb 2012 19:27:21 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 49622 invoked by uid 500); 8 Feb 2012 19:27:20 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 49604 invoked by uid 99); 8 Feb 2012 19:27:20 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Feb 2012 19:27:20 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 08 Feb 2012 19:27:19 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 7BC9A1AB212 for ; Wed, 8 Feb 2012 19:26:59 +0000 (UTC) Date: Wed, 8 Feb 2012 19:26:59 +0000 (UTC) From: "Aaron T. Myers (Commented) (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <1718656071.16006.1328729219508.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <1061660440.13806.1328690577519.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Commented] (HDFS-2915) HA: TestFailureOfSharedDir.testFailureOfSharedDir() has race condition MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-2915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13203875#comment-13203875 ] Aaron T. Myers commented on HDFS-2915: -------------------------------------- bq. I am planning to make the shared dir inaccessible by changing write permissions (instead of deleting the directory). That gets rid all potential race conditions. Sure, that'd be fine too. > HA: TestFailureOfSharedDir.testFailureOfSharedDir() has race condition > ---------------------------------------------------------------------- > > Key: HDFS-2915 > URL: https://issues.apache.org/jira/browse/HDFS-2915 > Project: Hadoop HDFS > Issue Type: Sub-task > Components: name-node > Affects Versions: HA branch (HDFS-1623) > Reporter: Bikas Saha > Assignee: Bikas Saha > Priority: Minor > > The test deletes the shared edits dir to simulate a failure. Then it calls rollEditLogs() to trigger the deleted dir to be used and fail with an IOException. Unfortunately, deleting the shared dir can put the NN in safe mode due to lack of space. This causes a SafeModeException to be thrown when rollEditDirs() is called. This exception is caught as an IOException in the test but the associated assert in the catch block fails. > This always happens in the debugger because the delay in stepping through causes the safe mode change to happen before rollEditLogs() gets called. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira