Date: Sat, 21 Jan 2012 02:48:39 +0000 (UTC)
From: "Aaron T. Myers (Created) (JIRA)"
To: hdfs-dev@hadoop.apache.org
Reply-To: hdfs-dev@hadoop.apache.org
Message-ID: <979532600.62562.1327114119748.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Created] (HDFS-2823) HA: Transition to active can cause NN deadlock

HA: Transition to active can cause NN deadlock
----------------------------------------------

                 Key: HDFS-2823
                 URL: https://issues.apache.org/jira/browse/HDFS-2823
             Project: Hadoop HDFS
          Issue Type: Sub-task
          Components: ha, name-node
    Affects Versions: HA branch (HDFS-1623)
            Reporter: Aaron T. Myers
            Assignee: Aaron T. Myers
            Priority: Blocker


On transition to active, we have to take the FSNS write lock. In {{EditLogTailer#stop}}, we interrupt the edit log tailer thread and then join on that thread. When tailing edits, the edit log tailer thread acquires the FSNS write lock interruptibly, precisely so that we avoid deadlocks on transition to active.

However, the edit log tailer thread now also triggers edit log rolls. Several places in {{ipc.Client}} catch and ignore {{InterruptedException}}, and in so doing may cause the {{Thread#interrupt}} call to be missed by the edit log tailer thread.
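
The failure mode described in this report can be illustrated in isolation. The following is a minimal sketch, not Hadoop code: the class and method names (SwallowedInterruptDemo, stopWorker) are hypothetical, a plain ReentrantLock stands in for the FSNS write lock, and Thread.sleep stands in for a blocking RPC such as the one that triggers an edit log roll. It assumes the caller holds the lock while it interrupts and joins the worker, which appears to be the situation on transition to active. If the worker swallows the InterruptedException, its later lockInterruptibly call no longer sees the interrupt and blocks; if it restores the interrupt flag, the call fails fast and the thread exits.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical stand-alone demo; none of these names come from Hadoop.
final class SwallowedInterruptDemo {

    /**
     * Interrupts and joins a worker while holding a lock the worker needs.
     * Returns true if the worker exited within the join timeout.
     */
    static boolean stopWorker(boolean restoreInterrupt) {
        ReentrantLock lock = new ReentrantLock(); // stand-in for the FSNS write lock
        CountDownLatch started = new CountDownLatch(1);

        Thread worker = new Thread(() -> {
            started.countDown();
            try {
                Thread.sleep(60_000); // stand-in for a blocking RPC call
            } catch (InterruptedException e) {
                if (restoreInterrupt) {
                    // Re-assert the interrupt so later blocking calls observe it.
                    Thread.currentThread().interrupt();
                }
                // Otherwise the interrupt is caught and ignored.
            }
            try {
                lock.lockInterruptibly(); // throws only if the flag is still set
                lock.unlock();
            } catch (InterruptedException e) {
                // Interrupt observed: give up on the lock and exit.
            }
        });
        worker.setDaemon(true);

        lock.lock(); // the caller holds the lock for the whole stop sequence
        try {
            worker.start();
            started.await();
            worker.interrupt();
            worker.join(500); // bounded here; an unbounded join would hang
            return !worker.isAlive();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("caller interrupted", e);
        } finally {
            lock.unlock();
        }
    }

    public static void main(String[] args) {
        System.out.println("swallowed: worker stopped = " + stopWorker(false));
        System.out.println("restored:  worker stopped = " + stopWorker(true));
    }
}
```

With the interrupt swallowed, the worker should still be alive when the bounded join returns; with the flag restored, it should have exited. The demo uses a timed join only so that it terminates. With an unbounded join, as in the stop sequence described above, the swallowed case would leave both threads waiting on each other.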