Return-Path: Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: (qmail 44826 invoked from network); 8 Nov 2010 23:20:04 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 8 Nov 2010 23:20:04 -0000 Received: (qmail 6929 invoked by uid 500); 8 Nov 2010 23:20:35 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 6883 invoked by uid 500); 8 Nov 2010 23:20:35 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 6768 invoked by uid 99); 8 Nov 2010 23:20:35 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Nov 2010 23:20:35 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Nov 2010 23:20:33 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id oA8NKCU6029624 for ; Mon, 8 Nov 2010 23:20:12 GMT Message-ID: <27112037.87911289258412162.JavaMail.jira@thor> Date: Mon, 8 Nov 2010 18:20:12 -0500 (EST) From: "Jonathan Gray (JIRA)" To: issues@hbase.apache.org Subject: [jira] Updated: (HBASE-3207) If we get IOException when closing a region, we should still remove it from online regions and complete the close in ZK In-Reply-To: <7045247.87791289258171384.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-3207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Gray updated HBASE-3207: --------------------------------- Status: Patch Available (was: Open) > If we get IOException when closing a region, we should still remove it from online regions and complete the close in ZK > ----------------------------------------------------------------------------------------------------------------------- > > Key: HBASE-3207 > URL: https://issues.apache.org/jira/browse/HBASE-3207 > Project: HBase > Issue Type: Bug > Components: regionserver > Affects Versions: 0.90.0 > Reporter: Jonathan Gray > Assignee: Jonathan Gray > Fix For: 0.90.0 > > Attachments: HBASE-3207-v1.patch > > > Ran into issue on cluster where HDFS was taken out from under it. RS eventually tried to shut itself down. As regions were being closed, they got IOException "Filesystem closed". In the CloseRegionHandlers, this was causing the close operation to not finish (in ZK and in the online region list in RS). That, in turn, held up the waitOnAllRegionsToClose() so the RS never shut down. > If we get an IOException during a close, which can happen if fatal error doing flush, this is not recoverable so we should complete the region close in ZK and by removing from map of online regions on that RS. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.