Return-Path: Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: (qmail 43603 invoked from network); 8 Nov 2010 23:16:03 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 8 Nov 2010 23:16:03 -0000 Received: (qmail 2081 invoked by uid 500); 8 Nov 2010 23:16:35 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 1998 invoked by uid 500); 8 Nov 2010 23:16:35 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 1990 invoked by uid 99); 8 Nov 2010 23:16:35 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Nov 2010 23:16:35 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 08 Nov 2010 23:16:33 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id oA8NGBhf029571 for ; Mon, 8 Nov 2010 23:16:11 GMT Message-ID: <7045247.87791289258171384.JavaMail.jira@thor> Date: Mon, 8 Nov 2010 18:16:11 -0500 (EST) From: "Jonathan Gray (JIRA)" To: issues@hbase.apache.org Subject: [jira] Created: (HBASE-3207) If we get IOException when closing a region, we should still remove it from online regions and complete the close in ZK MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org If we get IOException when closing a region, we should still remove it from online regions and complete the close in ZK ----------------------------------------------------------------------------------------------------------------------- Key: HBASE-3207 URL: https://issues.apache.org/jira/browse/HBASE-3207 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.90.0 Reporter: Jonathan Gray Assignee: Jonathan Gray Fix For: 0.90.0 Ran into issue on cluster where HDFS was taken out from under it. RS eventually tried to shut itself down. As regions were being closed, they got IOException "Filesystem closed". In the CloseRegionHandlers, this was causing the close operation to not finish (in ZK and in the online region list in RS). That, in turn, held up the waitOnAllRegionsToClose() so the RS never shut down. If we get an IOException during a close, which can happen if fatal error doing flush, this is not recoverable so we should complete the region close in ZK and by removing from map of online regions on that RS. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.