Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4435191D4 for ; Tue, 7 Feb 2012 02:55:30 +0000 (UTC) Received: (qmail 15376 invoked by uid 500); 7 Feb 2012 02:55:27 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 14582 invoked by uid 500); 7 Feb 2012 02:55:25 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 14571 invoked by uid 99); 7 Feb 2012 02:55:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Feb 2012 02:55:24 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 07 Feb 2012 02:55:22 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id C6BF9CCA6A for ; Tue, 7 Feb 2012 02:55:00 +0000 (UTC) Date: Tue, 7 Feb 2012 02:55:00 +0000 (UTC) From: "Phabricator (Updated) (JIRA)" To: issues@hbase.apache.org Message-ID: <2026078860.6447.1328583300815.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <2048270211.6439.1328583059369.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (HBASE-5344) [89-fb] Scan unassigned region directory on master failover MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-5344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phabricator updated HBASE-5344: ------------------------------- Attachment: D1605.1.patch mbautin requested code review of "[jira] [HBASE-5344] [89-fb] Scan unassigned region directory on master failover". Reviewers: Kannan, Karthik, Liyin, JIRA, stack In case the master dies after a regionserver writes region state as OPENED or CLOSED in ZK but before the update is received by master and written to meta, the new master that comes up has to pick up the region state from ZK and write it to meta. Otherwise we can get multiply-assigned regions. The current solution tries to reassign the root region if it is unassigned but does not implement a work-around if META regions are missing. Also, it currently heavily relies on "direct scanning" of regionservers (reading regionserver list from ZK and doing an RPC on each regionserver to get the list of online regions). We were already doing that in master failover, but I am making it parallel here. TEST PLAN Unit tests, dev cluster, dark launch with killing regionservers and master REVISION DETAIL https://reviews.facebook.net/D1605 AFFECTED FILES src/main/java/org/apache/hadoop/hbase/executor/RegionTransitionEventData.java src/main/java/org/apache/hadoop/hbase/ipc/HBaseServer.java src/main/java/org/apache/hadoop/hbase/master/BaseScanner.java src/main/java/org/apache/hadoop/hbase/master/DirectRegionServerScanner.java src/main/java/org/apache/hadoop/hbase/master/HMaster.java src/main/java/org/apache/hadoop/hbase/master/ProcessRegionOpen.java src/main/java/org/apache/hadoop/hbase/master/RegionManager.java src/main/java/org/apache/hadoop/hbase/master/RootScanner.java src/main/java/org/apache/hadoop/hbase/master/ServerManager.java src/main/java/org/apache/hadoop/hbase/master/ZKUnassignedWatcher.java src/main/java/org/apache/hadoop/hbase/master/handler/MasterOpenRegionHandler.java src/test/java/org/apache/hadoop/hbase/master/TestRegionStateOnMasterFailure.java MANAGE HERALD DIFFERENTIAL RULES https://reviews.facebook.net/herald/view/differential/ WHY DID I GET THIS EMAIL? https://reviews.facebook.net/herald/transcript/3429/ Tip: use the X-Herald-Rules header to filter Herald messages in your client. > [89-fb] Scan unassigned region directory on master failover > ----------------------------------------------------------- > > Key: HBASE-5344 > URL: https://issues.apache.org/jira/browse/HBASE-5344 > Project: HBase > Issue Type: Bug > Reporter: Mikhail Bautin > Assignee: Mikhail Bautin > Attachments: D1605.1.patch > > > In case the master dies after a regionserver writes region state as OPENED or CLOSED in ZK but before the update is received by master and written to meta, the new master that comes up has to pick up the region state from ZK and write it to meta. Otherwise we can get multiply-assigned regions. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira