Date: Sat, 26 Nov 2011 05:47:39 +0000 (UTC)
From: "Ted Yu (Commented) (JIRA)"
To: issues@hbase.apache.org
Message-ID: <1075897944.13066.1322286459893.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <1327483236.13057.1322285980017.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Commented] (HBASE-4875) ZKLeaderManager.handleLeaderChange() doesn't handle KeeperException$SessionExpiredException

[
https://issues.apache.org/jira/browse/HBASE-4875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157376#comment-13157376 ]

Ted Yu commented on HBASE-4875:
-------------------------------

I think we can follow the example from Master dealing with SessionExpiredException:
{code}
    if (t != null && t instanceof KeeperException.SessionExpiredException) {
      try {
        LOG.info("Primary Master trying to recover from ZooKeeper session " +
            "expiry.");
        return !tryRecoveringExpiredZKSession();
{code}

> ZKLeaderManager.handleLeaderChange() doesn't handle KeeperException$SessionExpiredException
> -------------------------------------------------------------------------------------------
>
>                 Key: HBASE-4875
>                 URL: https://issues.apache.org/jira/browse/HBASE-4875
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.92.0
>            Reporter: Ted Yu
>
> TestMasterFailover#testSimpleMasterFailover has failed twice in a row for builds 15 and 16.
> From https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92-security/16/testReport/org.apache.hadoop.hbase.master/TestMasterFailover/testSimpleMasterFailover/:
> {code}
> 2011-11-26 01:34:49,217 DEBUG [RegionServer:0;hemera.apache.org,57516,1322271278190-EventThread] zookeeper.ZooKeeperWatcher(257): regionserver:57516-0x133dd8281300003 Received ZooKeeper Event, type=NodeDeleted, state=SyncConnected, path=/hbase/tokenauth/keymaster
> 2011-11-26 01:34:49,217 WARN  [Thread-1-EventThread] zookeeper.ZKUtil(234): master:52934-0x133dd8281300001 Unable to set watcher on znode /hbase/tokenauth/keymaster
> org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = Session expired for /hbase/tokenauth/keymaster
> 	at org.apache.zookeeper.KeeperException.create(KeeperException.java:127)
> 	at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
> 	at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1003)
> 	at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:154)
> 	at org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndCheckExists(ZKUtil.java:225)
> 	at org.apache.hadoop.hbase.zookeeper.ZKLeaderManager.handleLeaderChange(ZKLeaderManager.java:85)
> 	at org.apache.hadoop.hbase.zookeeper.ZKLeaderManager.nodeDeleted(ZKLeaderManager.java:78)
> 	at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:281)
> 	at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:521)
> 	at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:497)
> 2011-11-26 01:34:49,218 ERROR [Thread-1-EventThread] zookeeper.ZooKeeperWatcher(403): master:52934-0x133dd8281300001 Received unexpected KeeperException, re-throwing exception
> org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = Session expired for /hbase/tokenauth/keymaster
> 	at org.apache.zookeeper.KeeperException.create(KeeperException.java:127)
> 	at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
> 	at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1003)
> 	at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:154)
> 	at org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndCheckExists(ZKUtil.java:225)
> 	at org.apache.hadoop.hbase.zookeeper.ZKLeaderManager.handleLeaderChange(ZKLeaderManager.java:85)
> 	at org.apache.hadoop.hbase.zookeeper.ZKLeaderManager.nodeDeleted(ZKLeaderManager.java:78)
> 	at org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:281)
> 	at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:521)
> 	at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:497)
> 2011-11-26 01:34:49,216 DEBUG [RegionServer:2;hemera.apache.org,44702,1322271278232-EventThread] zookeeper.ZKUtil(230): hconnection-0x133dd8281300009 /hbase/master does not exist. Watcher is set.
> 2011-11-26 01:34:49,215 DEBUG [Thread-1-EventThread] zookeeper.ZKUtil(230): master:44883-0x133dd8281300002 /hbase/master does not exist. Watcher is set.
> 2011-11-26 01:34:49,219 DEBUG [Thread-1-EventThread] master.ActiveMasterManager(104): No master available. Notifying waiting threads
> 2011-11-26 01:34:49,215 INFO  [Master:1;hemera.apache.org,52934,1322271278115] master.HMaster(338): HMaster main thread exiting
> {code}
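
For illustration only, the idea in the comment above (treat SessionExpiredException as a recoverable case instead of re-throwing it from the watcher callback) could be sketched roughly as below. This is not the HBase patch and does not use the real ZooKeeper or HBase classes: `KeeperException`, `SessionExpiredException`, `ZkCall`, `SessionRecovery` and `callWithSessionRecovery` are hypothetical stand-ins defined in the snippet so that it compiles on its own, and the "recover, then retry once" policy is an assumption rather than something stated in the issue.

```java
import java.util.concurrent.atomic.AtomicInteger;

public class LeaderChangeSketch {

    /** Hypothetical stand-in for org.apache.zookeeper.KeeperException. */
    static class KeeperException extends Exception {
        KeeperException(String message) {
            super(message);
        }
    }

    /** Hypothetical stand-in for KeeperException.SessionExpiredException. */
    static class SessionExpiredException extends KeeperException {
        SessionExpiredException(String path) {
            super("KeeperErrorCode = Session expired for " + path);
        }
    }

    /** A ZooKeeper-style call that may fail, such as setting a watch on a znode. */
    interface ZkCall {
        boolean run() throws KeeperException;
    }

    /** Hypothetical recovery hook; returns true if a new session was established. */
    interface SessionRecovery {
        boolean tryRecover();
    }

    /**
     * Runs the call once. On session expiry it attempts recovery and, if that
     * succeeds, retries the call a single time. If recovery fails, the original
     * exception is re-thrown. Other KeeperExceptions propagate unchanged.
     */
    static boolean callWithSessionRecovery(ZkCall call, SessionRecovery recovery)
            throws KeeperException {
        try {
            return call.run();
        } catch (SessionExpiredException e) {
            if (!recovery.tryRecover()) {
                throw e;
            }
            return call.run();
        }
    }

    public static void main(String[] args) throws KeeperException {
        AtomicInteger attempts = new AtomicInteger();
        // Fails with a session expiry on the first attempt, succeeds afterwards.
        ZkCall flaky = () -> {
            if (attempts.incrementAndGet() == 1) {
                throw new SessionExpiredException("/hbase/tokenauth/keymaster");
            }
            return true;
        };
        boolean exists = callWithSessionRecovery(flaky, () -> true);
        System.out.println("exists=" + exists + ", attempts=" + attempts.get());
    }
}
```

In the real code the recovery hook would presumably be something along the lines of the Master's tryRecoveringExpiredZKSession() quoted in the comment; how ZKLeaderManager would obtain a fresh session is not covered by this sketch.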