Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8143AD552 for ; Tue, 17 Jul 2012 21:37:35 +0000 (UTC) Received: (qmail 80817 invoked by uid 500); 17 Jul 2012 21:37:35 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 80763 invoked by uid 500); 17 Jul 2012 21:37:35 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 80754 invoked by uid 99); 17 Jul 2012 21:37:35 -0000 Received: from issues-vm.apache.org (HELO issues-vm) (140.211.11.160) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 17 Jul 2012 21:37:35 +0000 Received: from isssues-vm.apache.org (localhost [127.0.0.1]) by issues-vm (Postfix) with ESMTP id 04FDA14281C for ; Tue, 17 Jul 2012 21:37:35 +0000 (UTC) Date: Tue, 17 Jul 2012 21:37:35 +0000 (UTC) From: "Jonathan Hsieh (JIRA)" To: issues@hbase.apache.org Message-ID: <1966548999.66128.1342561055024.JavaMail.jiratomcat@issues-vm> In-Reply-To: <1311404678.4738.1335436517956.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Updated] (HBASE-5883) Backup master is going down due to connection refused exception MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-5883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Hsieh updated HBASE-5883: ---------------------------------- Resolution: Fixed Fix Version/s: (was: 0.94.2) 0.94.1 Status: Resolved (was: Patch Available) I'm resolving this. I believe Greg tested this and the addendum version and found that the pre-addendum version fixed the problem more effectively than afterwards. The patch has been committed already on 0.94.1 (it is in the 0.94.1rc0) an in the other branches. Please file a new issue if to address the addendum. > Backup master is going down due to connection refused exception > --------------------------------------------------------------- > > Key: HBASE-5883 > URL: https://issues.apache.org/jira/browse/HBASE-5883 > Project: HBase > Issue Type: Bug > Components: master > Affects Versions: 0.90.6, 0.92.1, 0.94.0 > Reporter: Gopinathan A > Assignee: Jieshan Bean > Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 > > Attachments: 90-addendum.patch, 92-addendum.patch, 94-addendum.patch, HBASE-5883-90.patch, HBASE-5883-92.patch, HBASE-5883-94.patch, HBASE-5883-trunk.patch, trunk-addendum.patch > > > The active master node network was down for some time (This node contains Master,DN,ZK,RS). Here backup node got > notification, and started to became active. Immedietly backup node got aborted with the below exception. > {noformat} > 2012-04-09 10:42:24,270 INFO org.apache.hadoop.hbase.master.SplitLogManager: finished splitting (more than or equal to) 861248320 bytes in 4 log files in [hdfs://192.168.47.205:9000/hbase/.logs/HOST-192-168-47-202,60020,1333715537172-splitting] in 26374ms > 2012-04-09 10:42:24,316 FATAL org.apache.hadoop.hbase.master.HMaster: Master server abort: loaded coprocessors are: [] > 2012-04-09 10:42:24,333 FATAL org.apache.hadoop.hbase.master.HMaster: Unhandled exception. Starting shutdown. > java.io.IOException: java.net.ConnectException: Connection refused > at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupIOstreams(HBaseClient.java:375) > at org.apache.hadoop.hbase.ipc.HBaseClient.getConnection(HBaseClient.java:1045) > at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:897) > at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:150) > at $Proxy13.getProtocolVersion(Unknown Source) > at org.apache.hadoop.hbase.ipc.WritableRpcEngine.getProxy(WritableRpcEngine.java:183) > at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:303) > at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:280) > at org.apache.hadoop.hbase.ipc.HBaseRPC.getProxy(HBaseRPC.java:332) > at org.apache.hadoop.hbase.ipc.HBaseRPC.waitForProxy(HBaseRPC.java:236) > at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getHRegionConnection(HConnectionManager.java:1276) > at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getHRegionConnection(HConnectionManager.java:1233) > at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getHRegionConnection(HConnectionManager.java:1220) > at org.apache.hadoop.hbase.catalog.CatalogTracker.getCachedConnection(CatalogTracker.java:569) > at org.apache.hadoop.hbase.catalog.CatalogTracker.getRootServerConnection(CatalogTracker.java:369) > at org.apache.hadoop.hbase.catalog.CatalogTracker.waitForRootServerConnection(CatalogTracker.java:353) > at org.apache.hadoop.hbase.catalog.CatalogTracker.verifyRootRegionLocation(CatalogTracker.java:660) > at org.apache.hadoop.hbase.master.HMaster.assignRootAndMeta(HMaster.java:616) > at org.apache.hadoop.hbase.master.HMaster.finishInitialization(HMaster.java:540) > at org.apache.hadoop.hbase.master.HMaster.run(HMaster.java:363) > at java.lang.Thread.run(Thread.java:662) > Caused by: java.net.ConnectException: Connection refused > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:567) > at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) > at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:488) > at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupConnection(HBaseClient.java:328) > at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.setupIOstreams(HBaseClient.java:362) > ... 20 more > 2012-04-09 10:42:24,336 INFO org.apache.hadoop.hbase.master.HMaster: Aborting > 2012-04-09 10:42:24,336 DEBUG org.apache.hadoop.hbase.master.HMaster: Stopping service threads > {noformat} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira