Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id BE90510CBE for ; Mon, 24 Jun 2013 11:56:21 +0000 (UTC) Received: (qmail 6124 invoked by uid 500); 24 Jun 2013 11:56:21 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 6096 invoked by uid 500); 24 Jun 2013 11:56:21 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 5890 invoked by uid 99); 24 Jun 2013 11:56:21 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 24 Jun 2013 11:56:21 +0000 Date: Mon, 24 Jun 2013 11:56:20 +0000 (UTC) From: "Hudson (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-8783) RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-8783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13691903#comment-13691903 ] Hudson commented on HBASE-8783: ------------------------------- Integrated in hbase-0.95-on-hadoop2 #146 (See [https://builds.apache.org/job/hbase-0.95-on-hadoop2/146/]) HBASE-8783 RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name (Revision 1495947) Result = FAILURE mbertozzi : Files : * /hbase/branches/0.95/hbase-server/src/main/java/org/apache/hadoop/hbase/procedure/ProcedureMemberRpcs.java * /hbase/branches/0.95/hbase-server/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureCoordinatorRpcs.java * /hbase/branches/0.95/hbase-server/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureMemberRpcs.java * /hbase/branches/0.95/hbase-server/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureUtil.java * /hbase/branches/0.95/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/snapshot/RegionServerSnapshotManager.java * /hbase/branches/0.95/hbase-server/src/test/java/org/apache/hadoop/hbase/procedure/TestZKProcedure.java * /hbase/branches/0.95/hbase-server/src/test/java/org/apache/hadoop/hbase/procedure/TestZKProcedureControllers.java > RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name > ------------------------------------------------------------------------------------- > > Key: HBASE-8783 > URL: https://issues.apache.org/jira/browse/HBASE-8783 > Project: HBase > Issue Type: Bug > Components: snapshots > Affects Versions: 0.94.8, 0.95.1 > Reporter: Matteo Bertozzi > Assignee: Matteo Bertozzi > Priority: Minor > Fix For: 0.98.0, 0.95.2, 0.94.9 > > Attachments: HBASE-8783-0.94-v0.patch, HBASE-8783-0.94-v1.patch, HBASE-8783-v0.patch, HBASE-8783-v1.patch > > > The ZKProcedureMemberRpcs of the RegionServerSnapshotManager may be initialized with the wrong memberName. > {code} > 2013-06-21 05:03:41,732 DEBUG org.apache.hadoop.hbase.regionserver.HRegionServer: Initialize Snapshot Manager > ... > 2013-06-21 05:03:41,875 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us hostname to use. Was=0.0.0.0, Now=srv-5.test.cloudera.com > {code} > The Region Server Name is used as memberName, but since the snapshot manger is initialized before the RS receives the server name used by the master, the zkprocedure will use the wrong name (0.0.0.0). > This will case the snapshot to fail with a TimeoutException since the master will not receive the expected RS > {code} > Master: > ZKProcedureCoordinatorRpcs: Watching for acquire node:/hbase/online-snapshot/acquired/foo23/srv-5.test.cloudera.com,60020,1371813451915 > RS: > ZKProcedureMemberRpcs: Member: '0.0.0.0,60020,1371814996779' joining acquired barrier for procedure (foo23) in zk > ... > org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout elapsed! Source:Timeout caused Foreign Exception Start:1371798732141, End:1371798792141, diff:60000, max:60000 ms > {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira