Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 42810CB83 for ; Tue, 23 Jul 2013 11:18:54 +0000 (UTC) Received: (qmail 35477 invoked by uid 500); 23 Jul 2013 11:18:52 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 35400 invoked by uid 500); 23 Jul 2013 11:18:52 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 34844 invoked by uid 99); 23 Jul 2013 11:18:51 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 23 Jul 2013 11:18:51 +0000 Date: Tue, 23 Jul 2013 11:18:51 +0000 (UTC) From: "Y. SREENIVASULU REDDY (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-9026) RestartRsHoldingRoot action in org.apache.hadoop.hbase.util.ChaosMonkey restarting the server holding .META. instead of -ROOT- MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-9026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13716313#comment-13716313 ] Y. SREENIVASULU REDDY commented on HBASE-9026: ---------------------------------------------- thanks for review, rajesh. > RestartRsHoldingRoot action in org.apache.hadoop.hbase.util.ChaosMonkey restarting the server holding .META. instead of -ROOT- > ------------------------------------------------------------------------------------------------------------------------------ > > Key: HBASE-9026 > URL: https://issues.apache.org/jira/browse/HBASE-9026 > Project: HBase > Issue Type: Bug > Affects Versions: 0.94.8 > Reporter: Y. SREENIVASULU REDDY > Priority: Minor > Fix For: 0.94.11 > > Attachments: ChaosMonkey.java.patch > > > In ChaosMonkey instead of restarting Root holded regionServer it is restarting META holded regionServer. > {code} > public static class RestartRsHoldingRoot extends RestartRandomRs { > public RestartRsHoldingRoot(long sleepTime) { > super(sleepTime); > } > @Override > void perform() throws Exception { > LOG.info("Performing action: Restart region server holding ROOT"); > ServerName server = cluster.getServerHoldingMeta(); > if (server == null) { > LOG.warn("No server is holding -ROOT- right now."); > return; > } > restartRs(server, sleepTime); > } > } > {code} > {noformat} > 13/07/23 17:03:54 INFO util.ChaosMonkey: Performing action: Restart region server holding ROOT > 13/07/23 17:03:54 DEBUG client.HConnectionManager$HConnectionImplementation: Looked up root region location, connection=org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@52b57e9a; serverName=ocean06,60020,1374569995361 > 13/07/23 17:03:54 DEBUG client.HConnectionManager$HConnectionImplementation: Removed .META.,,1.1028785192 for tableName=.META. from cache because of > 13/07/23 17:03:54 DEBUG client.HConnectionManager$HConnectionImplementation: Cached location for .META.,,1.1028785192 is ocean06:60020 > 13/07/23 17:03:54 INFO util.ChaosMonkey: Killing region server:ocean06,60020,1374569995361 > 13/07/23 17:03:54 INFO hbase.HBaseCluster: Aborting RS: ocean06,60020,1374569995361 > 13/07/23 17:03:54 INFO hbase.ClusterManager: Executing remote command: ps ux | grep regionserver | grep hbase | grep -v grep | tr -s ' ' | cut -d ' ' -f2 | xargs kill -s SIGKILL , hostname:ocean06 > 13/07/23 17:03:54 INFO util.Shell: Executing full command [/usr/bin/ssh ocean06 "ps ux | grep regionserver | grep hbase | grep -v grep | tr -s ' ' | cut -d ' ' -f2 | xargs kill -s SIGKILL"] > 13/07/23 17:03:54 INFO hbase.ClusterManager: Executed remote command, exit code:0 , output: > 13/07/23 17:03:54 INFO hbase.HBaseCluster: Waiting service:regionserver to stop: ocean06,60020,1374569995361 > 13/07/23 17:03:54 INFO hbase.ClusterManager: Executing remote command: ps ux | grep regionserver | grep hbase | grep -v grep | tr -s ' ' | cut -d ' ' -f2 , hostname:ocean06 > 13/07/23 17:03:54 INFO util.Shell: Executing full command [/usr/bin/ssh ocean06 "ps ux | grep regionserver | grep hbase | grep -v grep | tr -s ' ' | cut -d ' ' -f2"] > 13/07/23 17:03:55 INFO hbase.ClusterManager: Executed remote command, exit code:0 , output: > 13/07/23 17:03:55 INFO util.ChaosMonkey: Killed region server:ocean06,60020,1374569995361. Reported num of rs:2 > {noformat} > This is only in 0.94.X -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira