Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 464EE100AB for ; Mon, 19 Aug 2013 23:56:52 +0000 (UTC) Received: (qmail 41676 invoked by uid 500); 19 Aug 2013 23:56:51 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 41643 invoked by uid 500); 19 Aug 2013 23:56:51 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 41635 invoked by uid 99); 19 Aug 2013 23:56:51 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 19 Aug 2013 23:56:51 +0000 Date: Mon, 19 Aug 2013 23:56:51 +0000 (UTC) From: "Elliott Clark (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HBASE-9267) StochasticLoadBalancer goes over its processing time limit MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-9267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Elliott Clark updated HBASE-9267: --------------------------------- Attachment: HBASE-9267-1.patch Some formatting. > StochasticLoadBalancer goes over its processing time limit > ---------------------------------------------------------- > > Key: HBASE-9267 > URL: https://issues.apache.org/jira/browse/HBASE-9267 > Project: HBase > Issue Type: Bug > Affects Versions: 0.95.2 > Reporter: Jean-Daniel Cryans > Assignee: Elliott Clark > Fix For: 0.98.0, 0.95.3 > > Attachments: HBASE-9267-0.patch, HBASE-9267-1.patch > > > I trying out 0.95.2, I left it running over the weekend (8 RS, average load between 12 and 3 regions) and right now the balancer runs for 12 mins: > bq. 2013-08-19 21:54:45,534 DEBUG [jdec2hbase0403-1.vpc.cloudera.com,60000,1376689696384-BalancerChore] org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer: Could not find a better load balance plan. Tried 0 different configurations in 777309ms, and did not find anything with a computed cost less than 36.32576937689094 > It seems it slowly crept up there, yesterday it was doing: > bq. 2013-08-18 20:53:17,232 DEBUG [jdec2hbase0403-1.vpc.cloudera.com,60000,1376689696384-BalancerChore] org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer: Could not find a better load balance plan. Tried 0 different configurations in 257374ms, and did not find anything with a computed cost less than 36.3251082542424 > And originally it was doing 1 minute. > In the jstack I see a 1000 of these and jstack doesn't want to show me the whole thing: > bq. at java.util.SubList$1.nextIndex(AbstractList.java:713) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira