hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jean-Daniel Cryans (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-9267) StochasticLoadBalancer goes over its processing time limit
Date Wed, 21 Aug 2013 16:50:52 GMT

    [ https://issues.apache.org/jira/browse/HBASE-9267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13746553#comment-13746553
] 

Jean-Daniel Cryans commented on HBASE-9267:
-------------------------------------------

Yesterday I was playing more with v2 and v3 and although I don't see the sublist issue anymore,
the time it takes to balance always goes up until it reaches 60 seconds. The more I think
about it the less I like it... there's no way to kill the balancer while it's running (AFAIK)
and it blocks a couple of other things like HBCK. We could discuss this on dev@ or another
jira though.

So I'm still +1 on the patch.
                
> StochasticLoadBalancer goes over its processing time limit
> ----------------------------------------------------------
>
>                 Key: HBASE-9267
>                 URL: https://issues.apache.org/jira/browse/HBASE-9267
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.95.2
>            Reporter: Jean-Daniel Cryans
>            Assignee: Elliott Clark
>             Fix For: 0.98.0, 0.95.3
>
>         Attachments: HBASE-9267-0.patch, HBASE-9267-1.patch, HBASE-9267-2.patch, HBASE-9267-3.patch,
HBASE-9267-4.patch
>
>
> I trying out 0.95.2, I left it running over the weekend (8 RS, average load between 12
and 3 regions) and right now the balancer runs for 12 mins:
> bq. 2013-08-19 21:54:45,534 DEBUG [jdec2hbase0403-1.vpc.cloudera.com,60000,1376689696384-BalancerChore]
org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer: Could not find a better load
balance plan.  Tried 0 different configurations in 777309ms, and did not find anything with
a computed cost less than 36.32576937689094
> It seems it slowly crept up there, yesterday it was doing:
> bq. 2013-08-18 20:53:17,232 DEBUG [jdec2hbase0403-1.vpc.cloudera.com,60000,1376689696384-BalancerChore]
org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer: Could not find a better load
balance plan.  Tried 0 different configurations in 257374ms, and did not find anything with
a computed cost less than 36.3251082542424
> And originally it was doing 1 minute.
> In the jstack I see a 1000 of these and jstack doesn't want to show me the whole thing:
> bq.  at java.util.SubList$1.nextIndex(AbstractList.java:713)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message