hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Billy Pearson (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-1583) Start/Stop of large cluster untenable
Date Mon, 29 Jun 2009 15:34:47 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12725218#action_12725218
] 

Billy Pearson commented on HBASE-1583:
--------------------------------------

I suggested that we do not do come out of safe mode until all regions have been assigned when
we added safe mode and make the regions not run compactions while in safe mode I thank that
would be an easy fix for this problem
I have seen the same thing when you have region that are behind on compactions after a shutdown
on start up compaction tie up reassignments.

Billy


> Start/Stop of large cluster untenable
> -------------------------------------
>
>                 Key: HBASE-1583
>                 URL: https://issues.apache.org/jira/browse/HBASE-1583
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>             Fix For: 0.20.0
>
>
> Starting and stopping a loaded large cluster is way too flakey and takes too long.  This
is 0.19.x but same issues apply to TRUNK I'd say.
> At pset with our > 100 nodes carrying 6k regions:
> + shutdown takes way too long.... maybe ten minutes or so.  We compact regions inline
with shutdown.  We should just go down.  It doesn't seem like all regionservers go down everytime
either.
> + startup is a mess with our assigning out regions an rebalancing at same time.  By time
that the compactions on open run, it can be near an hour before whole thing settles down and
becomes useable

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message