hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Andrew Purtell (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-1583) Start/Stop of large cluster untenable
Date Thu, 16 Jul 2009 21:23:15 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732170#action_12732170
] 

Andrew Purtell commented on HBASE-1583:
---------------------------------------

Ack on safe mode having oddities. I brought down a cluster with 133 regions cleanly and restarted
it just now. Right away ~128 regions were assigned out. The rest were assigned out a few minutes
later. Invoking 'enable' on a incompletely assigned table prodded the master into some action
but did not bring things up all the way as this kludge has done in the past. 

> Start/Stop of large cluster untenable
> -------------------------------------
>
>                 Key: HBASE-1583
>                 URL: https://issues.apache.org/jira/browse/HBASE-1583
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Assignee: stack
>             Fix For: 0.20.0
>
>         Attachments: 1583-nocompactonclose.patch, 1583-v2-nocompactonopenclose.patch
>
>
> Starting and stopping a loaded large cluster is way too flakey and takes too long.  This
is 0.19.x but same issues apply to TRUNK I'd say.
> At pset with our > 100 nodes carrying 6k regions:
> + shutdown takes way too long.... maybe ten minutes or so.  We compact regions inline
with shutdown.  We should just go down.  It doesn't seem like all regionservers go down everytime
either.
> + startup is a mess with our assigning out regions an rebalancing at same time.  By time
that the compactions on open run, it can be near an hour before whole thing settles down and
becomes useable

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message