hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-3071) Graceful decommissioning of a regionserver
Date Wed, 30 Mar 2011 21:47:06 GMT

     [ https://issues.apache.org/jira/browse/HBASE-3071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

stack updated HBASE-3071:

    Attachment: 3701-v2.txt

Addressed J-D issues.

Changed names of scripts and how they run.  now there is a graceful_stop.sh script that manages
running of the region_mover.rb script and subsequent remote shutdown.  graceful_stop.sh takes
flags to restart the node subsequently and then another reload flag which will put back the
old region set on the just-started node.

I played trying to add the load/unload region script to hbase-daemon.sh so we could do stuff
like ./bin/hbase-daemons.sh unload regionserver but that gets messy in bash.  I already had
to add flag to bin/hbase to optionally not run java with an exec.

Testing on cluster seems to basically work.   Going to try with a cluster under load next.

> Graceful decommissioning of a regionserver
> ------------------------------------------
>                 Key: HBASE-3071
>                 URL: https://issues.apache.org/jira/browse/HBASE-3071
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: stack
>         Attachments: 3071.txt, 3701-v2.txt
> Currently if you stop a regionserver nicely, it'll put up its stopping flag and then
close all hosted regions.  While the stopping flag is in place all region requests are rejected.
 If this server was under load, closing could take a while.  Only after all is closed is the
master informed and it'll restart assigning (in old master, master woud get a report with
list of all regions closed, in new master the zk expired is triggered and we'll run shutdown
> At least in new master, we have means of disabling balancer, and then moving the regions
off the server one by one via HBaseAdmin methods -- we shoud write a script to do this at
least for rolling restarts -- but we need something better.

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message