hbase-issues mailing list archives

From "Loknath Priyatham Teja Singamsetty (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-15924) Enhance hbase services autorestart capability to hbase-daemon.sh
Date Thu, 22 Sep 2016 14:32:20 GMT

    [ https://issues.apache.org/jira/browse/HBASE-15924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15513446#comment-15513446 ]

Loknath Priyatham Teja Singamsetty  commented on HBASE-15924:

[~apurtell] [~larsh] [~lhofhansl] [~giacomotaylor]

Please review the submitted changes, which make autorestart configurable.
Please note that this change also modifies some of the helper scripts under "bin"
to fix bugs identified while testing the autostart functionality in local, pseudo-distributed,
and distributed modes.

Release notes:
HBase services can now be started with the "autostart/autorestart" feature enabled in a controlled
fashion: "--autostart-window-size" defines the window period, and "--autostart-window-retry-limit"
defines the number of times the hbase services are restarted after being killed or terminated
abnormally within that window period.

The following cases are supported with "autostart/autorestart":

a) --autostart-window-size=0 and --autostart-window-retry-limit=0 indicate an infinite window
size and no retry limit.
b) Not providing the args defaults to a).
c) --autostart-window-size=0 and --autostart-window-retry-limit=<positive value> make
the autostart process bail out once the retry limit is exceeded, irrespective of the window period.
d) --autostart-window-size=<x> and --autostart-window-retry-limit=<y> make
the autostart process bail out once the retry limit "y" is exceeded within the last window period.
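
The four cases above can be read as a single decision rule. The following is only a rough
sketch of that rule in Python, not the shell code from the attached patch. The function name
should_restart, the use of seconds as the unit, the reading of "exceeded" as "at most <limit>
restarts are allowed", and the handling of a positive window size with a retry limit of 0 are
all assumptions made for illustration.

```python
def should_restart(restart_times, now, window_size=0, retry_limit=0):
    """Return True if a service that died abnormally may be restarted.

    restart_times: timestamps (assumed to be seconds) of earlier automatic restarts.
    now:           current timestamp in the same unit.
    window_size:   0 means an infinite window (cases a, b and c).
    retry_limit:   0 means no retry limit (cases a and b).
    """
    if retry_limit == 0:
        # Cases a) and b): no retry limit, so always restart.
        # Assumption: a positive window_size with retry_limit=0 is also
        # treated as "no limit"; the release notes do not cover it.
        return True

    if window_size == 0:
        # Case c): infinite window, so every earlier restart counts.
        recent = len(restart_times)
    else:
        # Case d): only restarts inside the last window period count.
        recent = sum(1 for t in restart_times if now - t < window_size)

    # Assumption: bail out once the limit has been reached.
    return recent < retry_limit
```

For example, with a window size of 60 and a retry limit of 3, three restarts at t=50, 60 and 70
make should_restart([50, 60, 70], 100, 60, 3) return False, while the same number of restarts
that all happened more than 60 seconds before "now" no longer count against the limit.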


> Enhance hbase services autorestart capability to hbase-daemon.sh 
> -----------------------------------------------------------------
>                 Key: HBASE-15924
>                 URL: https://issues.apache.org/jira/browse/HBASE-15924
>             Project: HBase
>          Issue Type: Improvement
>    Affects Versions: 0.98.19
>            Reporter: Loknath Priyatham Teja Singamsetty 
>            Assignee: Loknath Priyatham Teja Singamsetty 
>             Fix For: 0.98.23
>         Attachments: HBASE-15924.master.0001.patch
> As part of HBASE-5939, autorestart for hbase services was added to deal with
> scenarios where hbase services (master/regionserver/master-backup) get killed or go down,
> leading to unplanned outages. The changes were made to hbase-daemon.sh to support autorestart.
> However, the autorestart implementation doesn't work in standalone mode and, other than
> that, has a few gaps relative to the release notes of HBASE-5939. Here is an
> attempt to re-design and fix the functionality considering all possible use cases with hbase
> service operations.
> Release Notes of HBASE-5939:
> ------------------------------------------
> When launched with autorestart, HBase processes will automatically restart if they are
> not properly terminated, either by a "stop" command or by a cluster stop. To ensure that it
> does not overload the system when the server itself is corrupted and the process cannot be
> restarted, the server sleeps for 5 minutes before restarting if it was already started 5 minutes
> ago previously. To use it, launch the process with "bin/start-hbase autorestart". This option
> is not fully compatible with the existing "restart" command: if you ask for a restart on a
> server launched with autorestart, the server will restart but the next server instance won't
> be automatically restarted.

This message was sent by Atlassian JIRA
