hbase-issues mailing list archives

From "Enis Soztutar (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-18131) Add an hbase shell command to clear deadserver list in ServerManager
Date Tue, 30 May 2017 17:44:04 GMT

    [ https://issues.apache.org/jira/browse/HBASE-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16029816#comment-16029816 ]

Enis Soztutar commented on HBASE-18131:
---------------------------------------

I think the root cause of this is not that servers stay in the dead-servers list indefinitely.
They will be removed from this list; however, due to HBASE-14223, the master thinks that the
server is dead-but-not-processed even after restart.

> Add an hbase shell command to clear deadserver list in ServerManager
> --------------------------------------------------------------------
>
>                 Key: HBASE-18131
>                 URL: https://issues.apache.org/jira/browse/HBASE-18131
>             Project: HBase
>          Issue Type: New Feature
>          Components: Operability
>            Reporter: Yu Li
>            Assignee: Yu Li
>             Fix For: 2.0.0, 1.4.0
>
>
> Currently, if a regionserver is aborted due to a fatal error or stopped by an operator on purpose,
it is added to the {{ServerManager#deadservers}} list and shown under "Dead Servers" in the
master UI. This is a valid warning that lets operators notice self-aborted servers and run
a sanity check to avoid further issues. However, after the necessary checks, even if the operator
is sure that the node is decommissioned (such as for repair), there is no way to clear the
dead server list except by restarting the master. See more details in [this discussion|http://mail-archives.apache.org/mod_mbox/hbase-user/201705.mbox/%3CCAM7-19%2BD4MLu2b1R94%2BtWQDspjfny2sCy4Qit8JtCgjvTOZzzg%40mail.gmail.com%3E]
on the mailing list.
> Here we propose adding an hbase shell command that lets advanced users clear the dead server list
in {{ServerManager}}. The command should be executed with caution.
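
The proposal can be pictured with a small toy model. This is only a sketch and not HBase code: the class `DeadServerList`, its methods, and the server name strings are hypothetical, chosen to illustrate a dead-server list that an operator can clear by name without restarting the master.

```python
class DeadServerList:
    """Toy stand-in for the master's dead-server bookkeeping (hypothetical)."""

    def __init__(self):
        self._dead = set()

    def add(self, server_name):
        # Record a regionserver that aborted or was stopped by an operator.
        self._dead.add(server_name)

    def list(self):
        # Sorted copy, similar in spirit to what a UI would display.
        return sorted(self._dead)

    def clear(self, server_names=None):
        """Remove the named servers (or all, if None); return names not found."""
        if server_names is None:
            self._dead.clear()
            return []
        not_found = [s for s in server_names if s not in self._dead]
        self._dead.difference_update(server_names)
        return not_found


dead = DeadServerList()
dead.add("rs1.example.com,16020,1496100000000")
dead.add("rs2.example.com,16020,1496100000001")

# The operator has verified that rs1 is decommissioned and clears it by name.
not_found = dead.clear(
    ["rs1.example.com,16020,1496100000000", "rs9.example.com,16020,1"]
)
```

Returning the names that were not found gives the operator feedback on typos, which fits the note that the command should be used with caution.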



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
