Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 1B6FB200D09 for ; Tue, 12 Sep 2017 23:21:07 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 19D941609B4; Tue, 12 Sep 2017 21:21:07 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 5F34D1609C7 for ; Tue, 12 Sep 2017 23:21:06 +0200 (CEST) Received: (qmail 9382 invoked by uid 500); 12 Sep 2017 21:21:05 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 9370 invoked by uid 99); 12 Sep 2017 21:21:05 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Sep 2017 21:21:05 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id DE3B918553B for ; Tue, 12 Sep 2017 21:21:04 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.202 X-Spam-Level: X-Spam-Status: No, score=-99.202 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id ZqGa65032Ik3 for ; Tue, 12 Sep 2017 21:21:03 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 09DB65FB2E for ; Tue, 12 Sep 2017 21:21:03 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 842A5E0F0D for ; Tue, 12 Sep 2017 21:21:01 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id A922625397 for ; Tue, 12 Sep 2017 21:21:00 +0000 (UTC) Date: Tue, 12 Sep 2017 21:21:00 +0000 (UTC) From: "Hudson (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-18131) Add an hbase shell command to clear deadserver list in ServerManager MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 12 Sep 2017 21:21:07 -0000 [ https://issues.apache.org/jira/browse/HBASE-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16163676#comment-16163676 ] Hudson commented on HBASE-18131: -------------------------------- FAILURE: Integrated in Jenkins build HBase-Trunk_matrix #3704 (See [https://builds.apache.org/job/HBase-Trunk_matrix/3704/]) HBASE-18131 Add an hbase shell command to clear deadserver list in (tedyu: rev cfdbdd2066bf977c951b74b31c0e41dce152f9ef) * (edit) hbase-server/src/main/java/org/apache/hadoop/hbase/security/access/AccessController.java * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/Admin.java * (edit) hbase-shell/src/main/ruby/shell.rb * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/shaded/protobuf/ProtobufUtil.java * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/AsyncAdmin.java * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/AsyncHBaseAdmin.java * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/shaded/protobuf/RequestConverter.java * (edit) hbase-server/src/main/java/org/apache/hadoop/hbase/coprocessor/MasterObserver.java * (edit) hbase-server/src/main/java/org/apache/hadoop/hbase/master/MasterCoprocessorHost.java * (add) hbase-shell/src/main/ruby/shell/commands/list_deadservers.rb * (edit) hbase-server/src/main/java/org/apache/hadoop/hbase/master/DeadServer.java * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/HBaseAdmin.java * (edit) hbase-server/src/test/java/org/apache/hadoop/hbase/master/TestDeadServer.java * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/RawAsyncHBaseAdmin.java * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/ConnectionImplementation.java * (edit) hbase-protocol-shaded/src/main/protobuf/Master.proto * (add) hbase-shell/src/main/ruby/shell/commands/clear_deadservers.rb * (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/ShortCircuitMasterConnection.java * (edit) hbase-server/src/main/java/org/apache/hadoop/hbase/master/MasterRpcServices.java * (edit) hbase-shell/src/main/ruby/hbase/admin.rb > Add an hbase shell command to clear deadserver list in ServerManager > -------------------------------------------------------------------- > > Key: HBASE-18131 > URL: https://issues.apache.org/jira/browse/HBASE-18131 > Project: HBase > Issue Type: New Feature > Components: Operability > Reporter: Yu Li > Assignee: Guangxu Cheng > Fix For: 2.0.0, 1.4.0 > > Attachments: HBASE-18131.branch-1.v1.patch, HBASE-18131.master.v1.patch, HBASE-18131.master.v2.patch, HBASE-18131.master.v3.patch, HBASE-18131.master.v4.patch, HBASE-18131.master.v5.patch, HBASE-18131.master.v6.patch, HBASE-18131.master.v6.patch, HBASE-18131.master.v7.patch, HBASE-18131.patch > > > Currently if a regionserver is aborted due to fatal error or stopped by operator on purpose, it will be added into {{ServerManager#deadservers}} list and shown as "Dead Servers" in the master UI. This is a valid warn for operators to notice the self-aborted servers and give a sanity check to avoid further issues. However, after necessary checks, even if operator is sure that the node is decommissioned (such as for repair), there's no way to clear the dead server list except restarting master. See more details in [this discussion|http://mail-archives.apache.org/mod_mbox/hbase-user/201705.mbox/%3CCAM7-19%2BD4MLu2b1R94%2BtWQDspjfny2sCy4Qit8JtCgjvTOZzzg%40mail.gmail.com%3E] in mail list > Here we propose to add a hbase shell command to allow clearing dead server list in {{ServerManager}} for advanced users, and the command should be executed with caution. -- This message was sent by Atlassian JIRA (v6.4.14#64029)