Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 93F3818110 for ; Wed, 24 Feb 2016 16:48:18 +0000 (UTC) Received: (qmail 78524 invoked by uid 500); 24 Feb 2016 16:48:18 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 78473 invoked by uid 500); 24 Feb 2016 16:48:18 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 78419 invoked by uid 99); 24 Feb 2016 16:48:18 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Feb 2016 16:48:18 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 2A36D2C1F5D for ; Wed, 24 Feb 2016 16:48:18 +0000 (UTC) Date: Wed, 24 Feb 2016 16:48:18 +0000 (UTC) From: "Yong Zhang (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-15318) Zk-less region server state management MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-15318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15163304#comment-15163304 ] Yong Zhang commented on HBASE-15318: ------------------------------------ Thanks [~mbertozzi] for explain. bq. So, the ephemeral nodes the RS registers on startup Yes, RS will send a heartbeat to HM. bq. When you say ..."but we find many user network issue is not network disconnected but package lost", is it that the RS is dead/lost (but its zk connection is fine?) Here just describe one case, that when RS start, it will create one znode on zk, but some time later, network package has 10% lost for example, because connection from this RS to ZK is not break then, HM also consider this RS is health, but in fact this RS may could not provide service. > Zk-less region server state management > -------------------------------------- > > Key: HBASE-15318 > URL: https://issues.apache.org/jira/browse/HBASE-15318 > Project: HBase > Issue Type: Improvement > Reporter: Yong Zhang > Assignee: Yong Zhang > > Current region server state is managed via znode created by region server, master just listen these nodes. but we find many user network issue is not network disconnected but package lost, which is hard to capture because connection between region server and zk is fine. > This jira goal is region server state is managed by master without shared info in zk, via enhancement heartbeat from region server to master. -- This message was sent by Atlassian JIRA (v6.3.4#6332)