Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@locus.apache.org Received: (qmail 7712 invoked from network); 25 Nov 2008 21:59:35 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 25 Nov 2008 21:59:35 -0000 Received: (qmail 71843 invoked by uid 500); 25 Nov 2008 21:59:46 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 71835 invoked by uid 500); 25 Nov 2008 21:59:46 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 71609 invoked by uid 99); 25 Nov 2008 21:59:45 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Nov 2008 13:59:45 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 25 Nov 2008 21:58:28 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 502C2234C29B for ; Tue, 25 Nov 2008 13:58:44 -0800 (PST) Message-ID: <777604625.1227650324327.JavaMail.jira@brutus> Date: Tue, 25 Nov 2008 13:58:44 -0800 (PST) From: "stack (JIRA)" To: hbase-dev@hadoop.apache.org Subject: [jira] Commented: (HBASE-1018) Regionservers should report detailed health to master; master should flag troubled regionservers in UI In-Reply-To: <1802291055.1227381164401.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-1018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650762#action_12650762 ] stack commented on HBASE-1018: ------------------------------ +1 on removing TTS (I've been burned by its sillyness too often to admit). One thought I had -- don't have to do it here, we can do it later -- is that the heartbeat should carry the data portion of RegionServerMetrics .... or you'd stuff a RSM into a ServerLoad and it would know how to iterate over all Metrics in RSM and extract current value. We'd do this so just adding to the RSM would have the new metric show master-side. Otherwise, +1 on this patch as is. > Regionservers should report detailed health to master; master should flag troubled regionservers in UI > ------------------------------------------------------------------------------------------------------ > > Key: HBASE-1018 > URL: https://issues.apache.org/jira/browse/HBASE-1018 > Project: Hadoop HBase > Issue Type: Improvement > Components: master, regionserver > Reporter: Andrew Purtell > Assignee: Andrew Purtell > Fix For: 0.19.0 > > Attachments: 1018.patch, 1018.patch > > > Regionservers should report detailed health to master. The master should flag troubled regionservers in the UI. > The concern at the moment is primarily heap. Regionservers should report used, committed, and max heap metrics in the periodic report. The master should flag in the regionserver list on /master.jsp those regionservers where available heap is below a configurable threshold. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.