Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@minotaur.apache.org Received: (qmail 41425 invoked from network); 3 Feb 2010 00:09:42 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 3 Feb 2010 00:09:42 -0000 Received: (qmail 69226 invoked by uid 500); 3 Feb 2010 00:09:42 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 69184 invoked by uid 500); 3 Feb 2010 00:09:42 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 69174 invoked by uid 99); 3 Feb 2010 00:09:42 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 03 Feb 2010 00:09:42 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 03 Feb 2010 00:09:39 +0000 Received: from brutus.apache.org (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id CED3B234C4AC for ; Tue, 2 Feb 2010 16:09:18 -0800 (PST) Message-ID: <60494795.40301265155758846.JavaMail.jira@brutus.apache.org> Date: Wed, 3 Feb 2010 00:09:18 +0000 (UTC) From: "ryan rawson (JIRA)" To: hbase-dev@hadoop.apache.org Subject: [jira] Commented: (HBASE-1956) Export HDFS read and write latency as a metric In-Reply-To: <281663005.1257412592362.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12828852#action_12828852 ] ryan rawson commented on HBASE-1956: ------------------------------------ I'm using the file context and I don't seem to get the count reset. So the average is over all time, not on a heartbeat interval (10s in the config). Is this expected? According to docs on the 'net about volatile, it is not atomic, so maybe im always seeing a race condition and the counter is never reset. > Export HDFS read and write latency as a metric > ---------------------------------------------- > > Key: HBASE-1956 > URL: https://issues.apache.org/jira/browse/HBASE-1956 > Project: Hadoop HBase > Issue Type: Improvement > Reporter: Andrew Purtell > Assignee: Andrew Purtell > Priority: Minor > Fix For: 0.20.3, 0.21.0 > > Attachments: HBASE-1956.patch, HBASE-1956.patch > > > HDFS write latency spikes especially are an indicator of general cluster overloading. We see this where the WAL writer complains about writes taking > 1 second, sometimes > 4, etc. If for example the average write latency over the monitoring period is exported as a metric, then this can feed into alerting for or automatic provisioning of additional cluster hardware. While we're at it, export read side metrics as well. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.