Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C90788730 for ; Sat, 10 Sep 2011 00:42:31 +0000 (UTC) Received: (qmail 99778 invoked by uid 500); 10 Sep 2011 00:42:31 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 99707 invoked by uid 500); 10 Sep 2011 00:42:31 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 99693 invoked by uid 99); 10 Sep 2011 00:42:30 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 10 Sep 2011 00:42:30 +0000 X-ASF-Spam-Status: No, hits=-2000.5 required=5.0 tests=ALL_TRUSTED,RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 10 Sep 2011 00:42:29 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id E4F583771C for ; Sat, 10 Sep 2011 00:42:08 +0000 (UTC) Date: Sat, 10 Sep 2011 00:42:08 +0000 (UTC) From: "Ming Ma (JIRA)" To: issues@hbase.apache.org Message-ID: <1344480628.10637.1315615328934.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] [Created] (HBASE-4366) dynamic metrics logging MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 dynamic metrics logging ----------------------- Key: HBASE-4366 URL: https://issues.apache.org/jira/browse/HBASE-4366 Project: HBase Issue Type: New Feature Components: metrics Reporter: Ming Ma Assignee: Ming Ma First, if there is existing solution for this, I would close this jira. Also I realize we already have various overlapping solutions; creating another solution isn't necessarily the best approach. However, I couldn't find anything that can meet the need. So open this jira for discussion. We have some scenarios in hbase/mapreduce/hdfs that requires logging large number of dynamic metrics. They can be used for troubleshooting, better measurement on the system and scorecard. For example, 1.HBase. Get metrics such as request per sec that are specific to a table, or column family. 2.Mapreduce Job history analysis. Would like to found out all the job ids that are submitted, completed, etc. in a specific time window. For troubleshooting, what people usually do today, 1) Use current machine-level metrics to find out which machine has the issue. 2) go to that machine, analysis the local log. The characteristics of such kind of metrics: 1.It isn't something that can be predefined. The key such as table name, job id is dynamic. 2.The number of such metrics could be much larger than what the current metrics framework can handle. 3.We don't have a scenario that require near real time query support, e.g., from the time the metrics is generated to the time it is available to query can be at like an hour. 4.How data is consumed is highly application specific. Some ideas: 1. Provide some interface for any application to log data. 2. The metrics can be written to log files. The log files or log entries will be loaded to HBase, or HDFS asynchronously. That could go to a separate cluster. 3. To consume such data, application could run map reduce job on the log files for aggregation, or do random read directly from HBase. Comments? -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira