Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 50226 invoked from network); 26 Jun 2009 13:43:20 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 26 Jun 2009 13:43:20 -0000 Received: (qmail 95410 invoked by uid 500); 26 Jun 2009 13:43:30 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 95340 invoked by uid 500); 26 Jun 2009 13:43:30 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 95223 invoked by uid 99); 26 Jun 2009 13:43:30 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Jun 2009 13:43:30 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Jun 2009 13:43:27 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 9CB61234C1EB for ; Fri, 26 Jun 2009 06:43:07 -0700 (PDT) Message-ID: <157688291.1246023787641.JavaMail.jira@brutus> Date: Fri, 26 Jun 2009 06:43:07 -0700 (PDT) From: "Steve Loughran (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-6107) Have some log messages designed for machine parsing, either real-time or post-mortem In-Reply-To: <420434294.1245934627597.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-6107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12724536#action_12724536 ] Steve Loughran commented on HADOOP-6107: ---------------------------------------- sounds like you need a way of having different plug-ins for monitoring, or a thrift back-end to log4j > Have some log messages designed for machine parsing, either real-time or post-mortem > ------------------------------------------------------------------------------------ > > Key: HADOOP-6107 > URL: https://issues.apache.org/jira/browse/HADOOP-6107 > Project: Hadoop Common > Issue Type: Improvement > Affects Versions: 0.21.0 > Reporter: Steve Loughran > > Many programs take the log output of bits of Hadoop, and try and parse it. Some may also put their own back end behind commons-logging, to capture the input without going via Log4J, so as to keep the output more machine-readable. > These programs need log messages that > # are easy to parse by a regexp or other simple string parse (consider quoting values, etc) > # push out the full exception chain rather than stringify() bits of it > # stay stable across versions > # log the things the tools need to analyse: events, data volumes, errors > For these logging tools, ease of parsing, retention of data and stability over time take the edge over readability. In HADOOP-5073, Jiaqi Tan proposed marking some of the existing log events as evolving towards stability. As someone who regulary patches log messages to improve diagnostics, this creates a conflict of interest. For me, good logs are ones that help people debug their problems without anyone else helping, and if that means improving the text, so be it. Tools like Chukwa have a different need. > What to do? Some options > # Have some messages that are designed purely for other programs to handle > # Have some logs specifically for machines, to which we log alongside the human-centric messages > # Fix many of the common messages, then leave them alone. > # Mark log messages to be left alone (somehow) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.