From: "Allen Wittenauer (JIRA)"
To: common-issues@hadoop.apache.org
Reply-To: common-issues@hadoop.apache.org
Date: Tue, 13 Jul 2010 18:36:50 -0400 (EDT)
Subject: [jira] Commented: (HADOOP-6858) Enable rotateable JVM garbage collection logs for Hadoop daemons
Message-ID: <9668808.361641279060610494.JavaMail.jira@thor>
In-Reply-To: <10952334.359241279055269951.JavaMail.jira@thor>
Mailing-List: contact common-issues-help@hadoop.apache.org; run by ezmlm

[
https://issues.apache.org/jira/browse/HADOOP-6858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12888061#action_12888061 ]

Allen Wittenauer commented on HADOOP-6858:
------------------------------------------

We just put a date on our logs, i.e., gc.log.`date +blahblah`.

> Enable rotateable JVM garbage collection logs for Hadoop daemons
> ----------------------------------------------------------------
>
>                 Key: HADOOP-6858
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6858
>             Project: Hadoop Common
>          Issue Type: New Feature
>          Components: scripts
>    Affects Versions: 0.22.0
>            Reporter: Andrew Ryan
>         Attachments: HADOOP-6858.patch
>
>
> The purpose of this enhancement is to make it easier to collect garbage collection logs and ensure that they persist across restarts in the same way that the standard output files of Hadoop daemon JVMs currently do.
> Garbage collection logs are a vital debugging tool for administrators and developers. In our production environments, at some point or another, every single type of Hadoop daemon has OOM'ed or experienced other significant issues related to GC and/or lack of heap memory. For the longest time, we have enabled garbage collection logs in our HADOOP_NAMENODE_OPTS, HADOOP_JOBTRACKER_OPTS, etc. by using options like "-XX:+PrintGCDateStamps -XX:+PrintGCDetails -Xloggc:$HADOOP_LOG_DIR/jobtracker.gc.log".
> Unfortunately, these logs don't survive a restart of the node, so if a node OOMs and then is restarted automatically, or manually by someone who is unaware, we lose the GC logs forever. We also have to manually add GC log options to each daemon. This patch:
> 1) Creates a single, optional, off-by-default parameter for specifying GC logging.
> 2) If that parameter is set, automatically enables GC logging for all daemons in the cluster. The parameter is flexible enough to allow for the different ways various vendors' JVMs require garbage collection logging to be specified.
> 3) If GC logging is on, ensures that the GC log files for each daemon are rotated with up to 5 copies kept, the same as the .out files currently.
> We are currently running a variation of this patch in our 0.20 install. This patch actually includes changes to common, mapred, and hdfs, so it obviously cannot be applied as-is, but is included here for review and comments.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
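
For illustration only, here is a minimal shell sketch of the two approaches discussed in this thread: numbered rotation that keeps up to 5 old copies (item 3 of the quoted description) and a date-stamped file name (the comment above). It is not taken from HADOOP-6858.patch. The function names rotate_gc_log and gc_log_name are hypothetical, and the date format is an arbitrary choice, since the comment does not specify one.

```shell
#!/usr/bin/env bash

# Hypothetical helper: numbered rotation of a log file before a daemon starts.
# $1 = path to the log file, $2 = number of old copies to keep (default 5).
rotate_gc_log() {
    local log=$1
    local num=${2:-5}
    if [ -f "$log" ]; then
        # Shift log.(n-1) to log.n, starting from the oldest copy.
        while [ "$num" -gt 1 ]; do
            local prev=$((num - 1))
            if [ -f "$log.$prev" ]; then
                mv "$log.$prev" "$log.$num"
            fi
            num=$prev
        done
        # num is 1 here: the current log becomes log.1.
        mv "$log" "$log.$num"
    fi
}

# Hypothetical helper: date-stamped GC log name, so a restart never reuses a file.
# $1 = log directory, $2 = daemon name. The date format is an assumption.
gc_log_name() {
    printf '%s/%s.gc.log.%s\n' "$1" "$2" "$(date +%Y%m%d-%H%M%S)"
}

# Example wiring (commented out; the GC flags are the ones quoted in the issue):
# rotate_gc_log "$HADOOP_LOG_DIR/jobtracker.gc.log" 5
# HADOOP_JOBTRACKER_OPTS="$HADOOP_JOBTRACKER_OPTS -XX:+PrintGCDateStamps -XX:+PrintGCDetails -Xloggc:$HADOOP_LOG_DIR/jobtracker.gc.log"
```

Numbered rotation bounds disk usage at a fixed number of files per daemon. Date-stamped names are simpler to set up but need separate cleanup of old files.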