Date: Mon, 13 Jun 2011 20:22:51 +0000 (UTC)
From: "Travis Crawford (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Message-ID: <1533906232.3739.1307996571642.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <1441496444.3456.1307991051510.JavaMail.tomcat@hel.zones.apache.org>
Subject: [jira] [Updated] (MAPREDUCE-2589) TaskTracker not purging userlog directories

     [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Travis Crawford updated MAPREDUCE-2589:
---------------------------------------

    Attachment: cleanup_userlogs.py

We see this on our clusters too. Attached is a script that I run from cron to clean up old userlogs. The general idea is to set a high-water mark for userlog disk space and, once usage passes it, delete logs until usage drops below a low-water mark. Logs for running jobs are excluded from cleanup; this has occasionally caused issues, but in general the exclusion is worthwhile.

I'm posting this as an example of what the replacement might look like (as an internal periodic task, of course). Also, I'm not sure how the nextgen stuff deals with cleanup.

> TaskTracker not purging userlog directories
> -------------------------------------------
>
>                 Key: MAPREDUCE-2589
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2589
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: tasktracker
>    Affects Versions: 0.20.205.0
>         Environment: 0.20.205
>            Reporter: Sherry Chen
>            Assignee: Sherry Chen
>            Priority: Minor
>         Attachments: cleanup_userlogs.py
>
>
> UserLogCleaner is not robust. Leftover userlogs after a restart sometimes have to be manually
> cleaned. Things can accumulate over a period of time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
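
For readers without access to the attachment, the high/low water mark idea described in the comment can be sketched roughly as follows. This is not the attached cleanup_userlogs.py; it is a minimal illustration under several assumptions: the userlog root holds one subdirectory per job, the oldest directories (by modification time) are deleted first, and the caller supplies the names of running jobs. The names `dir_size`, `cleanup_userlogs`, `high_water`, `low_water`, and `running` are hypothetical.

```python
import os
import shutil


def dir_size(path):
    """Return the total size in bytes of regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            if os.path.isfile(full) and not os.path.islink(full):
                total += os.path.getsize(full)
    return total


def cleanup_userlogs(userlog_root, high_water, low_water, running=frozenset()):
    """Delete log directories once usage exceeds high_water bytes.

    Deletion proceeds oldest-first (an assumption of this sketch) and stops
    as soon as usage is at or below low_water bytes. Directories whose names
    appear in `running` are never deleted. Returns the deleted names.
    """
    entries = []
    for name in os.listdir(userlog_root):
        path = os.path.join(userlog_root, name)
        if os.path.isdir(path) and not os.path.islink(path):
            entries.append((os.path.getmtime(path), name, path, dir_size(path)))

    usage = sum(size for _mtime, _name, _path, size in entries)
    if usage <= high_water:
        return []  # below the high-water mark: nothing to do

    deleted = []
    for _mtime, name, path, size in sorted(entries):  # oldest first
        if usage <= low_water:
            break
        if name in running:
            continue  # keep logs for jobs that are still running
        shutil.rmtree(path)
        usage -= size
        deleted.append(name)
    return deleted
```

A cron wrapper would call `cleanup_userlogs` with the TaskTracker's userlog directory, the two thresholds in bytes, and the set of running job names; how that set is obtained is left outside this sketch. Keeping the two thresholds apart avoids deleting a little on every run once the disk hovers near the limit.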