hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sherry Chen (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-2589) TaskTracker not purging userlog directories
Date Tue, 14 Jun 2011 17:47:47 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-2589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sherry Chen updated MAPREDUCE-2589:
-----------------------------------

    Attachment: MAPREDUCE-2589.patch

Attached fix will delete leftover user logs which are last modified 7 days ago
and are not for any running jobs when TaskTracker restarts. 
UserLogCleaner still
takes care of normal user logs cleanup.
DEFAULT_USER_LOG_RETAIN_HOURS is 1 day, so I added new
DEFAULT_USER_LOG_RETAIN_MAX_HOURS as 7 days.

I would like to have a separate JIRA to handle cleanup old userlogs based on
userlog disk space water mark since which may deal with job configuration settings
change.


> TaskTracker not purging userlog directories
> -------------------------------------------
>
>                 Key: MAPREDUCE-2589
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2589
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: tasktracker
>    Affects Versions: 0.20.205.0
>         Environment: 0.20.205
>            Reporter: Sherry Chen
>            Assignee: Sherry Chen
>            Priority: Minor
>         Attachments: MAPREDUCE-2589.patch, cleanup_userlogs.py
>
>
> UserLogCleaner is not robust. Leftover userlogs after a restart sometimes have to be
manually
> cleaned. Things can accumulate over a period of time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message