hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amareshwari Sriramadasu (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-2427) Cleanup of mapred.local.dir after maptask is complete
Date Mon, 26 May 2008 09:45:56 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-2427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Amareshwari Sriramadasu updated HADOOP-2427:
--------------------------------------------

    Attachment: patch-2427.txt

Currently the failed and killed tasks are cleanedup as soon as they report as finished. But
successful tasks are cleanedup at the end of the job, so that the map outputs will be available
for the reducers.  But now, since we have <taskid>/work as current working directory
for the task, and <taskid>/output directory for intermediate map output files. So, <taskid>/work
can be cleanedup for successful tasks as soon as the task is finished.

Here is a patch doing cleanup of workdir for successful tasks.

> Cleanup of mapred.local.dir after maptask is complete
> -----------------------------------------------------
>
>                 Key: HADOOP-2427
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2427
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.15.1
>            Reporter: lohit vijayarenu
>             Fix For: 0.18.0
>
>         Attachments: patch-2427.txt
>
>
> I see that after a map task is complete, its working directory (mapred.local.dir)/taskTracker/jobcache/<jobid>/<task_dir>
is not deleted untill the job is complete. If map out files are stored in there, could this
be created in different directory and the working directory cleaned up after map task is complete.
One problem we are seeing is, if a map task creates files temporary files, they get accumulated
and we may run out of disk space thus failing the job. Relying on the user to cleanup all
temp files created is be error prone.  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message