hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karthik Kambatla (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-5968) Work directory is not deleted when downloadCacheObject throws IOException
Date Tue, 05 Aug 2014 06:36:12 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Karthik Kambatla updated MAPREDUCE-5968:

       Resolution: Fixed
    Fix Version/s: 1.3.0
     Hadoop Flags: Reviewed
           Status: Resolved  (was: Patch Available)

Thanks for the contribution, [~zxu]. Just committed this to branch-1. 

> Work directory is not deleted when downloadCacheObject throws IOException
> -------------------------------------------------------------------------
>                 Key: MAPREDUCE-5968
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5968
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv1
>    Affects Versions: 1.2.1
>            Reporter: zhihai xu
>            Assignee: zhihai xu
>             Fix For: 1.3.0
>         Attachments: MAPREDUCE-5968.branch1.patch, MAPREDUCE-5968.branch1_new.patch,
> Work directory is not deleted in  DistCache if Exception happen in downloadCacheObject.
In downloadCacheObject, the cache file will be copied to temporarily work directory first,
then the  work directory will be renamed to the final directory. If IOException happens during
the copy, the  work directory will not be deleted. This will cause garbage data left in local
disk cache. For example If the MR application use Distributed Cache to send a very large Archive/file(50G),
if the disk is full during the copy, then the IOException will be triggered, the work directory
will be not deleted or renamed and the work directory will occupy a big chunk of disk space.

This message was sent by Atlassian JIRA

View raw message