hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Christian Kunz (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3841) merge phase runs out of disk space
Date Wed, 01 Oct 2008 01:07:44 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12635922#action_12635922

Christian Kunz commented on HADOOP-3841:

To get beyond this bottleneck, for such reduces requiring a lot disk space for merging, we
deleted all map outputs on such nodes, getting back a lot of space. On one of these nodes
with about 280GB reduce input we observed that one of the merged files was 75GB (a single

> merge phase runs out of disk space
> ----------------------------------
>                 Key: HADOOP-3841
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3841
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.17.2
>            Reporter: Christian Kunz
> We observe that reduce tasks run out of disk space during merging (after fetching all
map output) although there would be enough space if the framework did not try to generate
too large merge files.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message