hadoop-mapreduce-issues mailing list archives

From "Thomas Graves (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
Date Tue, 26 Mar 2013 18:01:22 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Graves updated MAPREDUCE-4087:
-------------------------------------

    Fix Version/s: 2.0.5-beta

I merged this to branch-2.
                
> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
> ----------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4087
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>            Reporter: Ravi Gummadi
>            Assignee: Ravi Gummadi
>             Fix For: 1.1.0, 3.0.0, 2.0.5-beta
>
>         Attachments: 4087.patch, 4087.trunk.patch
>
>
> In the map() method of the GenerateDistCacheData job of Gridmix, val.setSize() is called
> on every iteration, based on the bytes to be written to a distributed cache file. When the
> same map task goes on to write data to the next distributed cache file, the size of the
> random data generated in each iteration can become small, depending on the particular case.
> This can make distributed cache data generation slow.
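
The description above does not include the code in question, so the following is only a standalone sketch of one way such a slowdown could arise, not the Gridmix source. It assumes the chunk size is clamped to the bytes remaining in the current file and is never restored before the next file starts. The class name, method names, MAX_CHUNK, and the file sizes are all hypothetical; no Hadoop classes are used.

```java
public class ChunkSizeSketch {
    // Hypothetical capacity of the reusable random-data buffer, in bytes.
    static final int MAX_CHUNK = 1024;

    // Assumed problematic pattern: the chunk size is only ever reduced, and
    // the reduced value carries over from one file to the next.
    static long writesWithStickySize(long[] fileSizes) {
        long writes = 0;
        int chunk = MAX_CHUNK;
        for (long remaining : fileSizes) {
            while (remaining > 0) {
                chunk = (int) Math.min(chunk, remaining);
                remaining -= chunk;
                writes++;
            }
        }
        return writes;
    }

    // Alternative: derive each chunk size from the fixed capacity, so a small
    // tail in one file does not shrink the chunks used for the next file.
    static long writesWithResetSize(long[] fileSizes) {
        long writes = 0;
        for (long remaining : fileSizes) {
            while (remaining > 0) {
                int chunk = (int) Math.min(MAX_CHUNK, remaining);
                remaining -= chunk;
                writes++;
            }
        }
        return writes;
    }

    public static void main(String[] args) {
        // Hypothetical sizes: the first file ends with a 6-byte tail.
        long[] fileSizes = {1030, 4096};
        System.out.println("sticky size: " + writesWithStickySize(fileSizes) + " writes");
        System.out.println("reset size:  " + writesWithResetSize(fileSizes) + " writes");
    }
}
```

With these example sizes, the 6-byte tail of the first file leaves the sticky variant writing the second file in chunks of at most 6 bytes, which takes far more iterations than sizing each chunk from the full capacity.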

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
