hadoop-mapreduce-issues mailing list archives

From "Ravi Gummadi (Updated) (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-4087) [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
Date Fri, 30 Mar 2012 11:06:25 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-4087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Gummadi updated MAPREDUCE-4087:
------------------------------------

    Attachment: 4087.trunk.patch

Attaching patch for trunk.
                
> [Gridmix] GenerateDistCacheData job of Gridmix can become slow in some cases
> ----------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4087
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4087
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>            Reporter: Ravi Gummadi
>            Assignee: Ravi Gummadi
>         Attachments: 4087.patch, 4087.trunk.patch
>
>
> In the map() method of Gridmix's GenerateDistCacheData job, val.setSize() is called on
> every iteration, based on the number of bytes still to be written to the current
> distributed cache file. When the same map task then writes data to the next distributed
> cache file, the size of the random data generated in each iteration can become small,
> depending on the particular case. This can make distributed cache data generation slow.
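
To make the reported behavior concrete, here is a minimal, self-contained sketch in plain Java. It is not the Gridmix code and not the attached patch. It only models a buffer whose size is reset to min(current size, bytes left) on every iteration, which is one reading of the description above. The class name, method names, and the 1024-byte chunk size are hypothetical, and the "fixed" variant is just one possible way to avoid the slowdown.

```java
public class ShrinkingBufferDemo {
    // Hypothetical initial buffer size; the real job's value may differ.
    static final int CHUNK = 1024;

    // Models the reported behavior: the shared buffer size is reset to
    // min(current size, bytes left) and is never grown back, so a small
    // final chunk of one file carries over to the next file.
    static long writesWithShrinkingBuffer(long[] fileSizes) {
        int bufSize = CHUNK;
        long writes = 0;
        for (long fileSize : fileSizes) {
            long bytes = fileSize;
            while (bytes > 0) {
                bufSize = (int) Math.min(bufSize, bytes); // stands in for val.setSize(...)
                bytes -= bufSize;
                writes++;
            }
        }
        return writes;
    }

    // One possible alternative (an assumption, not necessarily the patch):
    // keep the buffer at full size and only limit the length of each write.
    static long writesWithFixedBuffer(long[] fileSizes) {
        long writes = 0;
        for (long fileSize : fileSizes) {
            long bytes = fileSize;
            while (bytes > 0) {
                int size = (int) Math.min(CHUNK, bytes);
                bytes -= size;
                writes++;
            }
        }
        return writes;
    }

    public static void main(String[] args) {
        // First file ends with a 1-byte chunk; second file is 10 full chunks.
        long[] sizes = {1025, 10240};
        System.out.println("shrinking buffer writes: " + writesWithShrinkingBuffer(sizes));
        System.out.println("fixed buffer writes:     " + writesWithFixedBuffer(sizes));
    }
}
```

With file sizes of 1025 and 10240 bytes, the shrinking model needs 10242 writes because the second file is written one byte at a time, while the fixed-size model needs 12.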

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
