hadoop-mapreduce-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amar Kamat (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-3008) [Gridmix] Improve cumulative CPU usage emulation for short running tasks
Date Wed, 14 Sep 2011 13:13:09 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-3008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Amar Kamat updated MAPREDUCE-3008:

    Attachment: mapreduce-2591-v1.4.2.patch

Attaching a patch that improves CPU emulation for short running tasks. Areas of improvements:
1. Sorter/Comparator now is CPU emulation aware
2. For tasks with no spills/merges, aggressive CPU emulation is done.

tets-patch and JUnit tests for Gridmix passed.

> [Gridmix] Improve cumulative CPU usage emulation for short running tasks
> ------------------------------------------------------------------------
>                 Key: MAPREDUCE-3008
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3008
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>          Components: contrib/gridmix
>    Affects Versions: 0.24.0
>            Reporter: Amar Kamat
>              Labels: cpu-emulation, gridmix
>             Fix For: 0.24.0
>         Attachments: mapreduce-2591-v1.4.2.patch
> CPU emulation in Gridmix fails to meet the expected target if the map has no data to
sort/spill/merge. There are 2 major reasons for this:
> 1. The map task end immediately ends soon after the map task. The map progress is 67%
while the map phase ends. 
> 2. Currently, the sort (comparator) doesnt emulate CPU. If the map is short lived, the
CPU emulation thread (spawned from the map task in cleanup) doesn't get a chance to emulate.

This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message