hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Todd Lipcon (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-3951) Tasks are not evenly spread throughout cluster in MR2
Date Thu, 01 Mar 2012 01:27:57 GMT
Tasks are not evenly spread throughout cluster in MR2

                 Key: MAPREDUCE-3951
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3951
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: scheduler
    Affects Versions: 0.23.0, 0.24.0
            Reporter: Todd Lipcon

In MR1 (at least with the fair and fifo schedulers), if you submit a job that needs fewer
resources than the cluster can provide, the tasks are spread relatively evenly across the
node. For example, submitting a 100-map job to a 50-node cluster, each with 10 slots, results
in 2 tasks on each machine. In MR2, however, the tasks would pile up on the first 10 nodes
of the cluster, leaving the other nodes unused. This is highly suboptimal for many use cases.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message