Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 46077 invoked from network); 27 Jun 2008 08:34:10 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 27 Jun 2008 08:34:10 -0000 Received: (qmail 59499 invoked by uid 500); 27 Jun 2008 08:34:08 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 59434 invoked by uid 500); 27 Jun 2008 08:34:08 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 59411 invoked by uid 99); 27 Jun 2008 08:34:08 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 27 Jun 2008 01:34:08 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 27 Jun 2008 08:33:26 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 4F565234C153 for ; Fri, 27 Jun 2008 01:33:46 -0700 (PDT) Message-ID: <1975845381.1214555626324.JavaMail.jira@brutus> Date: Fri, 27 Jun 2008 01:33:46 -0700 (PDT) From: "Vivek Ratan (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-3651) When assigning tasks to trackers, the job tracker should try to balance the number of tasks among the available trackers In-Reply-To: <891824603.1214518664961.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-3651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12608688#action_12608688 ] Vivek Ratan commented on HADOOP-3651: ------------------------------------- In the current JT, the code for determining which task to hand a TT uses the following logic: the JT first figures out the 'remaining load' per TT for maps/reduces (which is the total number of map and reduce tasks that need to be run across all running jobs, divided by the num of TTs). It then figures out how many maximum map or reduce tasks should be run on the TT (which is the minimum of the TT's capacity and the 'remaining load') - call this the 'max load'. Finally, if a TT can run something (ie, if the # of maps/reduces it is running is less than the 'max load'), it looks to give it a map task or a reduce task. As I had mentioned in a mail I sent to core-dev on 5/23, this logic can result in some TTs not getting a task to run, even when there are tasks waiting to be run. It can also result in a skewed distribution of tasks among TTs. Maye something like that is happening here. I don't know if it's possible to see the log files and determine what exactly happened. The new Resource Manager will, I think, result in a better distribution. For one, a TT's request is never rejected if there is a task to run. for another, the load will likely be spread out more evenly. > When assigning tasks to trackers, the job tracker should try to balance the number of tasks among the available trackers > ------------------------------------------------------------------------------------------------------------------------ > > Key: HADOOP-3651 > URL: https://issues.apache.org/jira/browse/HADOOP-3651 > Project: Hadoop Core > Issue Type: Improvement > Components: mapred > Affects Versions: 0.17.0 > Reporter: Runping Qi > > I encounter a number of situations like this: > A job tracker has 200 task trackers, each with 2 mapper slots and reducer slots. > When a job with 200 or fewer reducers was submitted to the job tracker, > one normally each task tracker will run one reducer. > Unfortunately, it seems that only about 1/3 of trackers have one reducer, and 1/3 trackers don't have reducer, and 1/3 have 2 reducers! -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.