Return-Path: Delivered-To: apmail-hadoop-mapreduce-dev-archive@minotaur.apache.org Received: (qmail 6435 invoked from network); 11 Sep 2010 01:22:13 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 11 Sep 2010 01:22:13 -0000 Received: (qmail 55786 invoked by uid 500); 11 Sep 2010 01:22:12 -0000 Delivered-To: apmail-hadoop-mapreduce-dev-archive@hadoop.apache.org Received: (qmail 55704 invoked by uid 500); 11 Sep 2010 01:22:12 -0000 Mailing-List: contact mapreduce-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-dev@hadoop.apache.org Delivered-To: mailing list mapreduce-dev@hadoop.apache.org Received: (qmail 55688 invoked by uid 99); 11 Sep 2010 01:22:12 -0000 Received: from Unknown (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 11 Sep 2010 01:22:12 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 11 Sep 2010 01:21:54 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o8B1LX8e014966 for ; Sat, 11 Sep 2010 01:21:33 GMT Message-ID: <19537330.129781284168093150.JavaMail.jira@thor> Date: Fri, 10 Sep 2010 21:21:33 -0400 (EDT) From: "Joydeep Sen Sarma (JIRA)" To: mapreduce-dev@hadoop.apache.org Subject: [jira] Created: (MAPREDUCE-2062) speculative execution is too aggressive under certain conditions MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org speculative execution is too aggressive under certain conditions ---------------------------------------------------------------- Key: MAPREDUCE-2062 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2062 Project: Hadoop Map/Reduce Issue Type: Bug Components: jobtracker Reporter: Joydeep Sen Sarma The function canBeSpeculated has subtle bugs that cause too much speculation in certain cases. - it compares the current progress of the task with the last observed mean of all the tasks. if only one task is in question - then the progress rate decays as time progresses (in the absence of updates) and std-dev is zero. So a job with a single reducer or mapper is almost always speculated. - is only a single task has reported progress - then the stddev is zero. so other tasks may be speculated aggressively. - several tasks take a while to report progress initially. they seem to get speculated as soon as speculative-lag is over. the lag should be configurable at the minimum. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.