Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 40022 invoked from network); 5 Apr 2008 20:06:02 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 5 Apr 2008 20:06:02 -0000 Received: (qmail 91719 invoked by uid 500); 5 Apr 2008 20:06:01 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 91686 invoked by uid 500); 5 Apr 2008 20:06:01 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 91677 invoked by uid 99); 5 Apr 2008 20:06:01 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Apr 2008 13:06:01 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 05 Apr 2008 20:05:27 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id EC2D8234C0BE for ; Sat, 5 Apr 2008 13:03:24 -0700 (PDT) Message-ID: <1661817632.1207425804966.JavaMail.jira@brutus> Date: Sat, 5 Apr 2008 13:03:24 -0700 (PDT) From: "Runping Qi (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-3130) Shuffling takes too long to get the last map output. In-Reply-To: <988740632.1206751104225.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-3130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12586047#action_12586047 ] Runping Qi commented on HADOOP-3130: ------------------------------------ actually, I think the getInputStream method has logic error. You should update timeout when catching exception, not the other way around. The easist way to implement the logic is to measure the elapse time difference when you catch the exception. If the elapse time is bigger than the given timeout, then throw the exception. > Shuffling takes too long to get the last map output. > ---------------------------------------------------- > > Key: HADOOP-3130 > URL: https://issues.apache.org/jira/browse/HADOOP-3130 > Project: Hadoop Core > Issue Type: Bug > Reporter: Runping Qi > Assignee: Amar Kamat > Attachments: HADOOP-3130-v2.patch, HADOOP-3130-v2.patch, HADOOP-3130.patch, shuffling.log > > > I noticed that towards the end of shufflling, the map output fetcher of the reducer backs off too aggressively. > I attach a fraction of one reduce log of my job. > Noticed that the last map output was not fetched in 2 minutes. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.