Message-ID: <1279195948.1241626290574.JavaMail.jira@brutus>
Date: Wed, 6 May 2009 09:11:30 -0700 (PDT)
From: "Ravi Gummadi (JIRA)"
To: core-dev@hadoop.apache.org
Reply-To: core-dev@hadoop.apache.org
Subject: [jira] Updated: (HADOOP-5572) The map progress value should have a separate phase for doing the final sort.
In-Reply-To: <966550785.1237992292737.JavaMail.jira@brutus>

     [ https://issues.apache.org/jira/browse/HADOOP-5572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravi Gummadi updated HADOOP-5572:
---------------------------------

    Attachment: HADOOP-5572.v1.patch

Incorporated Jothi's first three comments. Discussed comments 4 and 5 with Jothi offline.

For comment 4, there seems to be no cleaner way, so I am keeping it as is.

Regarding comment 5, checking for empty segments (by reading the segments) before the actual merges seems to be costly in terms of performance. So the estimation does not handle empty segments separately, on the assumption that this will not hurt the approximation of mergeProgress much.

Fixed an issue in informReduceProgress() by changing the call from Progress.get() to Progress.getInternal(), because we need the progress of this phase/node only (and not of the whole tree). Made Progress.getInternal() public.

Attaching the patch with the above changes. Please review and provide your comments.

> The map progress value should have a separate phase for doing the final sort.
> -----------------------------------------------------------------------------
>
>                 Key: HADOOP-5572
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5572
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Owen O'Malley
>            Assignee: Ravi Gummadi
>         Attachments: HADOOP-5572.patch, HADOOP-5572.v1.patch
>
>
> Currently, the final spill and sort doesn't record any progress while it runs, leading to the perception that the map is done, but "stuck".
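
For readers unfamiliar with the get() versus getInternal() distinction mentioned in the comment, here is a minimal sketch of a phase tree. PhaseNode and its equal weighting of phases are assumptions made for illustration only. It is not Hadoop's Progress class and not the code in the attached patch. It only shows why a call that climbs to the root reports a different number than a call that stays on the current node.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical, simplified phase tree; NOT Hadoop's actual Progress class. */
class PhaseNode {
    private PhaseNode parent;                       // null for the root
    private final List<PhaseNode> phases = new ArrayList<>();
    private int currentPhase = 0;                   // index of the running child phase
    private float progress = 0.0f;                  // used only by leaf nodes

    /** Adds a child phase; all children are weighted equally (an assumption). */
    PhaseNode addPhase() {
        PhaseNode child = new PhaseNode();
        child.parent = this;
        phases.add(child);
        return child;
    }

    /** Marks the current child phase as finished and moves to the next one. */
    void startNextPhase() {
        currentPhase++;
    }

    /** Sets the completion fraction (0..1) of a leaf node. */
    void set(float p) {
        progress = p;
    }

    /** Progress of the whole tree: climb to the root, then compute from there. */
    float get() {
        PhaseNode node = this;
        while (node.parent != null) {
            node = node.parent;
        }
        return node.getInternal();
    }

    /** Progress of this node only, including its own sub-phases. */
    float getInternal() {
        int n = phases.size();
        if (n == 0) {
            return progress;
        }
        float sub = currentPhase < n ? phases.get(currentPhase).getInternal() : 0.0f;
        return (currentPhase + sub) / n;
    }

    public static void main(String[] args) {
        PhaseNode root = new PhaseNode();
        PhaseNode map = root.addPhase();
        PhaseNode sort = root.addPhase();
        map.set(1.0f);
        root.startNextPhase();                      // map finished, sort running
        sort.set(0.5f);
        System.out.println("whole tree: " + sort.get());
        System.out.println("this node:  " + sort.getInternal());
    }
}
```

In this sketch, with the first of two phases finished and the second half done, sort.get() reports the progress of the whole tree, while sort.getInternal() reports only the second phase. The latter is the kind of per-phase value the comment says informReduceProgress() needs.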