From: "Vivek Ratan (JIRA)"
To: hadoop-dev@lucene.apache.org
Reply-To: hadoop-dev@lucene.apache.org
Date: Fri, 8 Jun 2007 04:19:29 -0700 (PDT)
Subject: [jira] Assigned: (HADOOP-1201) Progress reporting can be improved for both Map/Reduce tasks
Message-ID: <3595230.1181301569076.JavaMail.jira@brutus>
In-Reply-To: <3283582.1175665832240.JavaMail.jira@brutus>

     [ https://issues.apache.org/jira/browse/HADOOP-1201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vivek Ratan reassigned HADOOP-1201:
-----------------------------------

    Assignee: Vivek Ratan

> Progress reporting can be improved for both Map/Reduce tasks
> ------------------------------------------------------------
>
>                 Key: HADOOP-1201
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1201
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Devaraj Das
>            Assignee: Vivek Ratan
>
> Both the map and reduce tasks do progress reporting in separate threads. However, in the ReduceTask, after the sort phase, the progress reporting happens inline with the reducer invocations. This slows down the Reduce phase since RPC is involved for every progress report. The better thing to do would be to do progress reporting for all phases in separate threads and have the tasks just update the progress fields.
> One proposal is to extract out the reporting stuff that is there in MapTask/ReduceTask and put it in the Task superclass as a new class, and have methods in the new class that control what/when progress is reported. Thoughts?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
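
The idea in the issue description (tasks only update a progress field, and a separate thread does the reporting) could be sketched roughly as below. This is an illustrative sketch only, not Hadoop code: the class name ProgressReporterSketch, the ProgressSink interface (a stand-in for the RPC that carries a progress report), and the fixed reporting interval are all assumptions made for the example.

```java
public class ProgressReporterSketch {

    /** Hypothetical stand-in for the RPC that delivers a progress report. */
    public interface ProgressSink {
        void report(float progress);
    }

    private final ProgressSink sink;
    private final long intervalMillis;
    private final Thread reporter;

    // Written by the task thread, read by the reporter thread.
    private volatile float progress;
    private volatile boolean running = true;

    public ProgressReporterSketch(ProgressSink sink, long intervalMillis) {
        this.sink = sink;
        this.intervalMillis = intervalMillis;
        this.reporter = new Thread(this::reportLoop, "progress-reporter");
        this.reporter.setDaemon(true);
    }

    public void start() {
        reporter.start();
    }

    /** Cheap call for the task's inner loop: a field write, no RPC. */
    public void setProgress(float value) {
        progress = value;
    }

    /** Runs on the reporter thread: one report per interval. */
    private void reportLoop() {
        while (running) {
            try {
                Thread.sleep(intervalMillis);
            } catch (InterruptedException e) {
                return; // stop() interrupted the sleep
            }
            sink.report(progress);
        }
    }

    /** Stops the reporter thread, then sends one final report from the caller. */
    public void stop() {
        running = false;
        reporter.interrupt();
        try {
            reporter.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        sink.report(progress);
    }
}
```

With this shape, a reducer loop would call setProgress(...) once per invocation, and the number of reports would depend on elapsed time rather than on the number of records processed.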