Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 5305 invoked from network); 23 May 2008 20:40:19 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 23 May 2008 20:40:19 -0000 Received: (qmail 90241 invoked by uid 500); 23 May 2008 20:40:19 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 90218 invoked by uid 500); 23 May 2008 20:40:18 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 90207 invoked by uid 99); 23 May 2008 20:40:18 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 May 2008 13:40:18 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 23 May 2008 20:39:40 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 14EB8234C11B for ; Fri, 23 May 2008 13:39:56 -0700 (PDT) Message-ID: <2013156046.1211575196084.JavaMail.jira@brutus> Date: Fri, 23 May 2008 13:39:56 -0700 (PDT) From: "Ari Rabkin (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Updated: (HADOOP-3441) Pass the size of the MapReduce input to JobInProgress In-Reply-To: <1624353651.1211573635741.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-3441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ari Rabkin updated HADOOP-3441: ------------------------------- Comment: was deleted > Pass the size of the MapReduce input to JobInProgress > ----------------------------------------------------- > > Key: HADOOP-3441 > URL: https://issues.apache.org/jira/browse/HADOOP-3441 > Project: Hadoop Core > Issue Type: Improvement > Components: mapred > Affects Versions: 0.17.0 > Environment: all > Reporter: Ari Rabkin > Assignee: Ari Rabkin > Priority: Minor > Fix For: 0.18.0 > > Attachments: addDataSize.patch > > > Currently, there's no easy way for the JobInProgress to know how large the job's input data is. > This patch corrects the problem, by storing the size of the input split's data through the RawSplit. The sizes of each split are then totaled up and made available via JobInProgress.getInputSize(). > This is needed, among other reasons, so that the JobInProgress knows how much data it's being run on, which will help build smarter schedulers. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.