From: "Aaron Kimball (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Reply-To: mapreduce-issues@hadoop.apache.org
Date: Fri, 18 Dec 2009 03:26:18 +0000 (UTC)
Subject: [jira] Updated: (MAPREDUCE-1059) distcp can generate uneven map task assignments
Message-ID: <514759599.1261106778371.JavaMail.jira@brutus>
In-Reply-To: <848137403.1254786451469.JavaMail.jira@brutus>

     [ https://issues.apache.org/jira/browse/MAPREDUCE-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aaron Kimball updated MAPREDUCE-1059:
-------------------------------------

    Status: Patch Available  (was: Open)

> distcp can generate uneven map task assignments
> -----------------------------------------------
>
>                 Key: MAPREDUCE-1059
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1059
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: distcp
>            Reporter: Aaron Kimball
>            Assignee: Aaron Kimball
>         Attachments: MAPREDUCE-1059.2.patch, MAPREDUCE-1059.3.patch, MAPREDUCE-1059.patch
>
>
> distcp writes out a SequenceFile containing the source files to transfer, and their sizes. Map tasks are created over spans of this file, representing files which each mapper should transfer. In practice, some transfer loads yield many empty map tasks and a few tasks perform the bulk of the work.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
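
The imbalance described in the issue can be illustrated with a toy model. This is a sketch under stated assumptions, not distcp's actual code: it assumes a reader can only begin at periodic sync points in the listing file, and that a span receives only the records in blocks whose sync point falls inside that span. The function name, offsets, and sizes are all hypothetical.

```python
from bisect import bisect_right

def assign_to_spans(records, sync_points, span_starts):
    """Toy model of handing listing records to map tasks.

    records:     (offset, path, size) tuples sorted by offset in the listing file
    sync_points: sorted offsets where a reader may begin; the first must be 0
    span_starts: sorted start offsets of the spans; the first must be 0

    Assumption (not taken from the issue text): a record goes to the span
    that contains the sync point opening the record's block.
    """
    tasks = [[] for _ in span_starts]
    for offset, path, size in records:
        # Sync point that opens the block holding this record.
        sync = sync_points[bisect_right(sync_points, offset) - 1]
        # Span whose byte range contains that sync point.
        task = bisect_right(span_starts, sync) - 1
        tasks[task].append((path, size))
    return tasks

# Hypothetical listing: ten 40-byte records, each describing a 100-byte file.
records = [(i * 40, f"/src/file{i}", 100) for i in range(10)]
sync_points = [0, 200]               # only two places a reader can start
span_starts = list(range(0, 400, 50))  # eight 50-byte spans, one per map task

tasks = assign_to_spans(records, sync_points, span_starts)
bytes_per_task = [sum(size for _, size in t) for t in tasks]
empty_tasks = sum(1 for t in tasks if not t)
```

In this model, spans narrower than the distance between sync points mostly contain no sync point at all, so their map tasks receive nothing while the two spans that do contain one carry every file. Whether this is the mechanism behind MAPREDUCE-1059 is not stated in the message; the attached patches are the authoritative description of the fix.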