Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 33669 invoked from network); 31 Jul 2008 05:18:35 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 31 Jul 2008 05:18:35 -0000 Received: (qmail 85529 invoked by uid 500); 31 Jul 2008 05:18:27 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 85466 invoked by uid 500); 31 Jul 2008 05:18:26 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Received: (qmail 85444 invoked by uid 99); 31 Jul 2008 05:18:26 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 30 Jul 2008 22:18:26 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ankur.goel@corp.aol.com designates 64.236.137.26 as permitted sender) Received: from [64.236.137.26] (HELO r2d2.aoltw.net) (64.236.137.26) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 31 Jul 2008 05:17:32 +0000 Received: from AOLMTCMEH01.ad.office.aol.com (aolmtcmeh01.office.aol.com [10.178.121.20]) by r2d2.aoltw.net (8.10.0/8.10.0) with ESMTP id m6V5Hv706928; Wed, 30 Jul 2008 22:17:57 -0700 (PDT) Received: from EVSBNG02.ad.office.aol.com ([10.146.190.243]) by AOLMTCMEH01.ad.office.aol.com with Microsoft SMTPSVC(6.0.3790.3959); Thu, 31 Jul 2008 01:17:58 -0400 X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Subject: RE: How can I control Number of Mappers of a job? Date: Thu, 31 Jul 2008 10:47:53 +0530 Message-ID: <8F11722A0562BB4F80A680ED4CFED0D5049B393D@EVSBNG02.ad.office.aol.com> In-Reply-To: <467215.1317.qm@web59713.mail.ac4.yahoo.com> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: How can I control Number of Mappers of a job? Thread-Index: AcjymSd2cysMKGvtRNOtsRjJPd0IJQAMrPJw From: "Goel, Ankur" To: Cc: X-OriginalArrivalTime: 31 Jul 2008 05:17:58.0131 (UTC) FILETIME=[CB7D8430:01C8F2CC] X-Virus-Checked: Checked by ClamAV on apache.org How big is your cluster? Assuming you are running a single node cluster, Hadoop-default.xml has a parameter 'mapred.map.tasks' that is set to 2. So By default, no matter how many map tasks are calculated by framework, only 2 map task will execute on a single node cluster. -----Original Message----- From: Gopal Gandhi [mailto:gopal.gandhi2008@yahoo.com]=20 Sent: Thursday, July 31, 2008 4:38 AM To: core-user@hadoop.apache.org Cc: core-dev@hadoop.apache.org Subject: How can I control Number of Mappers of a job? The motivation is to control the max # of mappers of a job. For example, the input data is 246MB, divided by 64M is 4. If by default there will be 4 mappers launched on the 4 blocks.=20 What I want is to set its max # of mappers as 2, so that 2 mappers are launched first and when they completes on the first 2 blocks, another 2 mappers start on the rest 2 blocks. Does Hadoop provide a way? =20