Return-Path: Delivered-To: apmail-lucene-hadoop-user-archive@locus.apache.org Received: (qmail 79933 invoked from network); 12 May 2006 01:14:37 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 12 May 2006 01:14:37 -0000 Received: (qmail 16050 invoked by uid 500); 12 May 2006 01:11:37 -0000 Delivered-To: apmail-lucene-hadoop-user-archive@lucene.apache.org Received: (qmail 14904 invoked by uid 500); 12 May 2006 01:11:25 -0000 Mailing-List: contact hadoop-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-user@lucene.apache.org Delivered-To: mailing list hadoop-user@lucene.apache.org Received: (qmail 13125 invoked by uid 99); 12 May 2006 01:10:58 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 May 2006 18:10:58 -0700 X-ASF-Spam-Status: No, hits=1.4 required=10.0 tests=DNS_FROM_RFC_ABUSE,DNS_FROM_RFC_WHOIS X-Spam-Check-By: apache.org Received-SPF: neutral (asf.osuosl.org: local policy) Received: from [216.145.54.173] (HELO mrout3.yahoo.com) (216.145.54.173) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 May 2006 17:47:44 -0700 Received: from excon01-bur.Search.Corpsys.P4pnet.net (excon01-bur.search.corpsys.p4pnet.net [172.30.80.71]) by mrout3.yahoo.com (8.13.6/8.13.4/y.out) with ESMTP id k4C0juuC059037 for ; Thu, 11 May 2006 17:45:56 -0700 (PDT) Received: from EXCHG02-BUR.Search.Corpsys.P4pnet.net ([172.30.80.40]) by excon01-bur.Search.Corpsys.P4pnet.net with Microsoft SMTPSVC(5.0.2195.6713); Thu, 11 May 2006 17:45:56 -0700 X-MimeOLE: Produced By Microsoft Exchange V6.0.6603.0 content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Subject: Map Tasks assignment to nodes Date: Thu, 11 May 2006 17:45:56 -0700 Message-ID: <9EAEB8710A398D4D960257830756F5780616902C@exchg02-bur.search.corpsys.p4pnet.net> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Map Tasks assignment to nodes Thread-Index: AcZ1XW8XEiEsKBU+SZ+DLlpoOgTY6w== From: "Vijay Murthi" To: X-OriginalArrivalTime: 12 May 2006 00:45:56.0793 (UTC) FILETIME=[6E33BA90:01C6755D] X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N I am trying to run 8 map tasks with 2 reduce on 3 machines. Each task runs on a 6 MB text file and 500 such files. The monitoring page shows very few number of Map tasks running than intended. Sometimes some nodes doesn't even get any tasks assigned though there are large number of files remaining needs to be scheduled for map operation. Is it due to distributing the files across nodes? In fact, my file system is set to local. Some important parameters are listed below Io.sort.factor=3D100 Io.sort.mb =3D 1000 Io.file.buffer.size =3D 4096000 Io.bytes.checksum=3D128 Mapred.map.tasks=3D16 Mapred.reduce.tasks=3D2 Mapred.tasktracker.tasks.maximum=3D4 Mapred.combine.buffer.size=3D100000 Is there any parameter I am missing to maximize the use of all CPUS?=20 Thanks, VJ