From: Mirko Kämpf
Date: Wed, 11 Dec 2013 16:21:15 +0000
Subject: Re: Why is Hadoop always running just 4 tasks?
To: user@hadoop.apache.org

Hi,

what command do you execute to submit the job?
Please also share the driver code so we can troubleshoot better.

Best wishes
Mirko

2013/12/11 Dror, Ittay <idror@akamai.com>

> I have a cluster of 4 machines with 24 cores and 7 disks each.
>
> On each node I copied from local a file of 500G. So I have 4 files in hdfs
> with many blocks. My replication factor is 1.
>
> I run a job (a scalding flow) and while there are 96 reducers pending,
> there are only 4 active map tasks.
>
> What am I doing wrong? Below is the configuration
>
> Thanks,
> Ittay
>
> <configuration>
> <property>
>   <name>mapred.job.tracker</name>
>   <value>master:54311</value>
> </property>
>
> <property>
>   <name>mapred.map.tasks</name>
>   <value>96</value>
> </property>
>
> <property>
>   <name>mapred.reduce.tasks</name>
>   <value>96</value>
> </property>
>
> <property>
>   <name>mapred.local.dir</name>
>   <value>/hdfs/0/mapred/local,/hdfs/1/mapred/local,/hdfs/2/mapred/local,/hdfs/3/mapred/local,/hdfs/4/mapred/local,/hdfs/5/mapred/local,/hdfs/6/mapred/local,/hdfs/7/mapred/local</value>
> </property>
>
> <property>
>   <name>mapred.tasktracker.map.tasks.maximum</name>
>   <value>24</value>
> </property>
>
> <property>
>   <name>mapred.tasktracker.reduce.tasks.maximum</name>
>   <value>24</value>
> </property>
> </configuration>
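
For reference while troubleshooting, here is a rough sketch of the capacity arithmetic implied by the quoted configuration. The node count, slot limits and file sizes come from the message; the 64 MB block size and the reading of "500G" as 500 GiB are assumptions, since neither is stated in the thread. The sketch only shows what the settings nominally allow and does not explain why only 4 map tasks are active.

```python
# Capacity arithmetic for the cluster described in the quoted message.
# Values marked "assumption" are NOT taken from the thread.

NODES = 4                          # "a cluster of 4 machines"
MAP_SLOTS_PER_NODE = 24            # mapred.tasktracker.map.tasks.maximum
REDUCE_SLOTS_PER_NODE = 24         # mapred.tasktracker.reduce.tasks.maximum
FILES = 4                          # one 500G file copied on each node
FILE_SIZE_BYTES = 500 * 1024**3    # assumption: "500G" read as 500 GiB
BLOCK_SIZE_BYTES = 64 * 1024**2    # assumption: 64 MB HDFS block size


def total_slots(nodes: int, slots_per_node: int) -> int:
    """Cluster-wide slot count: per-node maximum times number of nodes."""
    return nodes * slots_per_node


def blocks_per_file(file_size: int, block_size: int) -> int:
    """Number of HDFS blocks a file occupies (ceiling division)."""
    return -(-file_size // block_size)


map_slots = total_slots(NODES, MAP_SLOTS_PER_NODE)
reduce_slots = total_slots(NODES, REDUCE_SLOTS_PER_NODE)
total_blocks = FILES * blocks_per_file(FILE_SIZE_BYTES, BLOCK_SIZE_BYTES)

print("map slots:   ", map_slots)
print("reduce slots:", reduce_slots)
print("input blocks:", total_blocks)
```

Under these assumptions the input spans far more blocks than there are map slots. How many map tasks the job actually gets depends on the input format and on how the flow is submitted, which is why the submit command and driver code are needed.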