Return-Path: X-Original-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2B5989276 for ; Mon, 12 Dec 2011 05:04:00 +0000 (UTC) Received: (qmail 9644 invoked by uid 500); 12 Dec 2011 05:03:58 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 8866 invoked by uid 500); 12 Dec 2011 05:03:52 -0000 Mailing-List: contact mapreduce-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-user@hadoop.apache.org Delivered-To: mailing list mapreduce-user@hadoop.apache.org Received: (qmail 8854 invoked by uid 99); 12 Dec 2011 05:03:51 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 12 Dec 2011 05:03:51 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of harsh@cloudera.com designates 209.85.210.48 as permitted sender) Received: from [209.85.210.48] (HELO mail-pz0-f48.google.com) (209.85.210.48) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 12 Dec 2011 05:03:44 +0000 Received: by dadi14 with SMTP id i14so6644072dad.35 for ; Sun, 11 Dec 2011 21:03:24 -0800 (PST) Received: by 10.68.72.6 with SMTP id z6mr18923936pbu.73.1323666204199; Sun, 11 Dec 2011 21:03:24 -0800 (PST) MIME-Version: 1.0 Received: by 10.142.254.9 with HTTP; Sun, 11 Dec 2011 21:03:02 -0800 (PST) In-Reply-To: <2011121212530212660634@163.com> References: <2011121211184481597714@163.com> <8EB66E99-D6E9-4B11-86E0-ADCED15198F3@cloudera.com> <2011121212530212660634@163.com> From: Harsh J Date: Mon, 12 Dec 2011 10:33:02 +0530 Message-ID: Subject: Re: Re: About slots of tasktracker and munber of map taskers To: mapreduce-user@hadoop.apache.org, tanjun_2525 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Tan, As an admin, I can even choose to set configuration to even 100 slots on a 4-core node, if I feel like burning the box. There is no hardware auto-detection, and the slot limit is entirely controlled by the mapred-site.xml for that TaskTracker. The book merely tries to tell that you need to set these maximum slot settings based on your hardware knowledge on each node -- TaskTrackers do nothing of that sort on their own. There is some CPU/Memory considerations taken into account by a variety of non-default Schedulers in JobTracker, but your slot limits per tasktracker is entirely controlled by configuration. 2011/12/12 Tan Jun : > Hi Harsh, > Now I know the number of maps and reduces run simultaneously is set by th= e > administrator in mapred-site.xml with default value 2. > But I cant get the point about number of slots. > For my understanding by now, > the number of=A0 slots is decides by hardware that administrator cannot > change. > Is that wright? > > ________________________________ > Tan Jun > > From:=A0Harsh J > Date:=A02011-12-12=A012:22 > To:=A0mapreduce-user > Subject:=A0Re: About slots of tasktracker and munber of map taskers > Hi Tan, > > On 12-Dec-2011, at 8:48 AM, Tan Jun wrote: > > Hi, > I dont really understand the meaning of the sentences in "The Definitive > Guide"(page 155): > > Tasktrackers=A0have=A0a=A0fixed=A0number=A0of=A0slots=A0for=A0map=A0tasks= =A0and=A0for=A0reduce=A0tasks:=A0for=A0example, > a=A0tasktracker=A0may=A0be=A0able=A0to=A0run=A0two=A0map=A0tasks=A0and=A0= two=A0reduce=A0tasks=A0simultaneously. > (The=A0precise=A0number=A0depends=A0on=A0the=A0number=A0of=A0cores=A0and= =A0the=A0amount=A0of > memory=A0on=A0the=A0tasktracker;=A0see=A0=93Memory=94=A0on=A0page=A0254.) > > Does that mean the=A0number of slots is fixed and the number of maps run > simultaneously is set by user? > > > Not by the user, but by the administrator. Each tasktracker is configured= in > production with a 'task slot' upper limit - say, 8 maps and 4 reducers fo= r a > 12-core machine. This is not auto-configured (unless you use auto cluster > setup+configuration tools that determine it for you [0]), and has to be s= et > when configuring Hadoop daemons. > > The book means to imply that you need to set these, based on the memory a= nd > CPU configuration of your machines. By default, tasktrackers have limits = of > 2+2. > > See=A0http://wiki.apache.org/hadoop/LimitingTaskSlotUsage > > [0] -=A0http://www.cloudera.com/products-services/tools/=A0is one. --=20 Harsh J