Return-Path: Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: (qmail 8418 invoked from network); 22 Nov 2010 16:52:25 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 22 Nov 2010 16:52:25 -0000 Received: (qmail 41080 invoked by uid 500); 22 Nov 2010 16:52:55 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 41039 invoked by uid 500); 22 Nov 2010 16:52:55 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 41031 invoked by uid 99); 22 Nov 2010 16:52:55 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Nov 2010 16:52:55 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of qwertymaniac@gmail.com designates 209.85.160.48 as permitted sender) Received: from [209.85.160.48] (HELO mail-pw0-f48.google.com) (209.85.160.48) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 22 Nov 2010 16:52:49 +0000 Received: by pwj9 with SMTP id 9so1986408pwj.35 for ; Mon, 22 Nov 2010 08:52:28 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=QrMKIUMO0PYIkUWKABUiURvEvB9qHXrYgN92RdxRmyU=; b=qF68Js2BS0VP88WRJ0QJesnAkPaByQWfLNejN7R23OSW5wMMSTXeffpgaJSDbFR/Cy 85lLBV9HwFHFeaKhrLWrad4YcZ2lbuY2xybpFXr+T5nLS/OkXKyQkkaAzpcwC8gD4jph k0XNqVJP89dR5F7d7rQ7j6G7flNE2SHdGAKeo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=DjgFDP9OLz9REkS5LbJEsVNhCCmvzuAyDNaGIq84/lXTuyswJM7fL+V/B5lXetAxbo 574D1prwrNzP60N8ddz2koPsPUU8iTobZlEavf07OsHr4+0k5RorCqZq3Sgcw/FHb7kK QDt6QwqfSAnLsi9GOVmAx5XGS07Sy3bnOkEsY= Received: by 10.223.110.148 with SMTP id n20mr4362989fap.48.1290444746328; Mon, 22 Nov 2010 08:52:26 -0800 (PST) MIME-Version: 1.0 Received: by 10.223.113.145 with HTTP; Mon, 22 Nov 2010 08:52:06 -0800 (PST) In-Reply-To: <126335.8536.qm@web112111.mail.gq1.yahoo.com> References: <126335.8536.qm@web112111.mail.gq1.yahoo.com> From: Harsh J Date: Mon, 22 Nov 2010 22:22:06 +0530 Message-ID: Subject: Re: Hadoop - how exactly is a slot defined To: general@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Hi, On Mon, Nov 22, 2010 at 10:02 PM, Grandl Robert wrote: > Hi all, > > I have troubles in understanding what exactly a slot is. Always we are ta= lking about tasks assigned to slots, but I did not found anywhere what exac= tly a slot is. I assume it represent some allocation of RAM memory as well = as with some computation power. > > However, can somebody explain me what exactly a slot means (in terms of r= esources allocated for a slot) and how this mapping(between slot and physic= al resources) is done in Hadoop ? Or give me some hints about the files in = the Hadoop=A0 where it may should be ? A slot is of two types -- Map slot and Reduce slot. A slot represents an ability to run one of these "Tasks" (map/reduce tasks) individually at a point of time. Therefore, multiple slots on a TaskTracker means multiple "Tasks" may execute in parallel. Right now total slots in a TaskTracker is =3D=3D mapred.tasktracker.map.tasks.maximum for Maps and mapred.tasktracker.reduce.tasks.maximum for Reduces. Hadoop is indeed trying to go towards the dynamic slot concept, which could rely on the current resources available on a system, but work for this is still in conceptual phases. TaskTrackers emit system status (like CPU load, utilization, memory available/user, load averages) in their heartbeats today (and is utilized by certain schedulers, I think Capacity Scheduler uses it to determine stuff), but the concept of slots is still fixed as a maximum to the above two configurations on each TaskTracker. For code on how slots are checked/utilized, see any Scheduler plugin's code -- LimitTasksPerJobTaskScheduler, CapacityTaskScheduler for example. > > Thanks a lot, > Robert > > > --=20 Harsh J www.harshj.com