Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8F07E11836 for ; Mon, 13 May 2013 21:45:48 +0000 (UTC) Received: (qmail 19179 invoked by uid 500); 13 May 2013 18:52:23 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 19055 invoked by uid 500); 13 May 2013 18:52:23 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 19046 invoked by uid 99); 13 May 2013 18:52:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 13 May 2013 18:52:23 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of harsh@cloudera.com designates 209.85.210.171 as permitted sender) Received: from [209.85.210.171] (HELO mail-ia0-f171.google.com) (209.85.210.171) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 13 May 2013 18:52:17 +0000 Received: by mail-ia0-f171.google.com with SMTP id r13so7973531iar.2 for ; Mon, 13 May 2013 11:51:56 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=x-received:mime-version:in-reply-to:references:from:date:message-id :subject:to:content-type:x-gm-message-state; bh=ndmtfEibPXvBSDK/GG+qghRhRHUjoA8iS8Zz0oyF+m8=; b=bQMGYO9DWtQx2pxREacdhUvMl4OiP7EpG63+dzWeMDm4k37lpEqG1rmF6uIxCpXAYJ xgysHfnpkNsUN32QnxMf8V0soJYX9jBZDGwmX1wv1Wp3qCHG38/ytf2VPj6FEJS5erHi DGlUf+qX2qDrOswy1PDoT6kz1l2VTa9NlnOGjA20Inqf8fyNEXLts9tcQ3tOP4rbAL/b 00lmapY7VAHut0BnjvQBQjM14WJluO3ZV8KOUH0671lotQCEze7GU56MtPLxm7+mtQ1N YyNa7/l8u5pC2AXXrxEGO41WvV0bICdLLTMfE04IJS2jbhZDEt/+rNUV/fMZKI3KbiBb eNVw== X-Received: by 10.42.53.129 with SMTP id n1mr15054980icg.28.1368471116371; Mon, 13 May 2013 11:51:56 -0700 (PDT) MIME-Version: 1.0 Received: by 10.50.95.34 with HTTP; Mon, 13 May 2013 11:51:36 -0700 (PDT) In-Reply-To: References: From: Harsh J Date: Tue, 14 May 2013 00:21:36 +0530 Message-ID: Subject: Re: Hadoop schedulers! To: "" Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQkkgIt3/fnjNLJq4EOUZotozYaXMBse6LEBLo7wzLfjNQ9nr2vy7XnqLTIdo9tD4aqY/UYs X-Virus-Checked: Checked by ClamAV on apache.org Hi, On Sat, May 11, 2013 at 8:31 PM, Rahul Bhattacharjee wrote: > Hi, > > I was going through the job schedulers of Hadoop and could not see any major > operational difference between the capacity scheduler and the fair share > scheduler apart from the fact that fair share scheduler supports preemption > and capacity scheduler doesn't. I'd suggest reading design of both schedulers. The preemption feature is not the only difference - there is also differences in how the queues behave and how the tasks from various lined jobs are picked for scheduling (i.e. the base algorithm). > Another thing is the former creates logical pools based on certain attribute > like username , user group etc and the later has a notion of job queues. Can > someone point me to any other major differences between these two types of > schedulers. Note that FairScheduler can also reuse the queues concept if you point the pool name property at the queue name property config. > Another question in this regard is the capacity scheduler uses a FIFO > queue.So its still possible that a high priority long running job using all > the capacity allocated to the queue to block all the other jobs after it in > the queue.I think this is the expected behavior , but wanted to confirm. I think this is the case, yes, if all the capacity has been soaked up currently. However, the CS doesn't wait on job completions to schedule next jobs if slots are free (like say, in the last wave). > Thanks, > Rahul > > -- Harsh J