Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 619A22009F4 for ; Thu, 26 May 2016 09:10:15 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 60346160A2C; Thu, 26 May 2016 07:10:15 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id A3F44160A2B for ; Thu, 26 May 2016 09:10:14 +0200 (CEST) Received: (qmail 91595 invoked by uid 500); 26 May 2016 07:10:13 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 91559 invoked by uid 99); 26 May 2016 07:10:13 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 26 May 2016 07:10:13 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id DB7EF2C1F5C for ; Thu, 26 May 2016 07:10:12 +0000 (UTC) Date: Thu, 26 May 2016 07:10:12 +0000 (UTC) From: "Karthik Kambatla (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (YARN-5077) Fix FSLeafQueue#getFairShare() for queues with weight 0.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 26 May 2016 07:10:15 -0000 [ https://issues.apache.org/jira/browse/YARN-5077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15301649#comment-15301649 ] Karthik Kambatla commented on YARN-5077: ---------------------------------------- Actually, thinking more about this, I am beginning to doubt our initial approach. Or, it has just been a long day. A queue should get resources assigned even if the fairshare is zero. However, the AM cannot be spawned because the maxAMShare is zero if the fairshare is zero. [~yufeigu] - can you confirm if this is the case? If this is indeed the case, we should probably look at this maxAMShare thing more comprehensively. > Fix FSLeafQueue#getFairShare() for queues with weight 0.0 > --------------------------------------------------------- > > Key: YARN-5077 > URL: https://issues.apache.org/jira/browse/YARN-5077 > Project: Hadoop YARN > Issue Type: Bug > Reporter: Yufei Gu > Assignee: Yufei Gu > Attachments: YARN-5077.001.patch, YARN-5077.002.patch, YARN-5077.003.patch, YARN-5077.004.patch, YARN-5077.005.patch > > > 1) When a queue's weight is set to 0.0, FSLeafQueue#getFairShare() returns > 2) When a queue's weight is nonzero, FSLeafQueue#getFairShare() returns > In case 1), that means no container ever gets allocated for an AM because from the viewpoint of the RM, there is never any headroom to allocate a container on that queue. > For example, we have a pool with the following weights: > - root.dev 0.0 > - root.product 1.0 > The root.dev is a best effort pool and should only get resources if root.product is not running. In our tests, with no jobs running under root.product, jobs started in root.dev queue stay stuck in ACCEPT phase and never start. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org For additional commands, e-mail: yarn-issues-help@hadoop.apache.org