Date: Thu, 29 Oct 2015 17:08:27 +0000 (UTC)
From: "Wangda Tan (JIRA)"
To: yarn-issues@hadoop.apache.org
Subject: [jira] [Commented] (YARN-4287) Capacity Scheduler: Rack Locality improvement

[ https://issues.apache.org/jira/browse/YARN-4287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980816#comment-14980816 ]

Wangda Tan commented on YARN-4287:
----------------------------------

I think it may be better not to deprecate the original option; we can support both forms in the same option. It is like setting an element's size in HTML, where you can give either a pixel value or a percentage of the parent's width/height.
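To make the "both forms in one option" suggestion concrete, here is a minimal sketch of how a single setting could accept either an absolute count or a percentage. Everything in it is an assumption for illustration: the class name `DelaySetting`, the `%` suffix convention, and the idea that a percentage is resolved against a caller-supplied base (for example the number of nodes in the cluster). It is not taken from any of the attached patches.

```java
/** Hypothetical holder for a delay given as an absolute count ("40") or a percentage ("50%"). */
final class DelaySetting {
    private final double value;
    private final boolean percentage;

    private DelaySetting(double value, boolean percentage) {
        this.value = value;
        this.percentage = percentage;
    }

    /** Parses "40" as an absolute count and "50%" as a percentage of some base. */
    static DelaySetting parse(String raw) {
        String s = raw.trim();
        if (s.endsWith("%")) {
            double pct = Double.parseDouble(s.substring(0, s.length() - 1).trim());
            if (pct < 0) {
                throw new IllegalArgumentException("negative percentage: " + raw);
            }
            return new DelaySetting(pct, true);
        }
        int count = Integer.parseInt(s);
        if (count < 0) {
            throw new IllegalArgumentException("negative delay: " + raw);
        }
        return new DelaySetting(count, false);
    }

    boolean isPercentage() {
        return percentage;
    }

    /**
     * Returns the effective delay. An absolute value is returned unchanged;
     * a percentage is applied to the base (assumed here to be a node count)
     * and rounded up.
     */
    int resolve(int base) {
        return percentage ? (int) Math.ceil(base * value / 100.0) : (int) value;
    }
}
```

With this shape an existing absolute value such as `"40"` keeps its meaning, while `"50%"` becomes relative to whatever base the scheduler passes in, so the original option would not need to be deprecated.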
> Capacity Scheduler: Rack Locality improvement
> ---------------------------------------------
>
> Key: YARN-4287
> URL: https://issues.apache.org/jira/browse/YARN-4287
> Project: Hadoop YARN
> Issue Type: Improvement
> Components: capacityscheduler
> Affects Versions: 2.7.1
> Reporter: Nathan Roberts
> Assignee: Nathan Roberts
> Attachments: YARN-4287-minimal.patch, YARN-4287-v2.patch, YARN-4287-v3.patch, YARN-4287-v4.patch, YARN-4287.patch
>
>
> YARN-4189 does an excellent job describing the issues with the current delay scheduling algorithms within the capacity scheduler. The design proposal also seems like a good direction.
> This jira proposes a simple interim solution to the key issue we've been experiencing on a regular basis:
> - rackLocal assignments trickle out due to nodeLocalityDelay. This can have significant impact on things like CombineFileInputFormat which targets very specific nodes in its split calculations.
> I'm not sure when YARN-4189 will become reality so I thought a simple interim patch might make sense. The basic idea is simple:
> 1) Separate delays for rackLocal, and OffSwitch (today there is only 1)
> 2) When we're getting rackLocal assignments, subsequent rackLocal assignments should not be delayed
> Patch will be uploaded shortly. No big deal if the consensus is to go straight to YARN-4189.
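The two points in the quoted description can be sketched roughly as follows. This is only an illustration of the idea, not the code in the attached patches: the class `RackLocalityDelaySketch`, its fields, and the choice to reset the missed-opportunity counter on any node-local or rack-local assignment are all assumptions made for the example.

```java
/** Illustrative only: separate rack-local and off-switch delays, plus a rack-local "streak". */
final class RackLocalityDelaySketch {
    enum Locality { NODE_LOCAL, RACK_LOCAL, OFF_SWITCH }

    private final int rackLocalDelay;   // missed opportunities before rack-local is allowed
    private final int offSwitchDelay;   // missed opportunities before off-switch is allowed
    private int missedOpportunities = 0;
    private boolean lastWasRackLocal = false;

    RackLocalityDelaySketch(int rackLocalDelay, int offSwitchDelay) {
        this.rackLocalDelay = rackLocalDelay;
        this.offSwitchDelay = offSwitchDelay;
    }

    /** Decides whether an assignment of the given locality may go ahead now. */
    boolean canAssign(Locality type) {
        switch (type) {
            case NODE_LOCAL:
                return true;
            case RACK_LOCAL:
                // Point 2: once rack-local assignments are flowing, do not delay the next one.
                return lastWasRackLocal || missedOpportunities >= rackLocalDelay;
            default:
                // Point 1: off-switch has its own, separate threshold.
                return missedOpportunities >= offSwitchDelay;
        }
    }

    /** Called when a scheduling opportunity passes without an assignment. */
    void recordMiss() {
        missedOpportunities++;
    }

    /** Called after a container has been assigned with the given locality. */
    void recordAssignment(Locality type) {
        lastWasRackLocal = (type == Locality.RACK_LOCAL);
        if (type != Locality.OFF_SWITCH) {
            missedOpportunities = 0;    // assumption: local assignments reset the counter
        }
    }
}
```

In this sketch the first rack-local assignment still waits for `rackLocalDelay` missed opportunities, but later ones go through immediately until a node-local assignment ends the streak, which is one way to avoid rack-local assignments trickling out.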