Return-Path: X-Original-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-yarn-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7227D18BF3 for ; Wed, 13 Jan 2016 22:58:40 +0000 (UTC) Received: (qmail 82660 invoked by uid 500); 13 Jan 2016 22:58:40 -0000 Delivered-To: apmail-hadoop-yarn-issues-archive@hadoop.apache.org Received: (qmail 82616 invoked by uid 500); 13 Jan 2016 22:58:40 -0000 Mailing-List: contact yarn-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: yarn-issues@hadoop.apache.org Delivered-To: mailing list yarn-issues@hadoop.apache.org Received: (qmail 82149 invoked by uid 99); 13 Jan 2016 22:58:40 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Jan 2016 22:58:40 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id D62EF2C1F64 for ; Wed, 13 Jan 2016 22:58:39 +0000 (UTC) Date: Wed, 13 Jan 2016 22:58:39 +0000 (UTC) From: "Jason Lowe (JIRA)" To: yarn-issues@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (YARN-4311) Removing nodes from include and exclude lists will not remove them from decommissioned nodes list MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/YARN-4311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097218#comment-15097218 ] Jason Lowe commented on YARN-4311: ---------------------------------- Thanks for updating the patch! I'd like to see the getTimestamp/setTimestamp methods be a bit more specific, like getInactiveTimestamp or getUntrackedTimestamp or something similar. Currently it sounds like a generic timestamp and therefore is unclear what the context is and when it should and should not be used. Similarly I think the property names are a bit too vague. Would like to see "inactive", "untracked" or some other adjective in the name to qualify when the properties apply. Nit: interval and timeout sound similar without qualification, e.g.: there's already expiry-interval properties that are really timeouts. Should interval be check-interval or something to distinguish it? And would it make sense to derive one from the other, similar to what is done for AM/NM expiry interval? (e.g: interval could be a fraction of timeout, potentially with a maximum interval cap of 10 minutes or something.) That's another way to eliminate the potential for confusion or misconfig, but it does make it a little more inflexible. Nit: There should be whitespace between the properties for readability and consistency with the rest of the file. Typically the property name and default are clustered together with gaps between different properties. Why does isUntrackedNode do a conf lookup and assign includesFile? It seems unrelated to what it's doing, and if it is related, why not also excludesFile? This gets called in a loop, potentially over many nodes. conf lookups can be a bit pricey, and we should avoid doing them if they're not needed. Time.monotonicNow should be used for the timestamps rather than System.currentTimeMillis to avoid problems when the system time is adjusted. The call to Timer.purge after Timer.cancel is redundant. There are no tasks on the timer after Timer.cancel is called. > Removing nodes from include and exclude lists will not remove them from decommissioned nodes list > ------------------------------------------------------------------------------------------------- > > Key: YARN-4311 > URL: https://issues.apache.org/jira/browse/YARN-4311 > Project: Hadoop YARN > Issue Type: Bug > Affects Versions: 2.6.1 > Reporter: Kuhu Shukla > Assignee: Kuhu Shukla > Attachments: YARN-4311-v1.patch, YARN-4311-v2.patch, YARN-4311-v3.patch, YARN-4311-v4.patch, YARN-4311-v5.patch, YARN-4311-v6.patch, YARN-4311-v7.patch > > > In order to fully forget about a node, removing the node from include and exclude list is not sufficient. The RM lists it under Decomm-ed nodes. The tricky part that [~jlowe] pointed out was the case when include lists are not used, in that case we don't want the nodes to fall off if they are not active. -- This message was sent by Atlassian JIRA (v6.3.4#6332)