From: jhung@apache.org
To: common-commits@hadoop.apache.org
Date: Sat, 15 Dec 2018 00:04:47 -0000
Subject: [16/50] [abbrv] hadoop git commit: YARN-7502. Nodemanager restart docs should describe nodemanager supervised property. Contributed by Suma Shivaprasad.

YARN-7502. Nodemanager restart docs should describe nodemanager supervised property. Contributed by Suma Shivaprasad.
(cherry picked from commit e50334513c781a10368e7353bec6674f89eb0a7b)

Project: http://git-wip-us.apache.org/repos/asf/hadoop/repo
Commit: http://git-wip-us.apache.org/repos/asf/hadoop/commit/0c39681b
Tree: http://git-wip-us.apache.org/repos/asf/hadoop/tree/0c39681b
Diff: http://git-wip-us.apache.org/repos/asf/hadoop/diff/0c39681b

Branch: refs/heads/branch-2-jhung-test
Commit: 0c39681b2cc24723171c34676760a86ffefeda07
Parents: 03ff0f1
Author: Sunil G
Authored: Mon Oct 22 14:15:51 2018 +0530
Committer: Jonathan Hung
Committed: Fri Dec 14 16:03:55 2018 -0800

----------------------------------------------------------------------
 .../hadoop-yarn-site/src/site/markdown/NodeManager.md | 10 ++++++++--
 1 file changed, 8 insertions(+), 2 deletions(-)
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/hadoop/blob/0c39681b/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/markdown/NodeManager.md
----------------------------------------------------------------------
diff --git a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/markdown/NodeManager.md b/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/markdown/NodeManager.md
index 12201b9..18aeff4 100644
--- a/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/markdown/NodeManager.md
+++ b/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-site/src/site/markdown/NodeManager.md
@@ -77,13 +77,19 @@ Step 2. Configure a path to the local file-system directory where the NodeManag
 |:---- |:---- |
 | `yarn.nodemanager.recovery.dir` | The local filesystem directory in which the node manager will store state when recovery is enabled. The default value is set to `$hadoop.tmp.dir/yarn-nm-recovery`. |
 
-Step 3. Configure a valid RPC address for the NodeManager.
+Step 3: Enable NM supervision under recovery to prevent running containers from getting cleaned up when NM exits.
+
+| Property | Description |
+|:---- |:---- |
+| `yarn.nodemanager.recovery.supervised` | If enabled, NodeManager running will not try to cleanup containers as it exits with the assumption it will be immediately be restarted and recover containers The default value is set to 'false'. |
+
+Step 4. Configure a valid RPC address for the NodeManager.
 
 | Property | Description |
 |:---- |:---- |
 | `yarn.nodemanager.address` | Ephemeral ports (port 0, which is default) cannot be used for the NodeManager's RPC server specified via yarn.nodemanager.address as it can make NM use different ports before and after a restart. This will break any previously running clients that were communicating with the NM before restart. Explicitly setting yarn.nodemanager.address to an address with specific port number (for e.g 0.0.0.0:45454) is a precondition for enabling NM restart. |
 
-Step 4. Auxiliary services.
+Step 5. Auxiliary services.
 
 * NodeManagers in a YARN cluster can be configured to run auxiliary services. For a completely functional NM restart, YARN relies on any auxiliary service configured to also support recovery. This usually includes (1) avoiding usage of ephemeral ports so that previously running clients (in this case, usually containers) are not disrupted after restart and (2) having the auxiliary service itself support recoverability by reloading any previous state when NodeManager restarts and reinitializes the auxiliary service.
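
Note for readers of this patch: the settings it documents could be sanity-checked with something like the sketch below. This is only an illustration, not part of the commit and not a Hadoop tool. The three property names and the example address `0.0.0.0:45454` come from the patched NodeManager.md; the recovery directory path, the `YARN_SITE` fragment, and the helper names `load_properties` and `check_restart_settings` are assumptions made up for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical yarn-site.xml fragment. The property names are taken from the
# patched documentation; the directory path and port are example values only.
YARN_SITE = """<configuration>
  <property>
    <name>yarn.nodemanager.recovery.dir</name>
    <value>/var/lib/hadoop-yarn/nm-recovery</value>
  </property>
  <property>
    <name>yarn.nodemanager.recovery.supervised</name>
    <value>true</value>
  </property>
  <property>
    <name>yarn.nodemanager.address</name>
    <value>0.0.0.0:45454</value>
  </property>
</configuration>"""


def load_properties(xml_text):
    """Return a dict of name -> value from a Hadoop-style configuration document."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.findall("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def check_restart_settings(props):
    """Return a list of findings about the NM restart settings (hypothetical helper)."""
    findings = []

    # Step 4 of the docs: an ephemeral port (port 0, the default) cannot be used.
    address = props.get("yarn.nodemanager.address", "")
    _host, sep, port = address.rpartition(":")
    if not sep or not port.isdigit() or int(port) == 0:
        findings.append("yarn.nodemanager.address needs an explicit non-zero port")

    # Step 3 of the docs: supervision defaults to 'false' when unset.
    supervised = props.get("yarn.nodemanager.recovery.supervised", "false")
    if supervised.lower() != "true":
        findings.append("yarn.nodemanager.recovery.supervised is not enabled")

    return findings


if __name__ == "__main__":
    for finding in check_restart_settings(load_properties(YARN_SITE)):
        print(finding)
```

The supervised flag is optional per the documentation, so the second finding is informational rather than an error; whether to enable it depends on whether the NodeManager runs under a supervisor that restarts it immediately.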