hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Devaraj K (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-41) The RM should handle the graceful shutdown of the NM.
Date Thu, 14 May 2015 16:02:02 GMT

    [ https://issues.apache.org/jira/browse/YARN-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14543929#comment-14543929
] 

Devaraj K commented on YARN-41:
-------------------------------

|| NM Graceful Shutdown||NM Recovery Enabled\\yarn.nodemanager.recovery.enabled=true||NM is
Running Under Supervision\\yarn.nodemanager.recovery.supervised=true||*NM Unregister*||
|Y		   |N		    | Y		       || *Y*	     ||
|Y		   |N		    | N	 	       || *Y*	     ||
|Y		   |Y		    | Y		       || *N*	     ||
|Y		   |Y		    | N		       || *N*	     ||
|N		   |Y/N		    | Y/N	       || *N*	     ||

\\ 
\\
* If the NM shutdown is graceful and NM recovery is not enabled then NM unregisters with RM.
Here, irrespective of NM is under supervision or not if the NM recovery is not enabled then
NM will unregister with RM.
* If the NM shutdown is graceful and NM recovery is enabled then NM will not unregister with
RM.
* If the NM gets killed abruptly(using kill -9 nm-pid) then NM will not unregister with RM
as the process itself will not be available to notify anything.

As per my understanding I assumed here that NM is under supervision enabled only when the
NM recovery is enabled. Please correct me If I am wrong.


Please give your comments, I will update the patch accordingly if there are no issues on this.

> The RM should handle the graceful shutdown of the NM.
> -----------------------------------------------------
>
>                 Key: YARN-41
>                 URL: https://issues.apache.org/jira/browse/YARN-41
>             Project: Hadoop YARN
>          Issue Type: New Feature
>          Components: nodemanager, resourcemanager
>            Reporter: Ravi Teja Ch N V
>            Assignee: Devaraj K
>              Labels: BB2015-05-TBR
>         Attachments: MAPREDUCE-3494.1.patch, MAPREDUCE-3494.2.patch, MAPREDUCE-3494.patch,
YARN-41-1.patch, YARN-41-2.patch, YARN-41-3.patch, YARN-41-4.patch, YARN-41.patch
>
>
> Instead of waiting for the NM expiry, RM should remove and handle the NM, which is shutdown
gracefully.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message