hadoop-yarn-issues mailing list archives

From "Bikas Saha (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-230) Make changes for RM restart phase 1
Date Thu, 13 Dec 2012 18:10:14 GMT

    [ https://issues.apache.org/jira/browse/YARN-230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13531255#comment-13531255 ]

Bikas Saha commented on YARN-230:
---------------------------------

I think the NullRMStateStore is a coding convenience rather than a user-visible artifact.
Having the recovery.enabled config is a clearer way of communicating that state storage is
really on. Also, following your comments in YARN-231, enabling the state store should enable a
real, valid store (e.g. the FileSystemStore on the default FileSystem, instead of a
NullStore). Doing this by modifying the store class impl config is not that intuitive IMO,
so I am in favor of leaving the config as is.
I had explicitly decided to return null in NullStore.loadState because I don't want the RM
to recover from an empty state and start responding to clients in case the NullStore has been
inadvertently made the real store impl during actual recovery.
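
To make the reasoning above concrete, here is a minimal, self-contained sketch. It does not use the actual YARN classes: RMState, StateStore, InMemoryStore, chooseStore and recover are hypothetical stand-ins, and failing with an exception when loadState returns null is an assumption about how a caller might react, not something taken from the patch.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified stand-ins for illustration; not the YARN API.
class RMState {
    final Map<String, String> applications = new HashMap<>();
}

interface StateStore {
    RMState loadState();
}

// No-op store: a coding convenience for when recovery is disabled.
class NullStore implements StateStore {
    @Override
    public RMState loadState() {
        // Deliberately null, not an empty RMState, so that a recovery
        // attempt against this store cannot look like a successful
        // recovery of "no applications".
        return null;
    }
}

// Stand-in for a real store (assumption: an in-memory map is enough here).
class InMemoryStore implements StateStore {
    private final RMState state = new RMState();

    void storeApplication(String appId, String data) {
        state.applications.put(appId, data);
    }

    @Override
    public RMState loadState() {
        return state;
    }
}

class RecoverySketch {
    // A single boolean flag (mirroring the recovery.enabled idea) decides
    // whether a real store is used; users never pick the null store by name.
    static StateStore chooseStore(boolean recoveryEnabled, StateStore realStore) {
        return recoveryEnabled ? realStore : new NullStore();
    }

    // Assumed caller behaviour: refuse to proceed when no state was loaded.
    static RMState recover(StateStore store) {
        RMState state = store.loadState();
        if (state == null) {
            throw new IllegalStateException(
                "State store returned no state; refusing to recover");
        }
        return state;
    }
}
```

With this shape, a misconfiguration that leaves the null store in place during recovery surfaces as an error instead of a resource manager that silently starts serving clients with no applications.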
                
> Make changes for RM restart phase 1
> -----------------------------------
>
>                 Key: YARN-230
>                 URL: https://issues.apache.org/jira/browse/YARN-230
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Bikas Saha
>            Assignee: Bikas Saha
>         Attachments: PB-impl.patch, Recovery.patch, Store.patch, Test.patch, YARN-230.1.patch, YARN-230.4.patch, YARN-230.5.patch
>
>
> As described in YARN-128, phase 1 of RM restart puts in place mechanisms to save application
> state and read it back after restart. Upon restart, the NMs are asked to reboot and the
> previously running AMs are restarted.
> After this is done, RM HA and work-preserving restart can continue in parallel. For more
> details, please refer to the design document in YARN-128.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
