hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhijie Shen (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-1430) InvalidStateTransition exceptions are ignored in state machines
Date Thu, 21 Nov 2013 16:51:35 GMT

    [ https://issues.apache.org/jira/browse/YARN-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13829087#comment-13829087
] 

Zhijie Shen commented on YARN-1430:
-----------------------------------

Personally, I think ignoring InvalidStateTransition is not bad idea, though it may hide some
existing bugs from us. However, it particularly results in isolation, preventing RM/NM from
be crashed by a single race condition. To be more accurate, if InvalidStateTransition happens
to one application, this application will not be completed, crash or anything, while other
applications that are running simultaneously should not be affected.

> InvalidStateTransition exceptions are ignored in state machines
> ---------------------------------------------------------------
>
>                 Key: YARN-1430
>                 URL: https://issues.apache.org/jira/browse/YARN-1430
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Omkar Vinit Joshi
>            Assignee: Omkar Vinit Joshi
>
> We have all state machines ignoring InvalidStateTransitions. These exceptions will get
logged but will not crash the RM / NM. We definitely should crash it as they move the system
into some invalid / unacceptable state.
> * Places where we hide this exception :-
> ** JobImpl
> ** TaskAttemptImpl
> ** TaskImpl
> ** NMClientAsyncImpl
> ** ApplicationImpl
> ** ContainerImpl
> ** LocalizedResource
> ** RMAppAttemptImpl
> ** RMAppImpl
> ** RMContainerImpl
> ** RMNodeImpl
> thoughts?



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message