hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jian He (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-2721) Race condition: ZKRMStateStore retry logic may throw NodeExist exception
Date Tue, 21 Oct 2014 20:47:34 GMT

    [ https://issues.apache.org/jira/browse/YARN-2721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14179067#comment-14179067
] 

Jian He commented on YARN-2721:
-------------------------------

Curator should handle the retry properly which is addressed in YARN-2716.
As a temporary fix, we can simply ignore the potential NodeExist exception for now. Creating
a patch. 

> Race condition: ZKRMStateStore retry logic may throw NodeExist exception 
> -------------------------------------------------------------------------
>
>                 Key: YARN-2721
>                 URL: https://issues.apache.org/jira/browse/YARN-2721
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Jian He
>            Assignee: Jian He
>             Fix For: 2.6.0
>
>
> Blindly retrying operations in zookeeper will not work for non-idempotent operations
(like create znode). The reason is that the client can do a create znode, but the response
may not be returned because the server can die or timeout. In case of retrying the create
znode, it will throw a NODE_EXISTS exception from the earlier create from the same session.
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message