hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Omkar Vinit Joshi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-1321) NMTokenCache is a a singleton, prevents multiple AMs running in a single JVM to work correctly
Date Mon, 21 Oct 2013 21:23:47 GMT

    [ https://issues.apache.org/jira/browse/YARN-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13801095#comment-13801095
] 

Omkar Vinit Joshi commented on YARN-1321:
-----------------------------------------

bq. Change containsNMToken to containsToken and removeNMToken to removeToken for consistency
with getToken and setToken? Also, should setToken not be putToken?
can you please make it consistent for all apis xxxNMToken() ?

Can you please add a test case for the multi AM use case?

> NMTokenCache is a a singleton, prevents multiple AMs running in a single JVM to work
correctly
> ----------------------------------------------------------------------------------------------
>
>                 Key: YARN-1321
>                 URL: https://issues.apache.org/jira/browse/YARN-1321
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: client
>    Affects Versions: 2.2.0
>            Reporter: Alejandro Abdelnur
>            Assignee: Alejandro Abdelnur
>            Priority: Blocker
>             Fix For: 2.2.1
>
>         Attachments: YARN-1321.patch, YARN-1321.patch, YARN-1321.patch
>
>
> NMTokenCache is a singleton. Because of this, if running multiple AMs in a single JVM
NMTokens for the same node from different AMs step on each other and starting containers fail
due to mismatch tokens.
> The error observed in the client side is something like:
> {code}
> ERROR org.apache.hadoop.security.UserGroupInformation: PriviledgedActionException as:llama
(auth:PROXY) via llama (auth:SIMPLE) cause:org.apache.hadoop.yarn.exceptions.YarnException:
Unauthorized request to start container. 
> NMToken for application attempt : appattempt_1382038445650_0002_000001 was used for starting
container with container token issued for application attempt : appattempt_1382038445650_0001_000001
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message