hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-1386) NodeManager mistakenly loses resources and relocalizes them
Date Wed, 13 Nov 2013 13:49:37 GMT

    [ https://issues.apache.org/jira/browse/YARN-1386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13821347#comment-13821347
] 

Hudson commented on YARN-1386:
------------------------------

SUCCESS: Integrated in Hadoop-Hdfs-trunk #1581 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1581/])
YARN-1386. NodeManager mistakenly loses resources and relocalizes them (Jason Lowe via jeagles)
(jeagles: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1541375)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/ContainerLocalizer.java
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/TestContainerLocalizer.java


> NodeManager mistakenly loses resources and relocalizes them
> -----------------------------------------------------------
>
>                 Key: YARN-1386
>                 URL: https://issues.apache.org/jira/browse/YARN-1386
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 0.23.10, 2.2.0
>            Reporter: Jason Lowe
>            Assignee: Jason Lowe
>            Priority: Critical
>             Fix For: 3.0.0, 2.3.0, 0.23.10
>
>         Attachments: YARN-1386.patch, YARN-1386.patch
>
>
> When a local resource that should already be present is requested again, the nodemanager
checks to see if it still present.  However the method it uses to check for presence is via
File.exists() as the user of the nodemanager process. If the resource was a private resource
localized for another user, it will be localized to a location that is not accessible by the
nodemanager user.  Therefore File.exists() returns false, the nodemanager mistakenly believes
the resource is no longer available, and it proceeds to localize it over and over.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message