hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jason Lowe (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (YARN-1386) NodeManager mistakenly loses resources and relocalizes them
Date Mon, 11 Nov 2013 22:18:17 GMT

     [ https://issues.apache.org/jira/browse/YARN-1386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Jason Lowe updated YARN-1386:

    Attachment: YARN-1386.patch

Patch to change the ContainerLocalizer so it creates the cache directories with 0710 (rwx--x---)
permissions so the nodemanager user can check for existence of files in the cache without
having to become that user.  This also matches the behavior of the DefaultContainerExecutor.

> NodeManager mistakenly loses resources and relocalizes them
> -----------------------------------------------------------
>                 Key: YARN-1386
>                 URL: https://issues.apache.org/jira/browse/YARN-1386
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 0.23.10, 2.2.0
>            Reporter: Jason Lowe
>            Priority: Critical
>         Attachments: YARN-1386.patch
> When a local resource that should already be present is requested again, the nodemanager
checks to see if it still present.  However the method it uses to check for presence is via
File.exists() as the user of the nodemanager process. If the resource was a private resource
localized for another user, it will be localized to a location that is not accessible by the
nodemanager user.  Therefore File.exists() returns false, the nodemanager mistakenly believes
the resource is no longer available, and it proceeds to localize it over and over.

This message was sent by Atlassian JIRA

View raw message