hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-1274) LCE fails to run containers that don't have resources to localize
Date Sun, 06 Oct 2013 14:15:52 GMT

    [ https://issues.apache.org/jira/browse/YARN-1274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13787613#comment-13787613
] 

Hudson commented on YARN-1274:
------------------------------

FAILURE: Integrated in Hadoop-Mapreduce-trunk #1570 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1570/])
YARN-1274. Fixed NodeManager's LinuxContainerExecutor to create user, app-dir and log-dirs
correctly even when there are no resources to localize for the container. Contributed by Siddharth
Seth. (vinodkv: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1529555)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/impl/container-executor.c


> LCE fails to run containers that don't have resources to localize
> -----------------------------------------------------------------
>
>                 Key: YARN-1274
>                 URL: https://issues.apache.org/jira/browse/YARN-1274
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.1.1-beta
>            Reporter: Alejandro Abdelnur
>            Assignee: Siddharth Seth
>            Priority: Blocker
>             Fix For: 2.1.2-beta
>
>         Attachments: YARN-1274.1.txt, YARN-1274.trunk.1.txt, YARN-1274.trunk.2.txt
>
>
> LCE container launch assumes the usercache/USER directory exists and it is owned by the
user running the container process.
> But the directory is created only if there are resources to localize by the LCE localization
command, if there are not resourcdes to localize, LCE localization never executes and launching
fails reporting 255 exit code and the NM logs have something like:
> {code}
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor:
main : command provided 1
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor:
main : user is llama
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor:
Can't create directory llama in /yarn/nm/usercache/llama/appcache/application_1380853306301_0004/container_1380853306301_0004_01_000004
- Permission denied
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message