hadoop-yarn-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alejandro Abdelnur (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (YARN-1274) LCE fails to run containers that don't have resources to localize
Date Sat, 05 Oct 2013 05:16:50 GMT

    [ https://issues.apache.org/jira/browse/YARN-1274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13786942#comment-13786942
] 

Alejandro Abdelnur commented on YARN-1274:
------------------------------------------

[~sseth], now that you mention the log dirs, while debugging this with [~rvs] he noticed that
as well, I've forgot to mentioned here as that does not seem to stop things from working,
but we should fix that as well. 

> LCE fails to run containers that don't have resources to localize
> -----------------------------------------------------------------
>
>                 Key: YARN-1274
>                 URL: https://issues.apache.org/jira/browse/YARN-1274
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.1.1-beta
>            Reporter: Alejandro Abdelnur
>            Assignee: Siddharth Seth
>            Priority: Blocker
>
> LCE container launch assumes the usercache/USER directory exists and it is owned by the
user running the container process.
> But the directory is created only if there are resources to localize by the LCE localization
command, if there are not resourcdes to localize, LCE localization never executes and launching
fails reporting 255 exit code and the NM logs have something like:
> {code}
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor:
main : command provided 1
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor:
main : user is llama
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor:
Can't create directory llama in /yarn/nm/usercache/llama/appcache/application_1380853306301_0004/container_1380853306301_0004_01_000004
- Permission denied
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Mime
View raw message