Date: Sat, 5 Oct 2013 05:16:50 +0000 (UTC)
From: "Alejandro Abdelnur (JIRA)"
To: yarn-issues@hadoop.apache.org
Reply-To: yarn-issues@hadoop.apache.org
Subject: [jira] [Commented] (YARN-1274) LCE fails to run containers that don't have resources to localize

    [ https://issues.apache.org/jira/browse/YARN-1274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13786942#comment-13786942 ]

Alejandro Abdelnur commented on YARN-1274:
------------------------------------------

[~sseth], now that you mention the log dirs: [~rvs] noticed that as well while we were debugging this. I forgot to mention it here because it does not seem to stop things from working, but we should fix it too.

> LCE fails to run containers that don't have resources to localize
> -----------------------------------------------------------------
>
>                 Key: YARN-1274
>                 URL: https://issues.apache.org/jira/browse/YARN-1274
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: nodemanager
>    Affects Versions: 2.1.1-beta
>            Reporter: Alejandro Abdelnur
>            Assignee: Siddharth Seth
>            Priority: Blocker
>
> LCE container launch assumes that the usercache/USER directory exists and is owned by the user running the container process.
> But that directory is created only by the LCE localization command, which runs only when there are resources to localize. If there are no resources to localize, LCE localization never executes and the launch fails with exit code 255. The NM logs then show something like:
> {code}
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: main : command provided 1
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: main : user is llama
> 2013-10-04 14:07:56,425 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: Can't create directory llama in /yarn/nm/usercache/llama/appcache/application_1380853306301_0004/container_1380853306301_0004_01_000004 - Permission denied
> {code}

--
This message was sent by Atlassian JIRA
(v6.1#6144)
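
To illustrate the precondition described in the issue, here is a minimal sketch of an idempotent "create usercache/USER if missing" check that would run before launch whether or not anything was localized. It is not the actual YARN fix: the class name UserCacheDirSketch and the method ensureUserCacheDir are hypothetical, and the real LCE would also have to make the directory owned by the container user (via its native setuid executor), which plain Java running as the NM user cannot do and which this sketch does not attempt.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical illustration only; not part of the Hadoop code base.
public class UserCacheDirSketch {

    /**
     * Ensures <nmLocalDir>/usercache/<user> exists.
     * Returns true if the directory had to be created, false if it was already there.
     * Ownership is NOT changed here; that step is assumed to happen elsewhere.
     */
    static boolean ensureUserCacheDir(Path nmLocalDir, String user) throws IOException {
        Path userDir = nmLocalDir.resolve("usercache").resolve(user);
        boolean existed = Files.isDirectory(userDir);
        // createDirectories is a no-op when the directory already exists.
        Files.createDirectories(userDir);
        return !existed;
    }

    public static void main(String[] args) throws IOException {
        // A temporary directory stands in for an NM local dir such as /yarn/nm.
        Path nmLocalDir = Files.createTempDirectory("nm-local");
        System.out.println(ensureUserCacheDir(nmLocalDir, "llama")); // first call creates it
        System.out.println(ensureUserCacheDir(nmLocalDir, "llama")); // second call finds it
    }
}
```

The point of the sketch is only that the check is cheap and safe to repeat, so the launch path would not need to depend on localization having run first.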