From: John Lilley
To: "user@hadoop.apache.org"
Subject: RE: Getting error message from AM container launch
Date: Wed, 26 Mar 2014 22:41:09 +0000
Message-ID: <869970D71E26D7498BDAC4E1CA92226B86EA8C15@MBX021-E3-NJ-2.exch021.domain.local>
In-Reply-To: <869970D71E26D7498BDAC4E1CA92226B86EA8BCB@MBX021-E3-NJ-2.exch021.domain.local>
Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Reply-To: user@hadoop.apache.org

We do have a fairly long container command-line. Not huge, around 200 characters.

John

From: John Lilley [mailto:john.lilley@redpoint.net]
Sent: Wednesday, March 26, 2014 4:38 PM
To: user@hadoop.apache.org
Subject: Getting error message from AM container launch

Running a non-MapReduce YARN application, one of the containers launched by the AM is failing with an error message I've never seen. Any ideas? I'm not sure who exactly is running "nice" or why its argument list would be too long.

Thanks
john

Container for appattempt_1395755163053_0030_000001 exited with exitCode: 0 due to: Exception from container-launch:
java.io.IOException: Cannot run program ""nice"" (in directory ""/ephemeral02/hadoop/yarn/local/usercache/SYSTEM/appcache/application_1395755163053_0030/container_1395755163053_0030_01_000001""): java.io.IOException: error=7, Argument list too long
        at java.lang.ProcessBuilder.start(ProcessBuilder.java:460)
        at org.apache.hadoop.util.Shell.runCommand(Shell.java:407)
        at org.apache.hadoop.util.Shell.run(Shell.java:379)
        at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:589)
        at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:195)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:283)
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:79)
        at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
        at java.util.concurrent.FutureTask.run(FutureTask.java:138)
        at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
        at java.lang.Thread.run(Thread.java:662)
Caused by: java.io.IOException: java.io.IOException: error=7, Argument list too long
        at java.lang.UNIXProcess.<init>(UNIXProcess.java:148)
        at java.lang.ProcessImpl.start(ProcessImpl.java:65)
        at java.lang.ProcessBuilder.start(ProcessBuilder.java:453)
        ... 11 more