Date: Tue, 30 Oct 2012 11:20:12 +0000 (UTC)
From: "xiajunluan (JIRA)"
To: yarn-dev@hadoop.apache.org
Message-ID: <1718249553.44155.1351596012589.JavaMail.jiratomcat@arcas>
Subject: [jira] [Created] (YARN-190) Issues when running distributedShell example in hadoop-2.0.1-alpha cluster

xiajunluan created YARN-190:
-------------------------------

             Summary: Issues when running distributedShell example in hadoop-2.0.1-alpha cluster
                 Key: YARN-190
                 URL: https://issues.apache.org/jira/browse/YARN-190
             Project: Hadoop YARN
          Issue Type: Bug
          Components: applications/distributed-shell
    Affects Versions: 2.0.1-alpha
         Environment: ubuntu 11.04
            Reporter: xiajunluan
            Priority: Critical

I have successfully run the distributedShell example on a single node with hadoop-2.0.1-alpha deployed.
But when I run distributedShell in a cluster environment, it sometimes works and sometimes fails. My configuration is as follows:

A: NameNode, ResourceManager
B: DataNode, NodeManager
C: DataNode, NodeManager

I run distributedShell with the command:

    ./bin/hadoop jar share/hadoop/mapreduce/hadoop-yarn-applications-distributedshell-2.0.1-alpha.jar org.apache.hadoop.yarn.applications.distributedshell.Client -jar share/hadoop/mapreduce/hadoop-yarn-applications-distributedshell-2.0.1-alpha.jar -shell_command whoami -debug

When the application master is launched, it creates a container to run the shell command "whoami". The application master runs on node B or C, and the container is also placed randomly on B or C. If the application master and the shell command container run on the same node (for example, both on node B), the command above succeeds. But if they run on different nodes, that is, if the application master launches successfully on node B and creates a container that runs on node C, I receive the error message:

……
12/10/29 19:18:02 INFO distributedshell.Client: Application did finished unsuccessfully. YarnState=FINISHED, DSFinalStatus=FAILED. Breaking monitoring loop
12/10/29 19:18:02 ERROR distributedshell.Client: Application failed to complete successfully
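
To make the reported pattern easier to check against further runs, here is a minimal Python sketch that enumerates the four possible placements of the application master and the shell container on nodes B and C and labels each with the outcome described above. The names `WORKER_NODES`, `reported_outcome`, and `placement_table` are hypothetical and exist only for illustration. The function encodes the behaviour observed in this report, not how YARN schedules containers or why the cross-node case fails.

```python
from itertools import product

# Nodes running a NodeManager in the reported cluster.
# Node A (NameNode, ResourceManager) hosts no containers in this setup.
WORKER_NODES = ["B", "C"]


def reported_outcome(am_node: str, container_node: str) -> str:
    """Return the outcome observed in the report for a given placement.

    This only restates the observation (same node succeeds, different
    nodes fail); it is not a model of YARN scheduling.
    """
    return "SUCCEEDED" if am_node == container_node else "FAILED"


def placement_table():
    """List every (AM node, container node, observed outcome) combination."""
    return [
        (am, container, reported_outcome(am, container))
        for am, container in product(WORKER_NODES, repeat=2)
    ]


if __name__ == "__main__":
    for am, container, outcome in placement_table():
        print(f"AM on {am}, container on {container}: {outcome}")
```

If placement really is uniform and independent, which is an assumption and not something measured here, half of the placements land on different nodes, which would be consistent with the "sometimes works, sometimes fails" behaviour.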