From: anil gupta
Date: Tue, 31 Jul 2012 11:26:42 -0700
Subject: Re: YARN Pi example job stuck at 0%(No MR tasks are started by ResourceManager)
To: common-user@hadoop.apache.org

Hi Harsh and Others,

I was able to run the job when I log in as user "hdfs". However, it fails
if I run it as "root". I had suspected earlier that this was the problem,
and it turned out to be true.

Thanks,
Anil Gupta

On Mon, Jul 30, 2012 at 9:21 PM, abhiTowson cal wrote:

> Hi anil,
>
> I was trying several things. I didn't have hadoop-env.sh, so I created it.
>
> Regards
> Abhishek
>
> On Mon, Jul 30, 2012 at 11:51 PM, anil gupta wrote:
> > That's pretty interesting!! How did you figure out that you needed to
> > add that property? I'm just trying to understand how adding that
> > property fixed the issue.
> >
> > On Mon, Jul 30, 2012 at 8:12 PM, abhiTowson cal wrote:
> >
> >> Hi anil,
> >>
> >> Adding the property resolved the issue for me, and I also made this change:
> >>
> >> vim hadoop-env.sh
> >>
> >> export JAVA_HOME=/usr/lib/java-1.6.0/jdk1.6.0_33
> >> if [ "$JAVA_HOME" != "" ]; then
> >>   #echo "run java in $JAVA_HOME"
> >>   JAVA_HOME=$JAVA_HOME
> >> fi
> >>
> >> if [ "$JAVA_HOME" = "" ]; then
> >>   echo "Error: JAVA_HOME is not set."
> >>   exit 1
> >> fi
> >>
> >> JAVA=$JAVA_HOME/bin/java
> >> JAVA_HEAP_MAX=-Xmx1000m
> >>
> >> Regards
> >> Abhishek
> >>
> >>
> >> On Mon, Jul 30, 2012 at 10:47 PM, anil gupta wrote:
> >> > Hi Abhishek,
> >> >
> >> > Did you mean that adding yarn.resourcemanager.resource-tracker.address
> >> > along with yarn.log-aggregation-enable to my configuration will resolve
> >> > the problem in which the map-reduce job fails at 0% with the following
> >> > error? The web page at
> >> >
> >> > http://data-node:8042/node/containerlogs/container_1343687008058_0003_01_000001/root
> >> >
> >> > says:
> >> >
> >> > Failed redirect for container_1343687008058_0003_01_000001
> >> > Failed while trying to construct the redirect url to the log server. Log
> >> > Server url may not be configured. Unknown container. Container either has
> >> > not started or has already completed or doesn't belong to this node at all.
> >> >
> >> > Please let me know.
> >> >
> >> > Thanks,
> >> > Anil Gupta
> >> >
> >> > On Mon, Jul 30, 2012 at 7:30 PM, abhiTowson cal wrote:
> >> >
> >> >> hi anil,
> >> >>
> >> >> Adding this helped me resolve the issue:
> >> >> yarn.resourcemanager.resource-tracker.address
> >> >>
> >> >> Regards
> >> >> Abhishek
> >> >>
> >> >> On Mon, Jul 30, 2012 at 7:56 PM, anil gupta wrote:
> >> >> > Hi Rahul,
> >> >> >
> >> >> > Thanks for your response. I can certainly set
> >> >> > yarn.log-aggregation-enable to true. But after enabling it, what manual
> >> >> > steps will I have to take to run jobs? Could you please elaborate?
> >> >> >
> >> >> > Thanks,
> >> >> > Anil
> >> >> >
> >> >> > On Mon, Jul 30, 2012 at 4:26 PM, Rahul Jain wrote:
> >> >> >
> >> >> >> The inability to look at map-reduce logs for failed jobs is due to a
> >> >> >> number of open issues in YARN; see my recent comment here:
> >> >> >>
> >> >> >> https://issues.apache.org/jira/browse/MAPREDUCE-4428?focusedCommentId=13412995&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13412995
> >> >> >>
> >> >> >> You can work around this by enabling log aggregation and manually
> >> >> >> copying job logs from the HDFS log location. Of course, that is a
> >> >> >> painful way until the YARN log collection and history bugs are
> >> >> >> resolved in an upcoming release.
> >> >> >>
> >> >> >> -Rahul
> >> >> >>
> >> >> >>
> >> >> >> > 12/07/27 09:38:27 INFO mapred.ResourceMgrDelegate: Submitted application application_1343365114818_0002 to ResourceManager at ihub-an-l1/172.31.192.151:8040
> >> >> >> > 12/07/27 09:38:27 INFO mapreduce.Job: The url to track the job: http://ihub-an-l1:9999/proxy/application_1343365114818_0002/
> >> >> >> > 12/07/27 09:38:27 INFO mapreduce.Job: Running job: job_1343365114818_0002
> >> >> >> >
> >> >> >> > No Map-Reduce tasks are started by the cluster. I don't see any
> >> >> >> > errors anywhere in the application. Please help me in resolving
> >> >> >> > this problem.
> >> >> >> >
> >> >> >> > Thanks,
> >> >> >> > Anil Gupta

--
Thanks & Regards,
Anil Gupta
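
For anyone landing on this thread later, the three practical points in it (the JAVA_HOME check in hadoop-env.sh, the job running as "hdfs" but not as "root", and Rahul's workaround of copying aggregated logs out of HDFS by hand) can be sketched roughly as below. This is only a sketch under assumptions, not something confirmed in the thread: the helper names check_java_home, require_non_root and aggregated_log_dir are made up for illustration, and the /tmp/logs/<user>/logs/<appId> layout assumes the defaults of yarn.nodemanager.remote-app-log-dir and its suffix, which your cluster may override. The thread also does not establish why the job fails as root, so the root check is just a guard, not a fix.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; names and default paths are assumptions.

# Fail early when JAVA_HOME is unset or has no executable bin/java.
check_java_home() {
  if [ -z "${JAVA_HOME:-}" ]; then
    echo "Error: JAVA_HOME is not set." >&2
    return 1
  fi
  if [ ! -x "$JAVA_HOME/bin/java" ]; then
    echo "Error: $JAVA_HOME/bin/java is not executable." >&2
    return 1
  fi
  return 0
}

# Refuse to continue as root, since the thread reports the job ran as "hdfs"
# but not as "root". $1 = name of the submitting user.
require_non_root() {
  if [ "$1" = "root" ]; then
    echo "Refusing to submit as root; try another user such as hdfs." >&2
    return 1
  fi
  return 0
}

# Print the HDFS directory where aggregated logs for one application are
# assumed to land. $1 = submitting user, $2 = application id,
# $3 = remote log root (optional, assumed default /tmp/logs).
aggregated_log_dir() {
  printf '%s/%s/logs/%s\n' "${3:-/tmp/logs}" "$1" "$2"
}

# Possible usage on a cluster node (not run here):
#   check_java_home && require_non_root "$(id -un)" || exit 1
#   hadoop fs -get "$(aggregated_log_dir hdfs application_1343365114818_0002)" ./job-logs
```

The helpers only validate and print; the hadoop fs -get call is left commented so that the log path can be checked against your own yarn-site.xml before anything is copied.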