Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5967A10CD5 for ; Thu, 6 Mar 2014 02:05:23 +0000 (UTC) Received: (qmail 13840 invoked by uid 500); 6 Mar 2014 02:05:14 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 13716 invoked by uid 500); 6 Mar 2014 02:05:13 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 13708 invoked by uid 99); 6 Mar 2014 02:05:13 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Mar 2014 02:05:13 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,NORMAL_HTTP_TO_IP,RCVD_IN_DNSWL_LOW,SPF_PASS,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of mshi@gopivotal.com designates 209.85.213.172 as permitted sender) Received: from [209.85.213.172] (HELO mail-ig0-f172.google.com) (209.85.213.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Mar 2014 02:05:08 +0000 Received: by mail-ig0-f172.google.com with SMTP id uq10so9429431igb.5 for ; Wed, 05 Mar 2014 18:04:47 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type; bh=zzpBxz5Xdk8i9Hqy9ykojrCWdVMUbx2tihQqeoYTr0Y=; b=PK79nEI8iLr/ZxsdiIhFmCwQrhMvPeY+q/YPt2teuaTrC8I2CGbl0e3PDD22wlifd7 I6Uqs3lNXRtH7IAlKMhHMd0GgWCk5RaZSG8GDnPntXdbc5REux1cUWwYTZ/YUeaCOd7w H4XDptYeLJDjqHzFxzFYDS+QXXh80c3FrgJapO4pL93C0EM16VJG2IbSkla/R+fNrkVn GvzW63rmFyIg/P01+PZDZ1nOODFUGGzxkhvZwV0pGzUonKuAzRvcKMGhLBj6cVzyyZl2 SLov4JxBbv481c41EM70xaRyIcWUcIiUAIuCGA7I9VBBBvscfbs85EEP4W7XN2AXHjwk J6wg== X-Gm-Message-State: ALoCoQl3U9/7+Jah2uN6jbiforbIJkM7FMHFvqUgncrSsBck+mFFynMufHA9aEE8wS1aABFaXMW5 X-Received: by 10.50.128.38 with SMTP id nl6mr12646696igb.37.1394071487838; Wed, 05 Mar 2014 18:04:47 -0800 (PST) MIME-Version: 1.0 Received: by 10.64.7.234 with HTTP; Wed, 5 Mar 2014 18:04:27 -0800 (PST) In-Reply-To: References: From: Mingjiang Shi Date: Thu, 6 Mar 2014 10:04:27 +0800 Message-ID: Subject: Re: App Master issue. To: user Content-Type: multipart/alternative; boundary=089e013a23b0d3126504f3e68fc0 X-Virus-Checked: Checked by ClamAV on apache.org --089e013a23b0d3126504f3e68fc0 Content-Type: text/plain; charset=ISO-8859-1 Sorry, it should be accessing http://:8042/conf to check the value of yarn.resourcemanager. scheduler.address on the node manager. On Thu, Mar 6, 2014 at 9:36 AM, Mingjiang Shi wrote: > Hi Sai, > A few questions: > 1. which version of hadoop are you using? yarn.resourcemanager.hostname is > a new configuration which is not available old versions. > 2. Does your yarn-site.xml contains > yarn.resourcemanager.scheduler.address? If yes, what's the value? > 3. or you could access http://:8088/conf to check the value > of yarn.resourcemanager.scheduler.address. > > > > On Thu, Mar 6, 2014 at 3:29 AM, Sai Prasanna wrote: > >> Hi, >> >> I have a five node cluster. One master and 4 slaves. Infact master also >> has a data node running. When ever app master is launched in the master >> node, simple wordcount program runs fine. But if it is launched in some >> slave nodes, the progress of the application gets hung. >> The problem is, though i have set the yarn.resourcemanager.hostname to >> the ip-address of the master, the slave connects only to the default, >> 0.0.0.0:8030. >> What could be the reason ??? >> >> I get the following message in the logs of app.master in web-UI. >> *"...Configuration: job.xml:an attempt to override final parameter: >> mapreduce.job.end-notification.max.retry.interval; Ignoring.* >> >> >> *2014-03-05 20:15:50,597 WARN [main] >> org.apache.hadoop.conf.Configuration: job.xml:an attempt to override final >> parameter: mapreduce.job.end-notification.max.attempts; Ignoring. >> 2014-03-05 20:15:50,603 INFO [main] org.apache.hadoop.yarn.client.RMProxy: >> Connecting to ResourceManager at /0.0.0.0:8030 >> 2014-03-05 20:15:56,632 INFO [main] >> org.apache.hadoop.ipc.Client: Retrying connect to server: >> 0.0.0.0/0.0.0.0:8030 . Already tried 0 >> time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, >> sleepTime=1000 MILLISECONDS)"* >> >> >> -- >> *Sai Prasanna. AN* >> *II M.Tech (CS), SSSIHL* >> >> >> *Entire water in the ocean can never sink a ship, Unless it gets inside. >> All the pressures of life can never hurt you, Unless you let them in.* >> > > > > -- > Cheers > -MJ > -- Cheers -MJ --089e013a23b0d3126504f3e68fc0 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Sor= ry, it should be accessing http://<node_manager_ip>:8042/conf to chec= k the value of yarn.resourcemanager.
scheduler.= address on the node manager.


On Thu, Mar 6, 2014 at 9:36 AM, Mingjiang Shi <mshi@gopivotal.com> wrote:
Hi Sai,
A few questions:
1. which version of hadoop are you using? yarn.resourcemanager.hostname is = a new configuration which is not available old versions.
2. Does your y= arn-site.xml contains yarn.resourcemanager.scheduler.address? If yes, what&= #39;s the value?
3. or you could access http://<resource_mgr>:8088/conf to check the v= alue of yarn.resourcemanager.scheduler.address.



On Thu, Mar 6, 2014 at 3:29 AM, Sai Pras= anna <ansaiprasanna@gmail.com> wrote:
Hi,

I have a five node cluster. One mas= ter and 4 slaves. Infact master also has a data node running. When ever app= master is launched in the master node, simple wordcount program runs fine.= But if it is launched in some slave nodes, the progress of the application= gets hung.=A0
The problem is, though i have set the yarn.resourcemanager.hostname to= the ip-address of the master, the slave connects only to the default, 0.0.0.0:8030.
What could be the reason ???

I get the following message in the logs of app.master i= n web-UI.
"...Configuration: jo= b.xml:an attempt to override final parameter: mapreduce.job.end-notificatio= n.max.retry.interval; =A0Ignoring.
2014-03-05 20:15= :50,597 WARN [main] org.apache.hadoop.conf.Configuration: job.xml:an attemp= t to override final parameter: mapreduce.job.end-notification.max.attempts;= =A0Ignoring.
2014-03-05 20:15:50,603 INFO [main] org.apache.hadoop.yarn.client.RMProxy: = Connecting to ResourceManager at /0.0.0.0:8030
2014-03-05 20:15:56,632 INFO [main] org.apache= .hadoop.ipc.Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 0 ti= me(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=3D10,= sleepTime=3D1000 MILLISECONDS)"

=A0

--
Sai Prasanna. AN
II M.Tech (CS), SSSIHL

Entire water in the oce= an can never sink a ship, Unless i= t gets inside.
All the pressures of life can never hurt = you, Unless you let them in.




--
Cheers
-= MJ



--
C= heers
-MJ
--089e013a23b0d3126504f3e68fc0--