Return-Path: X-Original-To: apmail-hadoop-common-user-archive@www.apache.org Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 46CEF109F0 for ; Wed, 5 Feb 2014 17:14:06 +0000 (UTC) Received: (qmail 479 invoked by uid 500); 5 Feb 2014 17:13:56 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 294 invoked by uid 500); 5 Feb 2014 17:13:54 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 280 invoked by uid 99); 5 Feb 2014 17:13:54 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Feb 2014 17:13:54 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of vinodkv@hortonworks.com designates 209.85.160.49 as permitted sender) Received: from [209.85.160.49] (HELO mail-pb0-f49.google.com) (209.85.160.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Feb 2014 17:13:47 +0000 Received: by mail-pb0-f49.google.com with SMTP id up15so617012pbc.22 for ; Wed, 05 Feb 2014 09:13:23 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:sender:from:message-id:mime-version:subject:date :references:to:in-reply-to:content-type; bh=2oQZs+QePn+U1k9oDwcGFp6s6QR6RclHWpBmrG7ooS0=; b=XBpBvzNWYDiKP/EDQ4It/MUFwDVr+m9/cX3PqoLblisPGtTjKtWuU7eR8xPCN9KJQQ nvPMc0RVzUeQnK8FPEuyX4ms/88m3JbDa/FtRE6u4cLGLqpsGJfHYTEeNWknKyyqLdCw GrVCHxDGhsuBM6/duZoMtIKRUo7fz8J4RL+ryu2uR+mBQ2Od+Z3SnGNPdOTOWJ04r5hZ opDQfWE01CWbHWPKH5pfWRaYPOlGUxvnOHqIVUuiflBk1gUZGTkrOEWAF4Vh7R2sgLXf nUtladTLoqIv2bdOuS1ia9oaLIRQPG0b5h4O1J1zOIXmfjR5CROpg/SxcOFBkFw5eEJy PB+A== X-Gm-Message-State: ALoCoQkU5mRLDYUR6Zog2ZEUMlADg5gv1VbTduAq7d4KipPDnJOzv073kQJFNQZFOJDNZNDyGtfv8dhNhGxxagXt5jhx7tXOkkHzPiff9yJIGMxXPNyOS6Y= X-Received: by 10.68.197.8 with SMTP id iq8mr3476472pbc.124.1391620403106; Wed, 05 Feb 2014 09:13:23 -0800 (PST) Received: from [10.11.2.242] ([192.175.27.2]) by mx.google.com with ESMTPSA id js7sm16918309pbc.35.2014.02.05.09.13.21 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Wed, 05 Feb 2014 09:13:21 -0800 (PST) Sender: Vinod Kumar Vavilapalli From: Vinod Kumar Vavilapalli Message-Id: Mime-Version: 1.0 (Mac OS X Mail 7.1 \(1827\)) Subject: Re: what happens to a client attempting to get a new app when the resource manager is already down Date: Wed, 5 Feb 2014 09:13:20 -0800 References: <1391183579.23486.YahooMailNeo@web121902.mail.ne1.yahoo.com> To: user@hadoop.apache.org, REYANE OUKPEDJO In-Reply-To: <1391183579.23486.YahooMailNeo@web121902.mail.ne1.yahoo.com> X-Mailer: Apple Mail (2.1827) Content-Type: multipart/mixed; boundary=e89a8ff1c428cc6d5c04f1abdfd6 X-Virus-Checked: Checked by ClamAV on apache.org --e89a8ff1c428cc6d5c04f1abdfd6 Content-Type: multipart/alternative; boundary="Apple-Mail=_FC1979EA-018F-4EB7-AD36-395C5847DBEF" --Apple-Mail=_FC1979EA-018F-4EB7-AD36-395C5847DBEF Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=ISO-8859-1 Is this on trunk or a released version? I think the default behavior (when RM HA is not enabled) shouldn't have cli= ent loop forever. Let me know and we can see if this needs fixing. Thanks, +vinod On Jan 31, 2014, at 7:52 AM, REYANE OUKPEDJO wrote: > Hi there, >=20 > I am trying to solve a problem. My client run as a server. And was trying= to make my client aware about the fact the resource manager is down but I = could not figure out. The reason is that the call : yarnClient.createAppli= cation(); never return when the resource manager is down. However it just s= tay in a loops and sleep after 10 iteration and continue the same loops. Be= low you can find the logs. Any idea how to leave this loop ? is there any p= arameter that control the number of seconds before giving up. >=20 > Thanks >=20 > Reyane OUKPEDJO >=20 >=20 >=20 >=20 >=20 >=20 >=20 > logs > 14/01/31 10:48:05 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 8 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:06 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 9 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:37 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 0 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:38 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 1 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:39 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 2 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:40 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 3 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:41 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 4 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:42 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 5 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:43 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 6 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:44 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 7 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:45 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 8 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:48:46 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 9 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:49:17 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 0 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:49:18 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 1 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:49:19 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 2 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:49:20 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 3 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:49:21 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 4 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) > 14/01/31 10:49:22 INFO ipc.Client: Retrying connect to server: isblade2/9= .32.160.125:8032. Already tried 5 time(s); retry policy is RetryUpToMaximum= CountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS) >=20 --=20 CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to= =20 which it is addressed and may contain information that is confidential,=20 privileged and exempt from disclosure under applicable law. If the reader= =20 of this message is not the intended recipient, you are hereby notified that= =20 any printing, copying, dissemination, distribution, disclosure or=20 forwarding of this communication is strictly prohibited. If you have=20 received this communication in error, please contact the sender immediately= =20 and delete it from your system. Thank You. --Apple-Mail=_FC1979EA-018F-4EB7-AD36-395C5847DBEF Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=ISO-8859-1 Is this on trunk or a re= leased version?

I think the default behavior (when RM HA= is not enabled) shouldn't have client  loop forever. Let me know and = we can see if this needs fixing.

Thanks,
+vinod


On Jan 31, 2014, at 7:52 AM,= REYANE OUKPEDJO <r.oukpedjo@yah= oo.com> wrote:

Hi there,

I am trying to solve a problem. My client run as a server.= And was trying to make my client aware about the fact the resource manager= is down but I could not figure out. The reason is that the call :  ya= rnClient.createApplication(); never return when the resource manager is dow= n. However it just stay in a loops and sleep after 10 iteration and continu= e the same loops. Below you can find the logs. Any idea how to leave this l= oop ? is there any parameter that control the number of seconds before givi= ng up.

T= hanks

Re= yane OUKPEDJO







logs
14/01/31 10:48:05 INFO= ipc.Client: Retrying connect to server: isblade2/9.32.160.125:8032. Alread= y tried 8 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxR= etries=3D10, sleepTime=3D1 SECONDS)
14/01/31 10:48:06 INFO ipc.Client: Retrying connect to server: = isblade2/9.32.160.125:8032. Already tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS= )
14/01/31 10:48:37 INFO= ipc.Client: Retrying connect to server: isblade2/9.32.160.125:8032. Alread= y tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxR= etries=3D10, sleepTime=3D1 SECONDS)
14/01/31 10:48:38 INFO ipc.Client: Retrying connect to server: = isblade2/9.32.160.125:8032. Already tried 1 time(s); retry policy is RetryU= pToMaximumCountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS)
=
14/01/31 10:48:39 INFO ipc.Cl= ient: Retrying connect to server: isblade2/9.32.160.125:8032. Already tried= 2 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries= =3D10, sleepTime=3D1 SECONDS)
14/01/31 10:48:40 INFO ipc.Client: Retrying connect to server: isblade2/9.32.160.125:8032. Already tried 3 time(s); retry policy is Retry= UpToMaximumCountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS)
14/01/31 10:48:41 INFO ipc.C= lient: Retrying connect to server: isblade2/9.32.160.125:8032. Already trie= d 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries= =3D10, sleepTime=3D1 SECONDS)
14/01/31 10:48:42 INFO ipc.Client: Retrying connect to server: isblad= e2/9.32.160.125:8032. Already tried 5 time(s); retry policy is RetryUpToMax= imumCountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS)
14/01/31 10:48:43 INFO ipc.Client: = Retrying connect to server: isblade2/9.32.160.125:8032. Already tried 6 tim= e(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=3D10, = sleepTime=3D1 SECONDS)
1= 4/01/31 10:48:44 INFO ipc.Client: Retrying connect to server: isblade2/9.32.160.125:8032. A= lready tried 7 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep= (maxRetries=3D10, sleepTime=3D1 SECONDS)
14/01/31 10:48:45 INFO ipc.Client: Retrying connect to ser= ver: isblade2/9.32.160.125:8032. Already tried 8 time(s); retry policy is R= etryUpToMaximumCountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS)<= /div>
14/01/31 10:48:46 INFO i= pc.Client: Retrying connect to server: isblade2/9.32.160.125:8032. Already = tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRet= ries=3D10, sleepTime=3D1 SECONDS)
14/01/31 10:49:17 INFO ipc.Client: Retrying connect to server: is= blade2/9.32.160.125:8032. Already tried 0 time(s); retry policy is RetryUpT= oMaximumCountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS)
14/01/31 10:49:18 INFO ipc.Clie= nt: Retrying connect to server: isblade2/9.32.160.125:8032. Already tried 1= time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=3D= 10, sleepTime=3D1 SECONDS)
14/01/31 10:49:19 INFO ipc.Client: Retrying connect to server: isblade2/= 9.32.160.125:8032. Already tried 2 time(s); retry policy is RetryUpToMaximu= mCountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS)
14/01/31 10:49:20 INFO ipc.Client: Ret= rying connect to server: isblade2/9.32.160.125:8032. Already tried 3 time(s= ); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=3D10, sle= epTime=3D1 SECONDS)
14/0= 1/31 10:49:21 INFO ipc.Client: Retrying connect to server: isblade2/9.32.16= 0.125:8032. Already tried 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=3D10, sleepTime=3D1 SECONDS= )
14/01/31 10:49:22 INFO= ipc.Client: Retrying connect to server: isblade2/9.32.160.125:8032. Alread= y tried 5 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxR= etries=3D10, sleepTime=3D1 SECONDS)

=


CONFIDENTIALITY NOTICE
NOTICE: This message is = intended for the use of the individual or entity to which it is addressed a= nd may contain information that is confidential, privileged and exempt from= disclosure under applicable law. If the reader of this message is not the = intended recipient, you are hereby notified that any printing, copying, dis= semination, distribution, disclosure or forwarding of this communication is= strictly prohibited. If you have received this communication in error, ple= ase contact the sender immediately and delete it from your system. Thank Yo= u. --Apple-Mail=_FC1979EA-018F-4EB7-AD36-395C5847DBEF-- --e89a8ff1c428cc6d5c04f1abdfd6 Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename=signature.asc Content-Type: application/pgp-signature; name=signature.asc Content-Description: Message signed with OpenPGP using GPGMail -----BEGIN PGP SIGNATURE----- Comment: GPGTools - http://gpgtools.org iQEcBAEBAgAGBQJS8nEwAAoJECVa31bDbF8PMUwH/1xb4kvxsAUNuE7uDQgZbZ65 8C1pJL3mj8XAVpjazrcKV1qTiMhnoowADaKtx9eRCvb97Vq+c/3Uz0ZYT8Q9gh8T U0kuHJ9sbaWAsNkr0z63T38n49RLn6vAEtBT/EZvGTa4/y/LIYjn372ynU7RisdP MpjCWXSXGp62kEYUuM3oPoNITTIeii6EwW0n46AduNdJ/u0v7+sC7KsBhTtxy8YH oE5gfNCMKlLxmOyWHv5OxbJrZpoGNwHHkbmW7ctvAX7mM7jBdO1KMC9g0lrEMddt FNCSEgPKlA2Q8HhFwM2K168g4MyHZFLCUaD2W4TPuQtFRX0Uqo10VZg2ppp0uCQ= =Uaip -----END PGP SIGNATURE----- --e89a8ff1c428cc6d5c04f1abdfd6--