Return-Path: X-Original-To: apmail-ambari-dev-archive@www.apache.org Delivered-To: apmail-ambari-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6EDBA1882F for ; Sun, 14 Jun 2015 04:58:01 +0000 (UTC) Received: (qmail 11125 invoked by uid 500); 14 Jun 2015 04:58:01 -0000 Delivered-To: apmail-ambari-dev-archive@ambari.apache.org Received: (qmail 11090 invoked by uid 500); 14 Jun 2015 04:58:01 -0000 Mailing-List: contact dev-help@ambari.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ambari.apache.org Delivered-To: mailing list dev@ambari.apache.org Received: (qmail 11075 invoked by uid 99); 14 Jun 2015 04:58:01 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 14 Jun 2015 04:58:01 +0000 Date: Sun, 14 Jun 2015 04:58:01 +0000 (UTC) From: "Jaimin D Jetly (JIRA)" To: dev@ambari.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (AMBARI-11908) ResourceManager fails initial start when not colocated with Namenode MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/AMBARI-11908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14584937#comment-14584937 ] Jaimin D Jetly commented on AMBARI-11908: ----------------------------------------- [~miharp] The task log as shown in the description has following error message {code} Fail: Execution of 'curl -s 'http://helios1.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus'' returned 7. {code} Curl error code "7" stands for *curl failing to connect*. Do you know any obvious reason why host with RM2 was not able to connect to helios1.labs.teradata.com host? It could be something like firewall selinux, namenode process was in INSTALLED state and so not running, namenode host is getting rebooted, etc > ResourceManager fails initial start when not colocated with Namenode > -------------------------------------------------------------------- > > Key: AMBARI-11908 > URL: https://issues.apache.org/jira/browse/AMBARI-11908 > Project: Ambari > Issue Type: Bug > Environment: sles11sp3 > hdp-2.3.0.0-2346 > ambari-2.1.0-1064 > Reporter: Michael Harp > Fix For: 2.1.0 > > Original Estimate: 72h > Remaining Estimate: 72h > > Deploying with blueprint and Namenode-HA and Yarn-HA enabled resoucemanager fails initial start. Subsequent starts succeed. > {code} > 2015-06-13 03:41:24,922 - Getting jmx metrics from NN failed. URL: http://helios1.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus > Traceback (most recent call last): > File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/jmx.py", line 37, in get_value_from_jmx > _, data = shell.checked_call(cmd, user=run_user, quiet=False) > File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 70, in inner > result = function(command, **kwargs) > File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 92, in checked_call > tries=tries, try_sleep=try_sleep) > File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 140, in _call_wrapper > result = _call(command, **kwargs_copy) > File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 282, in _call > raise Fail(err_msg) > Fail: Execution of 'curl -s 'http://helios1.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus'' returned 7. > 2015-06-13 03:41:25,000 - Getting jmx metrics from NN failed. URL: http://helios2.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)