ambari-dev mailing list archives

From "Michael Harp (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AMBARI-11908) ResourceManager fails initial start when not colocated with Namenode
Date Sun, 14 Jun 2015 18:06:00 GMT

    [ https://issues.apache.org/jira/browse/AMBARI-11908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14585163#comment-14585163 ]

Michael Harp commented on AMBARI-11908:
---------------------------------------

No obvious reason. We do not have a firewall or SELinux installed. Also, our 2-master blueprint
has NN and RM colocated and does not exhibit this issue. My guess is that it is a timing issue
related to the move to parallel deployments.
https://github.com/hortonworks/td-ape-tigereye/blob/master/hadoopBuilder/src/config/ambari/2-master-HDP-2.3.json
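
One way to test the timing theory would be to poll the NameNode JMX endpoints from the RM host and see how long they take to come up. Exit code 7 from curl means it could not connect to the host, which is consistent with the NameNode web UI not yet listening. The following is only a minimal sketch, not Ambari code: the function names, the retry defaults, and the assumption that the JMX response is a JSON object with a "beans" list whose first entry has a "State" attribute are all illustrative.

```python
import json
import time
import urllib.error
import urllib.request


def fetch_jmx(url, timeout=5):
    """Return the parsed JMX JSON from url, or None if it cannot be fetched."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return json.loads(resp.read().decode("utf-8"))
    except (urllib.error.URLError, OSError, ValueError):
        # A refused connection (what curl reports as exit code 7) lands here,
        # as does a timeout or a response that is not valid JSON.
        return None


def wait_for_namenode_state(urls, tries=10, try_sleep=3,
                            fetch=fetch_jmx, sleep=time.sleep):
    """Poll each NameNode JMX URL until one reports a State.

    Returns (url, state) for the first NameNode that answers, or
    (None, None) if none answers within `tries` rounds.
    """
    for attempt in range(tries):
        for url in urls:
            data = fetch(url)
            # Assumed response shape: {"beans": [{"State": "active", ...}]}
            beans = (data or {}).get("beans", [])
            if beans and "State" in beans[0]:
                return url, beans[0]["State"]
        if attempt < tries - 1:
            sleep(try_sleep)
    return None, None
```

Calling wait_for_namenode_state with the two NameNodeStatus URLs from the log in the issue description, and logging the time before it returns, would show whether the NameNodes simply become reachable a little after the ResourceManager start is attempted.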

> ResourceManager fails initial start when not colocated with Namenode
> --------------------------------------------------------------------
>
>                 Key: AMBARI-11908
>                 URL: https://issues.apache.org/jira/browse/AMBARI-11908
>             Project: Ambari
>          Issue Type: Bug
>         Environment: sles11sp3
> hdp-2.3.0.0-2346
> ambari-2.1.0-1064
>            Reporter: Michael Harp
>             Fix For: 2.1.0
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> Deploying with a blueprint with NameNode HA and YARN HA enabled, the ResourceManager fails its initial start.
> Subsequent starts succeed.
> {code}
> 2015-06-13 03:41:24,922 - Getting jmx metrics from NN failed. URL: http://helios1.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus
> Traceback (most recent call last):
>   File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/jmx.py", line 37, in get_value_from_jmx
>     _, data = shell.checked_call(cmd, user=run_user, quiet=False)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 70, in inner
>     result = function(command, **kwargs)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 92, in checked_call
>     tries=tries, try_sleep=try_sleep)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 140, in _call_wrapper
>     result = _call(command, **kwargs_copy)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 282, in _call
>     raise Fail(err_msg)
> Fail: Execution of 'curl -s 'http://helios1.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus'' returned 7.
> 2015-06-13 03:41:25,000 - Getting jmx metrics from NN failed. URL: http://helios2.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
