ambari-dev mailing list archives

From "Dmitry Lysnichenko (JIRA)" <j...@apache.org>
Subject [jira] [Created] (AMBARI-6748) Nimbus start failed after deployment
Date Tue, 05 Aug 2014 19:04:11 GMT
Dmitry Lysnichenko created AMBARI-6748:
------------------------------------------

             Summary: Nimbus start failed after deployment
                 Key: AMBARI-6748
                 URL: https://issues.apache.org/jira/browse/AMBARI-6748
             Project: Ambari
          Issue Type: Bug
          Components: agent
    Affects Versions: 1.7.0
            Reporter: Dmitry Lysnichenko
            Assignee: Dmitry Lysnichenko
             Fix For: 1.7.0


After deploying HDP-2.1, "Start All Services" failed because Nimbus does not start.

Error log:
{code}
stderr: 
2014-08-04 18:12:28,049 - Error while executing command 'start':
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 122, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.1/services/STORM/package/scripts/nimbus.py", line 43, in start
    service("nimbus", action="start")
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.1/services/STORM/package/scripts/service.py", line 64, in service
    try_sleep=10
  File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 148, in __init__
    self.env.run()
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 149, in run
    self.run_action(resource, action)
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 115, in run_action
    provider_action()
  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 241, in action_run
    raise ex
Fail: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f "^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
 stdout:
2014-08-04 18:11:37,197 - Execute['mkdir -p /tmp/HDP-artifacts/;     curl -kf -x "" --retry
10     http://ambsmoke6-4-1407193726-1.cs1cloud.internal:8080/resources//UnlimitedJCEPolicyJDK7.zip
-o /tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip'] {'environment': ..., 'not_if': 'test -e
/tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip', 'ignore_failures': True, 'path': ['/bin',
'/usr/bin/']}
2014-08-04 18:11:37,211 - Skipping Execute['mkdir -p /tmp/HDP-artifacts/;     curl -kf -x
"" --retry 10     http://ambsmoke6-4-1407193726-1.cs1cloud.internal:8080/resources//UnlimitedJCEPolicyJDK7.zip
-o /tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip'] due to not_if
2014-08-04 18:11:37,212 - Execute['rm -f local_policy.jar; rm -f US_export_policy.jar; unzip
-o -j -q /tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip'] {'path': ['/bin/', '/usr/bin'],
'only_if': 'test -e /usr/jdk64/jdk1.7.0_45/jre/lib/security && test -f /tmp/HDP-artifacts//UnlimitedJCEPolicyJDK7.zip',
'cwd': '/usr/jdk64/jdk1.7.0_45/jre/lib/security'}
2014-08-04 18:11:37,390 - Directory['/etc/hadoop/conf.empty'] {'owner': 'root', 'group': 'root',
'recursive': True}
2014-08-04 18:11:37,391 - Link['/etc/hadoop/conf'] {'not_if': 'ls /etc/hadoop/conf', 'to':
'/etc/hadoop/conf.empty'}
2014-08-04 18:11:37,404 - Skipping Link['/etc/hadoop/conf'] due to not_if
2014-08-04 18:11:37,419 - File['/etc/hadoop/conf/hadoop-env.sh'] {'content': InlineTemplate(...),
'owner': 'root'}
2014-08-04 18:11:37,419 - XmlConfig['core-site.xml'] {'owner': 'hdfs', 'group': 'hadoop',
'conf_dir': '/etc/hadoop/conf', 'configuration_attributes': ..., 'configurations': ...}
2014-08-04 18:11:37,429 - Generating config: /etc/hadoop/conf/core-site.xml
2014-08-04 18:11:37,430 - File['/etc/hadoop/conf/core-site.xml'] {'owner': 'hdfs', 'content':
InlineTemplate(...), 'group': 'hadoop', 'mode': None, 'encoding': 'UTF-8'}
2014-08-04 18:11:37,431 - Writing File['/etc/hadoop/conf/core-site.xml'] because contents
don't match
2014-08-04 18:11:37,443 - Execute['/bin/echo 0 > /selinux/enforce'] {'only_if': 'test -f
/selinux/enforce'}
2014-08-04 18:11:37,456 - Skipping Execute['/bin/echo 0 > /selinux/enforce'] due to only_if
2014-08-04 18:11:37,457 - Execute['mkdir -p /usr/lib/hadoop/lib/native/Linux-i386-32; ln -sf
/usr/lib/libsnappy.so /usr/lib/hadoop/lib/native/Linux-i386-32/libsnappy.so'] {}
2014-08-04 18:11:37,488 - Execute['mkdir -p /usr/lib/hadoop/lib/native/Linux-amd64-64; ln
-sf /usr/lib64/libsnappy.so /usr/lib/hadoop/lib/native/Linux-amd64-64/libsnappy.so'] {}
2014-08-04 18:11:37,500 - Directory['/grid/0/log/hadoop'] {'owner': 'root', 'group': 'root',
'recursive': True}
2014-08-04 18:11:37,501 - Directory['/var/run/hadoop'] {'owner': 'root', 'group': 'root',
'recursive': True}
2014-08-04 18:11:37,502 - Directory['/tmp/hadoop-hdfs'] {'owner': 'hdfs', 'recursive': True}
2014-08-04 18:11:37,506 - File['/etc/hadoop/conf/commons-logging.properties'] {'content':
Template('commons-logging.properties.j2'), 'owner': 'root'}
2014-08-04 18:11:37,508 - File['/etc/hadoop/conf/health_check'] {'content': Template('health_check-v2.j2'),
'owner': 'root'}
2014-08-04 18:11:37,509 - File['/etc/hadoop/conf/log4j.properties'] {'content': '...', 'owner':
'hdfs', 'group': 'hadoop', 'mode': 0644}
2014-08-04 18:11:37,516 - File['/etc/hadoop/conf/hadoop-metrics2.properties'] {'content':
Template('hadoop-metrics2.properties.j2'), 'owner': 'hdfs'}
2014-08-04 18:11:37,517 - File['/etc/hadoop/conf/task-log4j.properties'] {'content': StaticFile('task-log4j.properties'),
'mode': 0755}
2014-08-04 18:11:37,517 - File['/etc/hadoop/conf/configuration.xsl'] {'owner': 'hdfs', 'group':
'hadoop'}
2014-08-04 18:11:37,705 - Directory['/var/log/storm'] {'owner': 'storm', 'group': 'hadoop',
'recursive': True}
2014-08-04 18:11:37,707 - Directory['/var/run/storm'] {'owner': 'storm', 'group': 'hadoop',
'recursive': True}
2014-08-04 18:11:37,707 - Directory['/grid/0/hadoop/storm'] {'owner': 'storm', 'group': 'hadoop',
'recursive': True}
2014-08-04 18:11:37,707 - Directory['/etc/storm/conf'] {'owner': 'storm', 'group': 'hadoop',
'recursive': True}
2014-08-04 18:11:37,714 - File['/etc/storm/conf/config.yaml'] {'owner': 'storm', 'content':
Template('config.yaml.j2'), 'group': 'hadoop'}
2014-08-04 18:11:37,719 - File['/etc/storm/conf/storm.yaml'] {'owner': 'storm', 'content':
InlineTemplate(...), 'group': 'hadoop', 'mode': None}
2014-08-04 18:11:37,721 - Writing File['/etc/storm/conf/storm.yaml'] because contents don't
match
2014-08-04 18:11:37,722 - File['/etc/storm/conf/storm-env.sh'] {'content': '\n#!/bin/bash\n\n#
Set Storm specific environment variables here.\n\n# The java implementation to use.\nexport
JAVA_HOME={{java_home}}\n\n# export STORM_CONF_DIR=""', 'owner': 'storm'}
2014-08-04 18:11:37,722 - TemplateConfig['/etc/storm/conf/storm_jaas.conf'] {'owner': 'storm'}
2014-08-04 18:11:37,724 - File['/etc/storm/conf/storm_jaas.conf'] {'content': Template('storm_jaas.conf.j2'),
'owner': 'storm', 'group': None, 'mode': None}
2014-08-04 18:11:37,725 - Execute['env JAVA_HOME=/usr/jdk64/jdk1.7.0_45 PATH=$PATH:/usr/jdk64/jdk1.7.0_45/bin
/usr/bin/storm nimbus > /var/log/storm/nimbus.out 2>&1'] {'wait_for_finish': False,
'not_if': 'ls /var/run/storm/nimbus.pid >/dev/null 2>&1 && ps `cat /var/run/storm/nimbus.pid`
>/dev/null 2>&1', 'user': 'storm'}
2014-08-04 18:11:37,752 - Execute['pgrep -f "^java.+backtype.storm.daemon.nimbus$" &&
pgrep -f "^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid'] {'logoutput':
True, 'tries': 6, 'user': 'storm', 'try_sleep': 10}
2014-08-04 18:11:37,790 - Retrying after 10 seconds. Reason: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$"
&& pgrep -f "^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid'
returned 1. 
2014-08-04 18:11:47,827 - Retrying after 10 seconds. Reason: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$"
&& pgrep -f "^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid'
returned 1. 
2014-08-04 18:11:57,881 - Retrying after 10 seconds. Reason: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$"
&& pgrep -f "^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid'
returned 1. 
2014-08-04 18:12:07,946 - Retrying after 10 seconds. Reason: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$"
&& pgrep -f "^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid'
returned 1. 
2014-08-04 18:12:17,989 - Retrying after 10 seconds. Reason: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$"
&& pgrep -f "^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid'
returned 1. 
2014-08-04 18:12:28,049 - Error while executing command 'start':
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 122, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.1/services/STORM/package/scripts/nimbus.py", line 43, in start
    service("nimbus", action="start")
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.1/services/STORM/package/scripts/service.py", line 64, in service
    try_sleep=10
  File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 148, in __init__
    self.env.run()
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 149, in run
    self.run_action(resource, action)
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 115, in run_action
    provider_action()
  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 241, in action_run
    raise ex
Fail: Execution of 'pgrep -f "^java.+backtype.storm.daemon.nimbus$" && pgrep -f "^java.+backtype.storm.daemon.nimbus$" > /var/run/storm/nimbus.pid' returned 1.
{code}
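
For readers following the log: the pid check is logged with 'tries': 6 and 'try_sleep': 10, and the output shows five "Retrying after 10 seconds" lines before the final failure. The sketch below illustrates that retry pattern only. It is not Ambari's implementation; `run_with_retries`, its `check` callback and the injectable `sleep` parameter are hypothetical names introduced for this example.

```python
import time


def run_with_retries(check, tries=6, try_sleep=10, sleep=time.sleep):
    """Call check() up to `tries` times, sleeping `try_sleep` seconds
    between attempts. Return the attempt number on success; raise
    RuntimeError if every attempt returns a non-zero code."""
    code = None
    for attempt in range(1, tries + 1):
        code = check()
        if code == 0:
            return attempt
        if attempt < tries:
            # No sleep after the final attempt: six tries give five waits.
            sleep(try_sleep)
    raise RuntimeError("check returned %d after %d tries" % (code, tries))


# Example: a check that never succeeds (like pgrep finding no Nimbus
# process), with sleeping stubbed out so the example finishes instantly.
waits = []
try:
    run_with_retries(lambda: 1, sleep=waits.append)
except RuntimeError as err:
    print(err, "| sleeps:", len(waits))
```

With six tries there are five waits, which matches the five retry lines between 18:11:37 and 18:12:28 in the stdout above.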

Contents of /var/log/storm/nimbus.out:
{code}
Traceback (most recent call last):
  File "/usr/lib/storm/bin/storm.py", line 463, in <module>
    main()
  File "/usr/lib/storm/bin/storm.py", line 460, in main
    (COMMANDS.get(COMMAND, unknown_command))(*ARGS)
  File "/usr/lib/storm/bin/storm.py", line 276, in nimbus
    jvmopts = parse_args(confvalue("nimbus.childopts", cppaths)) + [
  File "/usr/lib/storm/bin/storm.py", line 87, in confvalue
    p = sub.Popen(command, stdout=sub.PIPE)
  File "/usr/lib64/python2.6/subprocess.py", line 642, in __init__
    errread, errwrite)
  File "/usr/lib64/python2.6/subprocess.py", line 1234, in _execute_child
    raise child_exception
OSError: [Errno 2] No such file or directory
{code}
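
The traceback above ends inside `subprocess.Popen` with `OSError: [Errno 2]`, which is the error Python raises when the program it was asked to execute cannot be found. Which executable `storm.py` failed to locate is not stated in this report. The sketch below only reproduces the same class of error and shows one way to turn it into a readable diagnosis; `run_or_explain` and the binary name `no-such-java-binary` are made up for illustration.

```python
import errno
import subprocess


def run_or_explain(command):
    """Run `command` (a list of arguments) and return its stdout, or a
    short diagnosis string if the executable itself cannot be found."""
    try:
        p = subprocess.Popen(command, stdout=subprocess.PIPE)
    except OSError as e:
        # ENOENT (errno 2) here means the executable was not found,
        # the same error that terminates the traceback in nimbus.out.
        if e.errno == errno.ENOENT:
            return "executable not found: %r" % command[0]
        raise
    output, _ = p.communicate()
    return output


# "no-such-java-binary" is a hypothetical name used only to trigger the error.
print(run_or_explain(["no-such-java-binary", "-version"]))
```

A check of this kind reports the missing command by name, whereas the bare `OSError` in nimbus.out does not say which file was not found.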




--
This message was sent by Atlassian JIRA
(v6.2#6252)
