ambari-issues mailing list archives

From "Andrew Onischuk (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (AMBARI-18228) Restarting services with stale configs failed while starting oozie server
Date Mon, 22 Aug 2016 14:35:20 GMT

     [ https://issues.apache.org/jira/browse/AMBARI-18228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Onischuk updated AMBARI-18228:
-------------------------------------
    Description: 
*Problem:*
During "restart all services", the NameNode (NN) and the Oozie server were restarting at the same time, so Oozie could not create its HDFS folders.
FYI, we already had a similar issue with YARN and Oozie (BUG-60813, resolved).
*Solution:*
Add a dependency to role_command_order so that the Oozie server restarts only after the NN restart has completed.
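
To illustrate the idea behind the solution, here is a minimal sketch of how a dependency map in the style of role_command_order can force one command to run after another. The entry below is an assumption made for illustration and is not copied from AMBARI-18228.patch; `order_commands` is a hypothetical helper, not Ambari's scheduler.

```python
from collections import deque

# Assumed entry, written in the "ROLE-COMMAND": [blockers] style.
# It is NOT taken from the actual patch attached to this issue.
ROLE_COMMAND_ORDER = {
    "OOZIE_SERVER-RESTART": ["NAMENODE-RESTART"],
}


def order_commands(commands, deps):
    """Return the commands in an order that respects deps (Kahn's algorithm)."""
    commands = list(commands)
    wanted = set(commands)
    # Only blockers that are part of this request can delay a command.
    blockers = {c: [d for d in deps.get(c, []) if d in wanted] for c in commands}
    pending = {c: len(b) for c, b in blockers.items()}
    dependents = {c: [] for c in commands}
    for command, blocked_by in blockers.items():
        for blocker in blocked_by:
            dependents[blocker].append(command)

    ready = deque(c for c in commands if pending[c] == 0)
    ordered = []
    while ready:
        current = ready.popleft()
        ordered.append(current)
        for nxt in dependents[current]:
            pending[nxt] -= 1
            if pending[nxt] == 0:
                ready.append(nxt)

    if len(ordered) != len(commands):
        raise ValueError("cyclic dependency in role command order")
    return ordered


if __name__ == "__main__":
    requested = ["OOZIE_SERVER-RESTART", "NAMENODE-RESTART"]
    print(order_commands(requested, ROLE_COMMAND_ORDER))
```

With the assumed entry in place, the Oozie server restart is always scheduled after the NameNode restart, regardless of the order in which the two commands were requested.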

  was:
Restarting services with stale configs failed while restarting the Oozie server.
The service restart failed with the error below:

    
    
    
    stderr:   /var/lib/ambari-agent/data/errors-226.txt
    
    Traceback (most recent call last):
      File "/var/lib/ambari-agent/cache/common-services/OOZIE/4.0.0.2.0/package/scripts/oozie_server.py", line 215, in <module>
        OozieServer().execute()
      File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 280, in execute
        method(env)
      File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 720, in restart
        self.start(env, upgrade_type=upgrade_type)
      File "/var/lib/ambari-agent/cache/common-services/OOZIE/4.0.0.2.0/package/scripts/oozie_server.py", line 95, in start
        oozie_service(action='start', upgrade_type=upgrade_type)
      File "/usr/lib/python2.6/site-packages/ambari_commons/os_family_impl.py", line 89, in thunk
        return fn(*args, **kwargs)
      File "/var/lib/ambari-agent/cache/common-services/OOZIE/4.0.0.2.0/package/scripts/oozie_service.py", line 127, in oozie_service
        list_status = util.run_command(params.hdfs_share_dir, 'GETFILESTATUS', method='GET', ignore_status_codes=['404'], assertable_result=False)
      File "/usr/lib/python2.6/site-packages/resource_management/libraries/providers/hdfs_resource.py", line 179, in run_command
        _, out, err = get_user_call_output(cmd, user=self.run_user, logoutput=self.logoutput, quiet=False)
      File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/get_user_call_output.py", line 61, in get_user_call_output
        raise Fail(err_msg)
    resource_management.core.exceptions.Fail: Execution of 'curl -sS -L -w '%{http_code}' -X GET 'http://nat-r7-yhzu-ambari-api-eu-1-re-1.openstacklocal:20070/webhdfs/v1/user/oozie/share?op=GETFILESTATUS&user.name=oozie' 1>/tmp/tmp03eMiV 2>/tmp/tmpb3STnH' returned 7. curl: (7) Failed connect to nat-r7-yhzu-ambari-api-eu-1-re-1.openstacklocal:20070; Connection refused
    000
    

It looks like this issue occurred because the NameNode was in a stopped state.
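
For diagnosis only, the following sketch repeats the WebHDFS GETFILESTATUS request from the traceback and separates "NameNode not reachable" (the curl exit code 7 case above) from "path does not exist" (HTTP 404, which the failing call was told to ignore). The function name `webhdfs_path_status` and its return strings are hypothetical; this is not how the agent scripts perform the check, and it is not part of the fix.

```python
import urllib.error
import urllib.request


def webhdfs_path_status(namenode_http, path, user, timeout=5):
    """Classify a WebHDFS GETFILESTATUS call as exists / missing / unreachable."""
    url = "{0}/webhdfs/v1{1}?op=GETFILESTATUS&user.name={2}".format(
        namenode_http.rstrip("/"), path, user
    )
    # Talk to the NameNode directly and ignore proxy environment variables.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout) as resp:
            return "exists" if resp.status == 200 else "http-%d" % resp.status
    except urllib.error.HTTPError as exc:
        # The NameNode answered, so it is up; 404 only means the path is absent.
        return "missing" if exc.code == 404 else "http-%d" % exc.code
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, or timeout: the NameNode is not serving.
        return "unreachable"
```

Called with the NameNode HTTP address, `/user/oozie/share`, and `oozie` while the NameNode is stopped, this would be expected to report `unreachable` rather than `missing`.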

The live cluster is available here:
<http://172.22.109.35:8080/#/main/dashboard/metrics>

Artifacts are available here: <http://qelog.hortonworks.com/log/nat-r7-yhzu-ambari-api-eu-1-re/test-logs/ambari-api-eu-integration/>




> Restarting services with stale configs failed while starting oozie server
> -------------------------------------------------------------------------
>
>                 Key: AMBARI-18228
>                 URL: https://issues.apache.org/jira/browse/AMBARI-18228
>             Project: Ambari
>          Issue Type: Bug
>            Reporter: Andrew Onischuk
>            Assignee: Andrew Onischuk
>             Fix For: 2.4.0
>
>         Attachments: AMBARI-18228.patch
>
>
> *Problem:*
> During "restart all services", the NameNode (NN) and the Oozie server were restarting at the same time, so Oozie could not create its HDFS folders.
> FYI, we already had a similar issue with YARN and Oozie (BUG-60813, resolved).
> *Solution:*
> Add a dependency to role_command_order so that the Oozie server restarts only after the NN restart has completed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
