ambari-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AMBARI-7882) Decommission of JobTracker fails on secure cluster
Date Tue, 21 Oct 2014 16:03:33 GMT

    [ https://issues.apache.org/jira/browse/AMBARI-7882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14178562#comment-14178562
] 

Hudson commented on AMBARI-7882:
--------------------------------

FAILURE: Integrated in Ambari-trunk-Commit #651 (See [https://builds.apache.org/job/Ambari-trunk-Commit/651/])
AMBARI-7882. Decommission of JobTracker fails on secure cluster (aonishuk) (aonishuk: http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=99de55ab85662d0050b7030a8f17175723e0cf97)
* ambari-server/src/test/python/stacks/1.3.2/HDFS/test_hdfs_client.py
* ambari-server/src/main/resources/stacks/HDP/1.3.2/services/MAPREDUCE/package/scripts/jobtracker.py
* ambari-server/src/test/python/stacks/1.3.2/MAPREDUCE/test_mapreduce_jobtracker.py
* ambari-server/src/main/resources/stacks/HDP/1.3.2/services/MAPREDUCE/package/scripts/params.py


> Decommission of JobTracker fails on secure cluster
> --------------------------------------------------
>
>                 Key: AMBARI-7882
>                 URL: https://issues.apache.org/jira/browse/AMBARI-7882
>             Project: Ambari
>          Issue Type: Bug
>            Reporter: Andrew Onischuk
>            Assignee: Andrew Onischuk
>             Fix For: 1.7.0
>
>
> Exception text:
>     
>     
>     
>     {
>       "href" : "http://ec2-54-165-160-62.compute-1.amazonaws.com:8080/api/v1/clusters/cl1/requests/21/tasks/235",
>       "Tasks" : {
>         "attempt_cnt" : 1,
>         "cluster_name" : "cl1",
>         "command" : "CUSTOM_COMMAND",
>         "command_detail" : "DECOMMISSION, Excluded: ip-172-31-37-151.ec2.internal",
>         "custom_command_name" : "DECOMMISSION",
>         "end_time" : 1413796875994,
>         "error_log" : "/var/lib/ambari-agent/data/errors-235.txt",
>         "exit_code" : 1,
>         "host_name" : "ip-172-31-37-148.ec2.internal",
>         "id" : 235,
>         "output_log" : "/var/lib/ambari-agent/data/output-235.txt",
>         "request_id" : 21,
>         "role" : "JOBTRACKER",
>         "stage_id" : 1,
>         "start_time" : 1413796870551,
>         "status" : "FAILED",
>         "stderr" : "2014-10-20 09:21:15,291 - Error while executing command 'decommission':\nTraceback
(most recent call last):\n  File \"/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py\",
line 122, in execute\n    method(env)\n  File \"/var/lib/ambari-agent/cache/stacks/HDP/1.3.2/services/MAPREDUCE/package/scripts/jobtracker.py\",
line 78, in decommission\n    kinit_override=True)\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/base.py\",
line 148, in __init__\n    self.env.run()\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/environment.py\",
line 149, in run\n    self.run_action(resource, action)\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/environment.py\",
line 115, in run_action\n    provider_action()\n  File \"/usr/lib/python2.6/site-packages/resource_management/libraries/providers/execute_hadoop.py\",
line 50, in action_run\n    path        = self.resource.bin_dir\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/base.py\",
line 148, in __init__\n    self.env.run()\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/environment.py\",
line 149, in run\n    self.run_action(resource, action)\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/environment.py\",
line 115, in run_action\n    provider_action()\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py\",
line 237, in action_run\n    raise ex\nFail: Execution of 'hadoop --config /etc/hadoop/conf
mradmin -refreshNodes' returned 255. 14/10/20 09:21:15 ERROR security.UserGroupInformation:
PriviledgedActionException as:mapred cause:javax.security.sasl.SaslException: GSS initiate
failed [Caused by GSSException: No valid credentials provided (Mechanism level: Failed to
find any Kerberos tgt)]\n14/10/20 09:21:15 WARN ipc.Client: Exception encountered while connecting
to the server : javax.security.sasl.SaslException: GSS initiate failed [Caused by GSSException:
No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt)]\n14/10/20
09:21:15 ERROR security.UserGroupInformation: PriviledgedActionException as:mapred cause:java.io.IOException:
javax.security.sasl.SaslException: GSS initiate failed [Caused by GSSException: No valid credentials
provided (Mechanism level: Failed to find any Kerberos tgt)]\nrefreshNodes: Call to ip-172-31-37-148.ec2.internal/172.31.37.148:50300
failed on local exception: java.io.IOException: javax.security.sasl.SaslException: GSS initiate
failed [Caused by GSSException: No valid credentials provided (Mechanism level: Failed to
find any Kerberos tgt)]",
>         "stdout" : "2014-10-20 09:21:11,334 - File['/etc/hadoop/conf/mapred.exclude']
{'owner': 'mapred', 'content': Template('exclude_hosts_list.j2'), 'group': 'hadoop'}\n2014-10-20
09:21:11,338 - Writing File['/etc/hadoop/conf/mapred.exclude'] because contents don't match\n2014-10-20
09:21:11,339 - ExecuteHadoop['mradmin -refreshNodes'] {'conf_dir': '/etc/hadoop/conf', 'kinit_override':
True, 'user': 'mapred'}\n2014-10-20 09:21:11,341 - Execute['hadoop --config /etc/hadoop/conf
mradmin -refreshNodes'] {'logoutput': False, 'path': [], 'tries': 1, 'user': 'mapred', 'try_sleep':
0}\n2014-10-20 09:21:15,291 - Error while executing command 'decommission':\nTraceback (most
recent call last):\n  File \"/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py\",
line 122, in execute\n    method(env)\n  File \"/var/lib/ambari-agent/cache/stacks/HDP/1.3.2/services/MAPREDUCE/package/scripts/jobtracker.py\",
line 78, in decommission\n    kinit_override=True)\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/base.py\",
line 148, in __init__\n    self.env.run()\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/environment.py\",
line 149, in run\n    self.run_action(resource, action)\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/environment.py\",
line 115, in run_action\n    provider_action()\n  File \"/usr/lib/python2.6/site-packages/resource_management/libraries/providers/execute_hadoop.py\",
line 50, in action_run\n    path        = self.resource.bin_dir\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/base.py\",
line 148, in __init__\n    self.env.run()\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/environment.py\",
line 149, in run\n    self.run_action(resource, action)\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/environment.py\",
line 115, in run_action\n    provider_action()\n  File \"/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py\",
line 237, in action_run\n    raise ex\nFail: Execution of 'hadoop --config /etc/hadoop/conf
mradmin -refreshNodes' returned 255. 14/10/20 09:21:15 ERROR security.UserGroupInformation:
PriviledgedActionException as:mapred cause:javax.security.sasl.SaslException: GSS initiate
failed [Caused by GSSException: No valid credentials provided (Mechanism level: Failed to
find any Kerberos tgt)]\n14/10/20 09:21:15 WARN ipc.Client: Exception encountered while connecting
to the server : javax.security.sasl.SaslException: GSS initiate failed [Caused by GSSException:
No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt)]\n14/10/20
09:21:15 ERROR security.UserGroupInformation: PriviledgedActionException as:mapred cause:java.io.IOException:
javax.security.sasl.SaslException: GSS initiate failed [Caused by GSSException: No valid credentials
provided (Mechanism level: Failed to find any Kerberos tgt)]\nrefreshNodes: Call to ip-172-31-37-148.ec2.internal/172.31.37.148:50300
failed on local exception: java.io.IOException: javax.security.sasl.SaslException: GSS initiate
failed [Caused by GSSException: No valid credentials provided (Mechanism level: Failed to
find any Kerberos tgt)]",
>         "structured_out" : { }
>       }
>     }
>     



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message