ambari-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dmitry Lysnichenko (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (AMBARI-5036) Secured: Start All Services task got stuck forever
Date Tue, 11 Mar 2014 20:34:48 GMT

     [ https://issues.apache.org/jira/browse/AMBARI-5036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Dmitry Lysnichenko updated AMBARI-5036:
---------------------------------------

    Description: 
# Deployed 2-node cluster.
# Added 3rd node.
# Enabled security.

After steps above on all 3 hosts tasks jammed and don't want to perform or fail for a very
long time.
VMs are alive, ambari-server and all ambari-agents are running.

Finally got a reproduce using 2 commands
{code}
curl 'http://vm-0.vm:8080/api/v1/clusters/cc/services?params/run_smoke_test=false' -X PUT
-H 'X-Requested-By: X-Requested-By' -u admin:admin --data '{"RequestInfo": {"context": "Start
All Services"}, "Body": {"ServiceInfo": {"state": "STARTED"}}}' ; sleep 3;  curl 'http://vm-0.vm:8080/api/v1/clusters/cc/hosts/vm-0.vm/host_components/APP_TIMELINE_SERVER'
-X DELETE -H 'X-Requested-By: X-Requested-By' -u admin:admin
{code}
The way to reproduce is a bit different compared to an original description (I issue a DELETE
request in 3 seconds after START_ALL_SERVICES request has been issued), but the symptoms are
the same: ServiceComponentHostNotFoundException exception is posted to log and operation is
stuck on stage that contains "App Timeline Server Start" command. 


  was:
# Deployed 2-node cluster.
# Added 3rd node.
# Enabled security.

After steps above on all 3 hosts tasks jammed and don't want to perform or fail for a very
long time.
VMs are alive, ambari-server and all ambari-agents are running.



> Secured: Start All Services task got stuck forever
> --------------------------------------------------
>
>                 Key: AMBARI-5036
>                 URL: https://issues.apache.org/jira/browse/AMBARI-5036
>             Project: Ambari
>          Issue Type: Bug
>          Components: controller, test
>    Affects Versions: 1.5.0
>            Reporter: Dmitry Lysnichenko
>            Assignee: Dmitry Lysnichenko
>             Fix For: 1.5.1
>
>
> # Deployed 2-node cluster.
> # Added 3rd node.
> # Enabled security.
> After steps above on all 3 hosts tasks jammed and don't want to perform or fail for a
very long time.
> VMs are alive, ambari-server and all ambari-agents are running.
> Finally got a reproduce using 2 commands
> {code}
> curl 'http://vm-0.vm:8080/api/v1/clusters/cc/services?params/run_smoke_test=false' -X
PUT -H 'X-Requested-By: X-Requested-By' -u admin:admin --data '{"RequestInfo": {"context":
"Start All Services"}, "Body": {"ServiceInfo": {"state": "STARTED"}}}' ; sleep 3;  curl 'http://vm-0.vm:8080/api/v1/clusters/cc/hosts/vm-0.vm/host_components/APP_TIMELINE_SERVER'
-X DELETE -H 'X-Requested-By: X-Requested-By' -u admin:admin
> {code}
> The way to reproduce is a bit different compared to an original description (I issue
a DELETE request in 3 seconds after START_ALL_SERVICES request has been issued), but the symptoms
are the same: ServiceComponentHostNotFoundException exception is posted to log and operation
is stuck on stage that contains "App Timeline Server Start" command. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message