cloudstack-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Abdul Rasool <rasool...@gmail.com>
Subject RE: SSVM agent state is Disconnected
Date Tue, 13 Jan 2015 17:54:02 GMT
Thx Sadhu.  I will check the same tomorrow and update you.

Thx AR
On Jan 13, 2015 10:59 PM, "Suresh Sadhu" <Suresh.Sadhu@citrix.com> wrote:

> Based on your description its looks like your host went in to alert state
> for some time ,please share the complete log so that it's easy to debug
> and comment on it.
> The below  link which I shared will have most of  answers for your
> question(reasons for failures) please go through the link and check your
> settings.
>
> Regards
> Sadhu
>
>
>
>
>
>
>
> -----Original Message-----
> From: Abdul Rasool [mailto:rasoolsam@gmail.com]
> Sent: 13 January 2015 22:10
> To: users@cloudstack.apache.org
> Subject: Re: SSVM agent state is Disconnected
>
> Thanks for your reply
>
> I have executed these blow commands for your reference.
> root@cloud-node001:~# telnet 172.16.206.50 8250 Trying 172.16.206.50...
> Connected to 172.16.206.50.
> Escape character is '^]'.
> ^]
> telnet> q
> Connection closed.
> root@cloud-node001:~# showmount -e 172.16.206.50 Export list for
> 172.16.206.50:
> /export *
>
>
> right now my SSVM & CP is working fine after deleting both of them. but I
> wanted to understand in what condition this 2 VM are not working ? I also
> found that I am not able to mount the secondary volume manually it is
> giving me connection timeout.
>
> I am not able to understand why this SSVM suddenly gone off and come back
> again ? due to this one of my VM status is showing in migration state.
> could you please help me to fix this issue.
>
> Thanks
> AR
>
>
> On 13 January 2015 at 21:28, Suresh Sadhu <Suresh.Sadhu@citrix.com> wrote:
>
> > Check the below command from your ssvm :
> >
> > #telnet  managementIP 8250 and  try to restart the agent on ssvm .
> >
> > Also check below link :
> >
> > https://cwiki.apache.org/confluence/display/CLOUDSTACK/SSVM,+templates
> > ,+Secondary+storage+troubleshooting
> >
> > regards
> > sadhu
> >
> >
> >
> >
> > -----Original Message-----
> > From: Abdul Rasool [mailto:rasoolsam@gmail.com]
> > Sent: 13 January 2015 21:08
> > To: users@cloudstack.apache.org; gopal@assistanz.com
> > Subject: Re: SSVM agent state is Disconnected
> >
> > Hi,
> >
> > Can some one help me with my issue posted below.
> >
> > My SSVM agent is in disconnected state also when I try to mount it
> > manually it is giving me connection timeout. but primary store is
> > mounted in my hypervisors.
> >
> > as per the connectivity part I can able to reach out to NFS storage
> > server
> >
> > Please help me what could be the issue
> >
> > Thanks
> > AR
> >
> > On 12 January 2015 at 17:44, Abdul Rasool <rasoolsam@gmail.com> wrote:
> >
> > > Hi Gopal,
> > >
> > > Thanks for your reply.
> > >
> > > When I restarted management process I get this below logs. also I
> > > have checked my hypersivor cloudstack-agent process is running.
> > >
> > > This same setup was working till the maintenance after all the
> > > management & hypervisor are rebooted I am facing this issue.
> > >
> > >
> > >
> > > *Management Log :-*
> > >
> > > root@cloud-mgmt001:~# /etc/init.d/cloudstack-management restart
> > >  * Stopping CloudStack-specific Tomcat servlet engine
> > > cloudstack-management
> > >               [ OK ]
> > >  * Starting CloudStack-specific Tomcat servlet engine
> > > cloudstack-management
> > >               [ OK ]
> > > root@cloud-mgmt001:~# tail -f
> > > /var/log/cloudstack/management/management-server.log | grep -i -E
> > > 'exception|unable|fail|invalid|leak|warn|error'
> > >
> > > 2015-01-12 16:56:56,670 DEBUG [c.c.s.ConfigurationServerImpl]
> > > (main:null)
> > > mount: warning: /var/lib/cloud/management/systemvm_mnt seems to be
> > > mounted read-only.
> > > 2015-01-12 16:56:56,843 ERROR [c.c.c.ClusterManagerImpl] (main:null)
> > > Unable to ping management server at 172.16.206.50:9090 due to
> > > ConnectException
> > > java.net.ConnectException: Connection refused
> > > 2015-01-12 16:56:57,551 WARN  [c.c.s.d.DownloadMonitorImpl]
> > > (main:null) Only realhostip.com ssl cert is supported, ignoring
> > > self-signed and other certs
> > > 2015-01-12 16:56:57,730 WARN  [c.c.c.ConsoleProxyManagerImpl]
> > > (main:null) Empty console proxy domain, explicitly disabling SSL
> > > 2015-01-12 16:57:10,073 INFO  [c.c.h.x.r.XenServerConnectionPool]
> > > (main:null) XenServer Connection Pool Configs:
> > > sleep.interval.on.error=10000
> > > 2015-01-12 16:57:13,570 WARN  [o.a.c.alerts]
> > > (Cluster-Notification-1:ctx-6e6c9629)  alertType:: 14 //
> > > dataCenterId:: 0 // podId:: 0 // clusterId:: null // message::
> > > Management server node
> > > 172.16.206.50 is up
> > > 2015-01-12 16:57:13,805 WARN  [c.c.c.ClusterManagerImpl]
> > > (Cluster-Notification-1:ctx-6e6c9629) Notifying management server
> > > join event took 379 ms
> > > 2015-01-12 16:57:15,096 WARN  [c.c.a.m.AgentManagerImpl]
> > > (AgentManager-Handler-5:null) Throwing away a request because it
> > > came through as the first command on a connect: Seq 0-41800:  { Cmd ,
> MgmtId:
> > > -1, via: 0, Ver: v1, Flags: 11,
> > > [{"com.cloud.agent.api.PingCommand":{"hostType":"ConsoleProxy","host
> > > Id
> > > ":0,"wait":0}}]
> > > }
> > > 2015-01-12 16:57:15,725 WARN  [c.c.a.m.AgentManagerImpl]
> > > (AgentManager-Handler-10:null) Throwing away a request because it
> > > came through as the first command on a connect: Seq 0-5358:  { Cmd ,
> > > MgmtId: -1,
> > > via: 0, Ver: v1, Flags: 11,
> > >
> >
> [{"com.cloud.agent.api.PingRoutingWithNwGroupsCommand":{"newGroupStates":{},"newStates":{},"_hostVmStateReport":{"i-2-53-VM":{"state":"PowerOn","host":"
> > > cloud-node002.omnesysindia.com"}},"_gatewayAccessible":true,"_vnetAc
> > > ce ssible":true,"hostType":"Routing","hostId":0,"wait":0}}]
> > > }
> > > 2015-01-12 16:57:22,770 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-5:ctx-1ac3e02a ctx-a266ead6) Received unknown
> > > parameters for command listRegions. Unknown parameters : listall
> > > 2015-01-12 16:57:25,543 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-8:ctx-5f1a2859 ctx-4c8c7bcd) Received unknown
> > > parameters for command listZones. Unknown parameters : listall
> > > 2015-01-12 16:57:25,634 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-12:ctx-c25a0f76 ctx-ca7dd015) Received unknown
> > > parameters for command listPods. Unknown parameters : listall
> > > 2015-01-12 16:57:25,725 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-1:ctx-3c89ae04 ctx-4976b536) Received unknown
> > > parameters for command listClusters. Unknown parameters : listall
> > > 2015-01-12 16:57:25,804 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-13:ctx-bf36c88a ctx-44742f02) Received unknown
> > > parameters for command listHosts. Unknown parameters : listall
> > > 2015-01-12 16:57:25,916 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-16:ctx-86e57b7a ctx-5f80737e) Received unknown
> > > parameters for command listStoragePools. Unknown parameters :
> > > listall
> > > 2015-01-12 16:57:26,028 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-15:ctx-eec3ccc8 ctx-7c2755dd) Received unknown
> > > parameters for command listImageStores. Unknown parameters : listall
> > > type
> > > 2015-01-12 16:57:26,124 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-14:ctx-0ba82b9f ctx-e9be8cb7) Received unknown
> > > parameters for command listSystemVms. Unknown parameters : listall
> > > 2015-01-12 16:57:26,787 WARN  [c.c.a.m.AgentManagerImpl]
> > > (StatsCollector-1:ctx-3872619a) Unsupported Command: Unsupported
> > > command issued:com.cloud.agent.api.GetGPUStatsCommand.  Are you sure
> > > you got the right type of server?
> > > 2015-01-12 16:57:26,801 WARN  [c.c.a.m.AgentManagerImpl]
> > > (StatsCollector-1:ctx-3872619a) Unsupported Command: Unsupported
> > > command issued:com.cloud.agent.api.GetGPUStatsCommand.  Are you sure
> > > you got the right type of server?
> > > 2015-01-12 16:57:28,292 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-2:ctx-77d34862 ctx-7aa6f791) Received unknown
> > > parameters for command listSystemVms. Unknown parameters : listall
> > > 2015-01-12 16:57:28,557 WARN  [c.c.a.d.ParamGenericValidationWorker]
> > > (catalina-exec-5:ctx-41c86850 ctx-38e47f1f) Received unknown
> > > parameters for command listHosts. Unknown parameters : listall
> > > 2015-01-12 16:58:11,022 WARN  [o.a.c.f.j.AsyncJobExecutionContext]
> > > (UserVm-Scavenger-1:ctx-fc82b0e0) Job is executed without a context,
> > > setup psudo job for the executing thread
> > > 2015-01-12 16:58:11,162 WARN  [c.c.u.d.Merovingian2]
> > > (UserVm-Scavenger-1:ctx-fc82b0e0) Was unable to find lock for the
> > > key
> > > vm_instance45 and thread id 767297483
> > > 2015-01-12 16:58:13,341 WARN  [c.c.v.UserVmManagerImpl]
> > > (UserVm-Scavenger-1:ctx-fc82b0e0) Unable to expunge
> > > VM[User|i-2-45-VM]
> > > com.cloud.utils.exception.CloudRuntimeException: Unable to locate
> > > datastore with id 3
> > > 2015-01-12 17:00:13,366 WARN  [o.a.c.f.j.AsyncJobExecutionContext]
> > > (UserVm-Scavenger-1:ctx-dcb1cf38) Job is executed without a context,
> > > setup psudo job for the executing thread
> > > 2015-01-12 17:00:13,385 WARN  [c.c.u.d.Merovingian2]
> > > (UserVm-Scavenger-1:ctx-dcb1cf38) Was unable to find lock for the
> > > key
> > > vm_instance45 and thread id 767297483
> > > 2015-01-12 17:00:15,489 WARN  [c.c.v.UserVmManagerImpl]
> > > (UserVm-Scavenger-1:ctx-dcb1c
> > >
> > >
> > > *Hypervisor Log :- *
> > >
> > > When I reload the cloudstack-agent on hypervisor I got this below
> error.
> > > but all my VMs are working.
> > >
> > > 2015-01-12 17:40:51,742 DEBUG
> > > [kvm.resource.LibvirtComputingResource]
> > > (main:null) failing to get physical interface from bridge cloud0,
> > > did not find an eth*, bond*, vlan*, em*, or p*p* in
> > > /sys/devices/virtual/net/cloud0/brif
> > > 2015-01-12 17:40:51,744 DEBUG
> > > [kvm.resource.LibvirtComputingResource]
> > > (main:null) failing to get physical interface from bridge virbr0,
> > > did not find an eth*, bond*, vlan*, em*, or p*p* in
> > > /sys/devices/virtual/net/virbr0/brif
> > > 2015-01-12 17:40:52,949 WARN
> > > [kvm.resource.LibvirtComputingResource]
> > > (Agent-Handler-1:null) Could not read cpuinfo_max_freq
> > >
> > >
> > > Please help me on this.
> > >
> > > Thanks
> > > AR
> > >
> > > On 12 January 2015 at 17:29, Gopalakrishnan S <gopal@assistanz.com>
> > wrote:
> > >
> > >> Hi Abdul,
> > >>
> > >> Did you check the management server logs when you try to reconnect
> > >> your agent?
> > >> Check the agant status in your hypervisor machine.
> > >>
> > >> Please make sure you haven't any issue with the firewall.
> > >>
> > >> Thank You.
> > >> Gopalakrishnan.S
> > >>
> > >>   ----- Original Message -----
> > >>   From: Abdul Rasool
> > >>   To: users@cloudstack.apache.org
> > >>   Sent: Monday, January 12, 2015 5:03 PM
> > >>   Subject: SSVM agent state is Disconnected
> > >>
> > >>
> > >>   Hi,
> > >>
> > >>
> > >>   After rebooting my management & hypervisor server my SSVM agent
> > >> state is showing disconnected. could you please help me to fix this
> > issue.
> > >>
> > >>
> > >>
> > >>
> > >>
> > >>
> > >>    ACS .4.4.1
> > >>   Management server with NFS (Ubuntu 12.04 64bit)
> > >>   Hypervisor = KVM(Ubuntu 12.04 64bit)
> > >>
> > >>
> > >>
> > >>
> > >>   OS = Ubuntu
> > >>
> > >>
> > >>   Thanks
> > >>   AR
> > >
> > >
> > >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message