cloudstack-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From GitBox <...@apache.org>
Subject [GitHub] [cloudstack] div8cn commented on issue #3505: Agent LB for CloudStack failed
Date Fri, 19 Jul 2019 15:59:25 GMT
div8cn commented on issue #3505: Agent LB for CloudStack failed
URL: https://github.com/apache/cloudstack/issues/3505#issuecomment-513283089
 
 
   > I'm not able to replicate the issue:
   > 
   > * 4x KVM hosts
   > * 2x Mgmt servers: 10.2.2.71 and 10.2.2.72
   > * indirect.agent.lb.algorithm=roundrobin
   > * indirect.agent.lb.check.interval=60
   > 
   > Host connected to mgmt server 10.2.2.72. When rebooting mgmt server 10.2.2.72 I observe
it connects to the next management server as expected:
   > 
   > ```
   > 2019-07-19 15:37:47,349 INFO  [cloud.agent.Agent] (Host LB Timer:null) (logid:3f187a6a)
Connected to the host: 10.2.2.72
   > 2019-07-19 15:37:50,604 INFO  [cloud.agent.Agent] (Agent-Handler-2:null) (logid:3f187a6a)
Proccess agent startup answer, agent id = 0
   > 2019-07-19 15:37:50,605 INFO  [cloud.agent.Agent] (Agent-Handler-2:null) (logid:3f187a6a)
Set agent id 0
   > 2019-07-19 15:37:50,606 INFO  [cloud.agent.Agent] (Agent-Handler-2:null) (logid:3f187a6a)
Startup Response Received: agent id = 0
   > 2019-07-19 15:37:50,763 INFO  [kvm.storage.LibvirtStorageAdaptor] (agentRequest-Handler-1:null)
(logid:67bd0e1d) Attempting to create storage pool e03f4cba-17cd-3b04-b6ad-c3e916237109 (NetworkFilesystem)
in libvirt
   > 2019-07-19 15:37:50,763 INFO  [kvm.storage.LibvirtStorageAdaptor] (agentRequest-Handler-1:null)
(logid:67bd0e1d) Found existing defined storage pool e03f4cba-17cd-3b04-b6ad-c3e916237109,
using it.
   > 2019-07-19 15:37:50,764 INFO  [kvm.storage.LibvirtStorageAdaptor] (agentRequest-Handler-1:null)
(logid:67bd0e1d) Trying to fetch storage pool e03f4cba-17cd-3b04-b6ad-c3e916237109 from libvirt
   > 2019-07-19 15:37:50,952 INFO  [cloud.agent.Agent] (agentRequest-Handler-1:null) (logid:67bd0e1d)
Processing agent ready command, agent id = 4
   > 2019-07-19 15:37:50,953 INFO  [cloud.agent.Agent] (agentRequest-Handler-1:null) (logid:67bd0e1d)
Set agent id 4
   > 2019-07-19 15:37:50,953 INFO  [cloud.agent.Agent] (agentRequest-Handler-1:null) (logid:67bd0e1d)
Ready command is processed for agent id = 4
   > 2019-07-19 15:37:51,045 INFO  [cloud.agent.Agent] (agentRequest-Handler-2:null) (logid:67bd0e1d)
Processing agent ready command, agent id = 4
   > 2019-07-19 15:37:51,045 INFO  [cloud.agent.Agent] (agentRequest-Handler-2:null) (logid:67bd0e1d)
Set agent id 4
   > 2019-07-19 15:37:51,047 INFO  [cloud.agent.Agent] (agentRequest-Handler-2:null) (logid:67bd0e1d)
Processed new management server list: 10.2.2.72,10.2.2.71@roundrobin
   > 2019-07-19 15:37:51,047 INFO  [cloud.agent.Agent] (agentRequest-Handler-2:null) (logid:67bd0e1d)
Scheduling preferred host timer task with host.lb.interval=60000ms
   > 2019-07-19 15:37:51,047 INFO  [cloud.agent.Agent] (agentRequest-Handler-2:null) (logid:67bd0e1d)
Ready command is processed for agent id = 4
   > 2019-07-19 15:38:01,541 INFO  [cloud.agent.Agent] (Agent-Handler-3:null) (logid:3f187a6a)
Lost connection to host: 10.2.2.72. Attempting reconnection while we still have 0 commands
in progress.
   > 2019-07-19 15:38:01,543 INFO  [utils.nio.NioClient] (Agent-Handler-3:null) (logid:3f187a6a)
NioClient connection closed
   > 2019-07-19 15:38:01,543 INFO  [cloud.agent.Agent] (Agent-Handler-3:null) (logid:3f187a6a)
Reconnecting to host:10.2.2.72
   > 2019-07-19 15:38:01,543 INFO  [utils.nio.NioClient] (Agent-Handler-3:null) (logid:3f187a6a)
Connecting to 10.2.2.72:8250
   > 2019-07-19 15:38:01,543 WARN  [utils.nio.NioConnection] (Agent-Handler-3:null) (logid:3f187a6a)
Unable to connect to remote: is there a server running on port 8250
   > 2019-07-19 15:38:06,544 INFO  [cloud.agent.Agent] (Agent-Handler-3:null) (logid:3f187a6a)
Reconnecting to host:10.2.2.71
   > 2019-07-19 15:38:06,544 INFO  [utils.nio.NioClient] (Agent-Handler-3:null) (logid:3f187a6a)
Connecting to 10.2.2.71:8250
   > 2019-07-19 15:38:06,545 INFO  [utils.nio.Link] (Agent-Handler-3:null) (logid:3f187a6a)
Conf file found: /etc/cloudstack/agent/agent.properties
   > 2019-07-19 15:38:06,666 INFO  [utils.nio.NioClient] (Agent-Handler-3:null) (logid:3f187a6a)
SSL: Handshake done
   > 2019-07-19 15:38:06,667 INFO  [utils.nio.NioClient] (Agent-Handler-3:null) (logid:3f187a6a)
Connected to 10.2.2.71:8250
   > 2019-07-19 15:38:06,670 WARN  [kvm.resource.LibvirtComputingResource] (Agent-Handler-1:null)
(logid:3f187a6a) Could not read cpuinfo_max_freq
   > 2019-07-19 15:38:06,737 INFO  [kvm.storage.LibvirtStorageAdaptor] (Agent-Handler-1:null)
(logid:3f187a6a) Attempting to create storage pool 597ef70d-f468-44d1-bccd-b307ded05802 (Filesystem)
in libvirt
   > 2019-07-19 15:38:06,738 INFO  [kvm.storage.LibvirtStorageAdaptor] (Agent-Handler-1:null)
(logid:3f187a6a) Found existing defined storage pool 597ef70d-f468-44d1-bccd-b307ded05802,
using it.
   > 2019-07-19 15:38:06,738 INFO  [kvm.storage.LibvirtStorageAdaptor] (Agent-Handler-1:null)
(logid:3f187a6a) Trying to fetch storage pool 597ef70d-f468-44d1-bccd-b307ded05802 from libvirt
   > 2019-07-19 15:38:07,622 INFO  [cloud.agent.Agent] (Agent-Handler-2:null) (logid:3f187a6a)
Proccess agent startup answer, agent id = 0
   > 2019-07-19 15:38:07,622 INFO  [cloud.agent.Agent] (Agent-Handler-2:null) (logid:3f187a6a)
Set agent id 0
   > 2019-07-19 15:38:07,623 INFO  [cloud.agent.Agent] (Agent-Handler-2:null) (logid:3f187a6a)
Startup Response Received: agent id = 0
   > 2019-07-19 15:38:07,668 INFO  [kvm.storage.LibvirtStorageAdaptor] (agentRequest-Handler-5:null)
(logid:2079e6bf) Attempting to create storage pool e03f4cba-17cd-3b04-b6ad-c3e916237109 (NetworkFilesystem)
in libvirt
   > 2019-07-19 15:38:07,669 INFO  [kvm.storage.LibvirtStorageAdaptor] (agentRequest-Handler-5:null)
(logid:2079e6bf) Found existing defined storage pool e03f4cba-17cd-3b04-b6ad-c3e916237109,
using it.
   > 2019-07-19 15:38:07,669 INFO  [kvm.storage.LibvirtStorageAdaptor] (agentRequest-Handler-5:null)
(logid:2079e6bf) Trying to fetch storage pool e03f4cba-17cd-3b04-b6ad-c3e916237109 from libvirt
   > 2019-07-19 15:38:07,756 INFO  [cloud.agent.Agent] (agentRequest-Handler-3:null) (logid:2079e6bf)
Processing agent ready command, agent id = 4
   > 2019-07-19 15:38:07,756 INFO  [cloud.agent.Agent] (agentRequest-Handler-3:null) (logid:2079e6bf)
Set agent id 4
   > 2019-07-19 15:38:07,756 INFO  [cloud.agent.Agent] (agentRequest-Handler-3:null) (logid:2079e6bf)
Ready command is processed for agent id = 4
   > 2019-07-19 15:38:07,845 INFO  [cloud.agent.Agent] (agentRequest-Handler-4:null) (logid:2079e6bf)
Processing agent ready command, agent id = 4
   > 2019-07-19 15:38:07,846 INFO  [cloud.agent.Agent] (agentRequest-Handler-4:null) (logid:2079e6bf)
Set agent id 4
   > 2019-07-19 15:38:07,849 INFO  [cloud.agent.Agent] (agentRequest-Handler-4:null) (logid:2079e6bf)
Processed new management server list: 10.2.2.72,10.2.2.71@roundrobin
   > 2019-07-19 15:38:07,849 INFO  [cloud.agent.Agent] (agentRequest-Handler-4:null) (logid:2079e6bf)
Scheduling preferred host timer task with host.lb.interval=60000ms
   > 2019-07-19 15:38:07,850 INFO  [cloud.agent.Agent] (agentRequest-Handler-4:null) (logid:2079e6bf)
Ready command is processed for agent id = 4
   > 2019-07-19 15:38:11,668 INFO  [cloud.agent.Agent] (Agent-Handler-3:null) (logid:3f187a6a)
Connected to the host: 10.2.2.71
   > ```
   
   When reboot or systemctl stop cloudstack-management is executed in os, the agent responds
immediately and completes the switch.
   
   When the simulated management node hardware fails and suddenly loses power, the agent does
not immediately discover and switch, it needs to wait about 15 minutes.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

Mime
View raw message