Return-Path: X-Original-To: apmail-cloudstack-issues-archive@www.apache.org Delivered-To: apmail-cloudstack-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 50C68108A8 for ; Wed, 31 Jul 2013 11:55:50 +0000 (UTC) Received: (qmail 60077 invoked by uid 500); 31 Jul 2013 11:55:50 -0000 Delivered-To: apmail-cloudstack-issues-archive@cloudstack.apache.org Received: (qmail 60061 invoked by uid 500); 31 Jul 2013 11:55:49 -0000 Mailing-List: contact issues-help@cloudstack.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cloudstack.apache.org Delivered-To: mailing list issues@cloudstack.apache.org Received: (qmail 60048 invoked by uid 500); 31 Jul 2013 11:55:49 -0000 Delivered-To: apmail-incubator-cloudstack-issues@incubator.apache.org Received: (qmail 60044 invoked by uid 99); 31 Jul 2013 11:55:49 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 31 Jul 2013 11:55:49 +0000 Date: Wed, 31 Jul 2013 11:55:49 +0000 (UTC) From: "Koushik Das (JIRA)" To: cloudstack-issues@incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CLOUDSTACK-3938) Operation Timed out and Resource unreachable exceptions in clustered management server setup MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CLOUDSTACK-3938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13725158#comment-13725158 ] Koushik Das commented on CLOUDSTACK-3938: ----------------------------------------- As per the logs the MS ip is getting set to 127.0.0.1. In a clustered MS setup the correct ip should be set. Please check the setup. > Operation Timed out and Resource unreachable exceptions in clustered management server setup > -------------------------------------------------------------------------------------------- > > Key: CLOUDSTACK-3938 > URL: https://issues.apache.org/jira/browse/CLOUDSTACK-3938 > Project: CloudStack > Issue Type: Bug > Security Level: Public(Anyone can view this level - this is the default.) > Components: Management Server > Affects Versions: 4.2.0 > Environment: 4.2, Using simulator to setup a load test env > Reporter: Sowmya Krishnan > Priority: Blocker > Fix For: 4.2.0 > > Attachments: vmops.log_MS1_3938.gz, vmops_MS2.log_3938.gz > > > Set up: > 2 Management servers, and mysql DB running in a remote server > Using simulated hosts and resources. > Deployed advanced zone with RVR, and setting up simulator hosts and storage pools. > After deploying few of them, getting the following exceptions constantly: > 2013-07-30 02:34:02,468 DEBUG [agent.transport.Request] (StatsCollector-2:null) Seq 2-197787660: Received: { Ans: , MgmtId: 206915885097283, via: 2, Ver: v1 > , Flags: 10, { GetStorageStatsAnswer } } > 2013-07-30 02:34:02,472 DEBUG [agent.manager.DirectAgentAttache] (DirectAgent-35:null) Seq 1-1644953610: Response Received: > 2013-07-30 02:34:02,472 DEBUG [agent.transport.Request] (StatsCollector-3:null) Seq 1-1644953610: Received: { Ans: , MgmtId: 206915885097283, via: 1, Ver: v1, Flags: 10, { GetHostStatsAnswer } } > 2013-07-30 02:34:02,476 DEBUG [agent.manager.ClusteredAgentAttache] (StatsCollector-2:null) Seq 3-2136014853: Forwarding null to 206915885094132 > 2013-07-30 02:34:02,478 DEBUG [agent.manager.ClusteredAgentAttache] (AgentManager-Handler-15:null) Seq 3-2136014853: Routing from 206915885097283 > 2013-07-30 02:34:02,478 DEBUG [agent.manager.ClusteredAgentAttache] (AgentManager-Handler-15:null) Seq 3-2136014853: Link is closed > 2013-07-30 02:34:02,478 DEBUG [agent.manager.ClusteredAgentManagerImpl] (AgentManager-Handler-15:null) Seq 3-2136014853: MgmtId 206915885097283: Req: Resource [Host:3] is unreachable: Host 3: Link is closed > 2013-07-30 02:34:02,479 DEBUG [agent.manager.ClusteredAgentManagerImpl] (AgentManager-Handler-15:null) Seq 3--1: MgmtId 206915885097283: Req: Routing to peer > 2013-07-30 02:34:02,481 DEBUG [agent.manager.ClusteredAgentManagerImpl] (AgentManager-Handler-16:null) Seq 3--1: MgmtId 206915885097283: Req: Cancel request received > 2013-07-30 02:34:02,481 DEBUG [agent.manager.AgentAttache] (AgentManager-Handler-16:null) Seq 3-2136014853: Cancelling. > 2013-07-30 02:34:02,481 DEBUG [agent.manager.AgentAttache] (StatsCollector-2:null) Seq 3-2136014853: Waiting some more time because this is the current command > 2013-07-30 02:34:02,481 DEBUG [agent.manager.AgentAttache] (StatsCollector-2:null) Seq 3-2136014853: Waiting some more time because this is the current command > 2013-07-30 02:34:02,481 INFO [utils.exception.CSExceptionErrorCode] (StatsCollector-2:null) Could not find exception: com.cloud.exception.OperationTimedoutException in error code list for exceptions > 2013-07-30 02:34:02,482 WARN [agent.manager.AgentAttache] (StatsCollector-2:null) Seq 3-2136014853: Timed out on null > 2013-07-30 02:34:02,482 DEBUG [agent.manager.AgentAttache] (StatsCollector-2:null) Seq 3-2136014853: Cancelling. > 2013-07-30 02:34:02,482 DEBUG [cloud.storage.StorageManagerImpl] (StatsCollector-2:null) Unable to send storage pool command to Pool[2|NetworkFilesystem] via 3 > com.cloud.exception.OperationTimedoutException: Commands 2136014853 to Host 3 timed out after 3600 > at com.cloud.agent.manager.AgentAttache.send(AgentAttache.java:430) > at com.cloud.agent.manager.AgentManagerImpl.send(AgentManagerImpl.java:486) > at com.cloud.agent.manager.AgentManagerImpl.send(AgentManagerImpl.java:439) > at com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.java:977) > at com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.java:428) > at com.cloud.storage.StorageManagerImpl.sendToPool(StorageManagerImpl.java:442) > at com.cloud.server.StatsCollector$StorageCollector.run(StatsCollector.java:562) > at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) > at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:351) > at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:178) > at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:165) > at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:267) > at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1146) > at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:679) > 2013-07-30 02:34:02,483 DEBUG [agent.manager.DirectAgentAttache] (DirectAgent-34:null) Seq 2-197787661: Executing request > 2013-07-30 02:34:02,488 DEBUG [agent.manager.ClusteredAgentAttache] (StatsCollector-2:null) Seq 4-1100414979: Forwarding null to 206915885094132 > 2013-07-30 02:34:02,489 DEBUG [agent.manager.DirectAgentAttache] (DirectAgent-34:null) Seq 2-197787661: Response Received: > 2013-07-30 02:34:02,489 DEBUG [agent.transport.Request] (StatsCollector-3:null) Seq 2-197787661: Received: { Ans: , MgmtId: 206915885097283, via: 2, Ver: v1, Flags: 10, { GetHostStatsAnswer } } > 2013-07-30 02:34:02,490 DEBUG [agent.manager.ClusteredAgentAttache] (AgentManager-Handler-17:null) Seq 4-1100414979: Routing from 206915885097283 > 2013-07-30 02:34:02,490 DEBUG [agent.manager.ClusteredAgentAttache] (AgentManager-Handler-17:null) Seq 4-1100414979: Link is closed > 2013-07-30 02:34:02,490 DEBUG [agent.manager.ClusteredAgentManagerImpl] (AgentManager-Handler-17:null) Seq 4-1100414979: MgmtId 206915885097283: Req: Resource [Host:4] is unreachable: Host 4: Link is closed -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira