Return-Path: X-Original-To: apmail-cloudstack-users-archive@www.apache.org Delivered-To: apmail-cloudstack-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DE1E417371 for ; Wed, 22 Oct 2014 16:41:11 +0000 (UTC) Received: (qmail 86859 invoked by uid 500); 22 Oct 2014 16:41:10 -0000 Delivered-To: apmail-cloudstack-users-archive@cloudstack.apache.org Received: (qmail 86807 invoked by uid 500); 22 Oct 2014 16:41:10 -0000 Mailing-List: contact users-help@cloudstack.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@cloudstack.apache.org Delivered-To: mailing list users@cloudstack.apache.org Received: (qmail 86739 invoked by uid 99); 22 Oct 2014 16:41:09 -0000 Received: from mx1-us-east.apache.org (HELO mx1-us-east.apache.org) (54.164.171.186) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Oct 2014 16:41:09 +0000 Received: from mx1-us-east.apache.org (localhost [127.0.0.1]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTP id 93DF5435AE for ; Wed, 22 Oct 2014 16:41:31 +0000 (UTC) Received: by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org, from userid 111) id 8951B43AC9; Wed, 22 Oct 2014 16:41:31 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on mx1-us-east.apache.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=10.0 tests=HTML_MESSAGE,RP_MATCHES_RCVD, SPF_HELO_PASS,SPF_PASS,URIBL_BLOCKED autolearn=disabled version=3.4.0 Received: from mail.arhont.com (mail1.arhont.com [178.248.108.132]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTP id 787AA435AE for ; Wed, 22 Oct 2014 16:41:30 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by mail1.arhont.com (Postfix) with ESMTP id 6B3F5980116 for ; Wed, 22 Oct 2014 17:41:00 +0100 (BST) Received: from mail.arhont.com ([127.0.0.1]) by localhost (mail1.arhont.com [127.0.0.1]) (amavisd-new, port 10032) with ESMTP id c7zjnfFNV_4E for ; Wed, 22 Oct 2014 17:40:56 +0100 (BST) Received: from localhost (localhost [127.0.0.1]) by mail1.arhont.com (Postfix) with ESMTP id 16706980126 for ; Wed, 22 Oct 2014 17:40:56 +0100 (BST) X-Virus-Scanned: amavisd-new at arhont.com Received: from mail.arhont.com ([127.0.0.1]) by localhost (mail1.arhont.com [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id LrrtFT6cQUGO for ; Wed, 22 Oct 2014 17:40:55 +0100 (BST) Received: from mail1.arhont.com (mail1.arhont.com [178.248.108.132]) by mail1.arhont.com (Postfix) with ESMTP id E047A980116 for ; Wed, 22 Oct 2014 17:40:55 +0100 (BST) Date: Wed, 22 Oct 2014 17:40:55 +0100 (BST) From: Andrei Mikhailovsky To: users@cloudstack.apache.org Message-ID: <25652070.9532.1413996052278.JavaMail.andrei@tuchka> In-Reply-To: <1025288215.1383666.1413989630474.JavaMail.zimbra@saao.ac.za> References: <1025288215.1383666.1413989630474.JavaMail.zimbra@saao.ac.za> Subject: Re: HA issue and Xen resets MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_9531_670338.1413996052278" X-Mailer: Zimbra 8.0.7_GA_6021 (Zimbra Desktop/7.2.5_12038_Linux) Thread-Topic: HA issue and Xen resets Thread-Index: bzoyzWxMstpcTqUutH4S5OogK/NWsp7x3vqJ X-Virus-Scanned: ClamAV using ClamSMTP ------=_Part_9531_670338.1413996052278 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Garith, Have you used HA system offering for the vms that were running on the disconnected server? you need to make sure they are HA enabled. I've had a play a few months ago with XenServer 6.2 and ACS 4.2.1 and HA worked perfectly well. I've disconnected the network (including storage network) from the host and after about 30 seconds the host was shown in Alert state. Shortly after i've seen the management server complaining that the guest vm has stopped unexpectidly and it was automatically restarted on another xenserver. What I haven't checked is what happened to the disconnected server. I have been side tracked and never checked. I think the default behaviour of the XenServer should be the host server restart. This makes sure that all vms are stopped. Can someone confirm this please? thanks Andrei ----- Original Message ----- > From: "Garith Dugmore" > To: users@cloudstack.apache.org > Sent: Wednesday, 22 October, 2014 3:53:50 PM > Subject: HA issue and Xen resets > Hi All, > I'm new to cloudstack and busy testing out ACS 4.3.1 on Centos 6.4 > using Xenserver 6.2. I have the management server setup and have 2 > xen servers that I'm testing out at the moment; specifically the HA > functionality. > After getting an instance up and running I yanked the network cable > out of the one xen server and awaited the HA awesomeness to kick in. > Both hosts still remained in "Up" state even though the one was no > longer pingable and the instance that was hosted on that xen host > still showed "Running" even though I also couldn't ping it. After > some reading one suggestion was setting 'alert.wait' to 30 and > restarting cloudstack-management. After that didn't seem to do > anything after waiting a while I rebooted the management server all > together and found that both hosts were marked as disconnected. > I've tried going in and out of maintenance mode and ended up deleting > the one xen host that was still reachable thinking I could just > re-add it but I received an error in doing so. I have read somewhere > that once you've either reinstalled the management server or removed > a xen host you need to re-install the xen host. Is this true? I was > hoping for a factory reset command of some sort. > Besides my obvious HA problems and host disconnect issues which I'd > love some pointers on are there any pointers on Xen server resets? > Note I have attempted a 4.4.0 install on centos and after a couple > issues that I can no longer recall I ended up with a way easier > install on 4.3.1 which is why I've stuck with it for now. > Any pointers will be greatly appreciated. Willing to try anything! > -- > Garith Dugmore > South African Astronomical Observatory > and Southern African Large Telescope ------=_Part_9531_670338.1413996052278--