Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id AB95C200D2F for ; Wed, 1 Nov 2017 15:29:24 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id AA04D160BEA; Wed, 1 Nov 2017 14:29:24 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id C79CC160BE6 for ; Wed, 1 Nov 2017 15:29:23 +0100 (CET) Received: (qmail 32654 invoked by uid 500); 1 Nov 2017 14:29:22 -0000 Mailing-List: contact users-help@cloudstack.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@cloudstack.apache.org Delivered-To: mailing list users@cloudstack.apache.org Received: (qmail 32642 invoked by uid 99); 1 Nov 2017 14:29:22 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Nov 2017 14:29:22 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 7CD7AC6C02 for ; Wed, 1 Nov 2017 14:29:21 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.301 X-Spam-Level: X-Spam-Status: No, score=-0.301 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-2.8, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd1-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=bw-sw-com.20150623.gappssmtp.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id wXSHNpLpDg02 for ; Wed, 1 Nov 2017 14:29:17 +0000 (UTC) Received: from mail-io0-f175.google.com (mail-io0-f175.google.com [209.85.223.175]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 217815F6BE for ; Wed, 1 Nov 2017 14:29:16 +0000 (UTC) Received: by mail-io0-f175.google.com with SMTP id m16so6467146iod.1 for ; Wed, 01 Nov 2017 07:29:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bw-sw-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=JsKcxa9JgC8W+hEoegu1dDIa228OIcne63SeaQJSRSU=; b=v781zgPBu0PsZWyPx2iPcwQswtHnblFr2ovgZcoRdmA6NRkms+HeuiSa0KxB3YdhCs sQR1vTRdgbpeY6ydb+6y9V+0S1UyHqqElm9kDP2oI04nRGrx47RIIjB6NsBMZ8wr2Y+r Guhok0LoloOKrngsMFKoqIV9h2EXch0wA4ZVhI85AxtrvtAPvJ6w6UaBbM3PjgttWqL4 CE1jvfqgv4s5qtc1NO/znUSpsj1bIcqpfpg0gJY4YGu0TsNAzB2C+CPyI9R3JqfwOMcC qHJlpX1zMn9/ZOd4qH4sZQjdk2Wqk0Rn0JMD8EIm5Mjb7RPkzteeE0+4lRZUpuJ8Ms3Z lNjQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=JsKcxa9JgC8W+hEoegu1dDIa228OIcne63SeaQJSRSU=; b=atAVaRBTF65nUQhxqAiacqzAW5CrKgvtA8jfHwRM6/VpJk0zbb5gncC3RElJOMdFcK Y/Lic2IcMV9/UDPVrPb46+8xRJoAgef7rbxZDthFvOvhCteIift2dQorMszeT9UaNPst QAjLX8QID7GD/mO7IyMvtoiDxf0wBtapuR+Mj5GQCBY6jAa0PLN5IxEcIQfijoy5s/uK nf05npbRGFkmNoW31D6e7VUgyljD5OwUgrpu8vAxvBjWCH0zkQoghYf7muSFKdUSDcnZ 9aIB68dsHpCGTcaKCDlIK8go/hdDV5augRhEDcea8bEUVc/8fO+z2+oi206O4CPnAhpp Zrww== X-Gm-Message-State: AMCzsaWMls62SV4jbDYkoiwyhVB0LkOzyXE/z8aF1uKxoAzdrI5fEJS2 lD0nfJnStdAHafn/KYwHrYExnoLRsK1dMzO68riYXv0C X-Google-Smtp-Source: ABhQp+TZGJiYGNQxjaG/9MdzW4ANR0kJDtJ/nQHPyn7P6/SxP18m31tviC3OHw+sNefyty/o/gfW6r9YzXW3459oflY= X-Received: by 10.36.89.149 with SMTP id p143mr688463itb.17.1509546556018; Wed, 01 Nov 2017 07:29:16 -0700 (PDT) MIME-Version: 1.0 Received: by 10.107.183.23 with HTTP; Wed, 1 Nov 2017 07:29:15 -0700 (PDT) Received: by 10.107.183.23 with HTTP; Wed, 1 Nov 2017 07:29:15 -0700 (PDT) In-Reply-To: References: <00762B56-6DE6-4BBB-A18E-78E5836CEB34@shapeblue.com> From: Ivan Kudryavtsev Date: Wed, 1 Nov 2017 21:29:15 +0700 Message-ID: Subject: Re: Problems with KVM HA & STONITH To: users@cloudstack.apache.org Content-Type: multipart/alternative; boundary="001a11429b783e6643055cecb25b" archived-at: Wed, 01 Nov 2017 14:29:24 -0000 --001a11429b783e6643055cecb25b Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Also you can run ceph if you need HA. I met setup description which uses compute nodes for ceph cluster nodes simultaneously. 1 =D0=BD=D0=BE=D1=8F=D0=B1. 2017 =D0=B3. 21:11 =D0=BF=D0=BE=D0=BB=D1=8C=D0= =B7=D0=BE=D0=B2=D0=B0=D1=82=D0=B5=D0=BB=D1=8C "Simon Weller" =D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0=D0=BB: > James, > > > Try just configuring a single NFS server and see if your setup works. If > you have 3 NFS shares, across all 3 hosts, i'm wondering whether ACS is > picking the one you rebooted as the storage for your VMs and when that > storage goes away (when you bounce the host), all storage for your VMs > vanishes and ACS tries to reboot your other hosts. > > > Normally in a simple ACS setup, you would have a separate storage server > that can serve up NFS to all hosts. If a host dies, then a VM would be > brought up on a spare hosts since all hosts have access to the same stora= ge. > > Your other option is to use local storage, but that won't provide HA. > > > - Si > > > ________________________________ > From: McClune, James > Sent: Monday, October 30, 2017 2:26 PM > To: users@cloudstack.apache.org > Subject: Re: Problems with KVM HA & STONITH > > Hi Dag, > > Thank you for responding back. I am currently running ACS 4.9 on an Ubunt= u > 14.04 VM. I have the three nodes, each having about 1TB of primary storag= e > (NFS) and 1TB of secondary storage (NFS). I added each NFS share into ACS= . > All nodes are in a cluster. > > Maybe I'm not understanding the setup or misconfigured something. I'm > trying to setup an HA environment where if one node goes down, running an > HA marked VM, the VM will start on another host. When I simulate a networ= k > disconnect or reboot of a host, all of the nodes go down (STONITH?). > > I am unsure on how to setup an HA environment, if all the nodes in the > cluster go down. Any help is much appreciated! > > Thanks, > James > > On Mon, Oct 30, 2017 at 3:49 AM, Dag Sonstebo > wrote: > > > Hi James, > > > > I think you possibly have over-configured your KVM hosts. If you use N= FS > > (and no clustered file system like CLVM) then there should be no need t= o > > configure STONITH. CloudStack takes care of your HA, so this is not > > something you offload to the KVM host. > > > > (As mentioned the only time I have played with STONITH and CloudStack w= as > > for CLVM =E2=80=93 and I eventually found it not fit for purpose, too u= nstable > and > > causing too many issues like you describe. Note this was for block > storage > > though =E2=80=93 not NFS). > > > > Regards, > > Dag Sonstebo > > Cloud Architect > > ShapeBlue > > > > On 28/10/2017, 03:40, "Ivan Kudryavtsev" > wrote: > > > > Hi. If the node losts nfs host it reboots (acs agent behaviour). If > you > > really have 3 storages, you'll go clusterwide reboot everytime your > > host is > > down. > > > > 28 =D0=BE=D0=BA=D1=82. 2017 =D0=B3. 3:02 =D0=BF=D0=BE=D0=BB=D1=8C= =D0=B7=D0=BE=D0=B2=D0=B0=D1=82=D0=B5=D0=BB=D1=8C "Simon Weller" > > > > =D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0=D0=BB: > > > > > Hi James, > > > > > > > > > Can you elaborate a bit further on the storage? You say you're > > running NFS > > > on all 3 nodes, can you explain how it is setup? > > > > > > Also, what version of ACS are you running? > > > > > > > > > - Si > > > > > > > > > > > > > > > ________________________________ > > > From: McClune, James > > > Sent: Friday, October 27, 2017 2:21 PM > > > To: users@cloudstack.apache.org > > > Subject: Problems with KVM HA & STONITH > > > > > > Hello Apache CloudStack Community, > > > > > > My setup consists of the following: > > > > > > - Three nodes (NODE1, NODE2, and NODE3) > > > NODE1 is running Ubuntu 16.04.3, NODE2 is running Ubuntu 16.04.3, > > and NODE3 > > > is running Ubuntu 14.04.5. > > > - Management Server (running on separate VM, not in cluster) > > > > > > The three nodes use KVM as the hypervisor. I also configured > primary > > and > > > secondary storage on all three of the nodes. I'm using NFS for th= e > > primary > > > & secondary storage. VM operations work great. Live migration wor= ks > > great. > > > > > > However, when a host goes down, the HA functionality does not wor= k > > at all. > > > Instead of spinning up the VM on another available host, the down > > host > > > seems to trigger STONITH. When STONITH happens, all hosts in the > > cluster go > > > down. This not only causes no HA, but also downs perfectly good > > VM's. I > > > have read countless articles and documentation related to this > > issue. I > > > still cannot find a viable solution for this issue. I really want > to > > use > > > Apache CloudStack, but cannot implement this in production when > > STONITH > > > happens. > > > > > > I think I have something misconfigured. I thought I would reach o= ut > > to the > > > CloudStack community and ask for some friendly assistance. > > > > > > If there is anything (system-wise) you request in order to furthe= r > > > troubleshoot this issue, please let me know and I'll send. I > > appreciate any > > > help in this issue! > > > > > > -- > > > > > > Thanks, > > > > > > James > > > > > > > > > > > Dag.Sonstebo@shapeblue.com > > www.shapeblue.com > [http://www.shapeblue.com/wp-content/uploads/2017/06/logo.png]< > http://www.shapeblue.com/> > > Shapeblue - The CloudStack Company > www.shapeblue.com > Rapid deployment framework for Apache CloudStack IaaS Clouds. CSForge is = a > framework developed by ShapeBlue to deliver the rapid deployment of a > standardised ... > > > > > 53 Chandos Place, Covent Garden, London WC2N 4HSUK > > @shapeblue > > > > > > > > > > > -- > > > > James McClune > > Technical Support Specialist > > Norwalk City Schools > > Phone: 419-660-6590 > > mcclunej@norwalktruckers.net > --001a11429b783e6643055cecb25b--