Return-Path: X-Original-To: apmail-cloudstack-dev-archive@www.apache.org Delivered-To: apmail-cloudstack-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5921518212 for ; Mon, 4 May 2015 10:04:51 +0000 (UTC) Received: (qmail 24225 invoked by uid 500); 4 May 2015 10:04:50 -0000 Delivered-To: apmail-cloudstack-dev-archive@cloudstack.apache.org Received: (qmail 24171 invoked by uid 500); 4 May 2015 10:04:50 -0000 Mailing-List: contact dev-help@cloudstack.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cloudstack.apache.org Delivered-To: mailing list dev@cloudstack.apache.org Received: (qmail 24160 invoked by uid 99); 4 May 2015 10:04:50 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 May 2015 10:04:50 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: message received from 54.76.25.247 which is an MX secondary for dev@cloudstack.apache.org) Received: from [54.76.25.247] (HELO mx1-eu-west.apache.org) (54.76.25.247) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 May 2015 10:04:23 +0000 Received: from mail-wi0-f176.google.com (mail-wi0-f176.google.com [209.85.212.176]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 3B4102026C for ; Mon, 4 May 2015 10:04:22 +0000 (UTC) Received: by wiun10 with SMTP id n10so104431467wiu.1 for ; Mon, 04 May 2015 03:04:22 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:from:date:message-id:subject:to :content-type; bh=QI3+SZAirCTF0Sv8xGTYNAfqPujMlUAaTll3HoTsBlg=; b=jfiXoVF6vw5Pqt2R3T5cioVIUGYwiANQ8ulOlmUhPTmTbu91MvejValBg76lR6f/qb tVSxIzd89v+OgSF5hkBfA+zwnyuOxX1so33oM8sFXI6CrwI9R73yNZt1UOsd5/29xQf7 bsobwdJUnTUPI7rGVT0Rj/O08A+VJvN1FOlQjGNDvGlA564Z+yxPw6GEMiqkgznc/fir sem7ACn859JLuFCG+SsrPnbfaL/TyFtyLJrAe82+C+pw+JoTmFEg5R1kuWGuEguud3FR BAFui2WM2z/if6+zYV49xop4XF7Cld8rMAQM5Mf3dHllbhlEl2F8oUEJx9EsHwrKkM4j ajiA== X-Gm-Message-State: ALoCoQml709IfIUYaajF6/WjklTMq2aeKUapipGwL/0HuaA/HX8ESmkAJJ6UojsYRUzhg/IBMJIC X-Received: by 10.180.99.166 with SMTP id er6mr18202355wib.58.1430733861879; Mon, 04 May 2015 03:04:21 -0700 (PDT) MIME-Version: 1.0 From: Remi Bergsma Date: Mon, 04 May 2015 10:04:21 +0000 Message-ID: Subject: [DISCUSS] XenServer and HA: the way forward To: "dev@cloudstack.apache.org" Content-Type: multipart/alternative; boundary=f46d04428ef69b183a05153eafe0 X-Virus-Checked: Checked by ClamAV on apache.org --f46d04428ef69b183a05153eafe0 Content-Type: text/plain; charset=UTF-8 Hi all, Since CloudStack 4.4 the implementation of HA in CloudStack was changed to use the XenHA feature of XenServer. As of 4.4, it is expected to have XenHA enabled for the pool (not for the VMs!) and so XenServer will be the one to elect a new pool master, whereas CloudStack did it before. Also, XenHA takes care of fencing the box instead of CloudStack should storage be unavailable. To be exact, they both try to fence but XenHA is usually faster. To be 100% clear: HA on VMs is in all cases done by CloudStack. It's just that without a pool master, no VMs will be recovered anyway. This brought some headaches to me, as first of all I didn't know. We probably need to document this somewhere. This is important, because without XenHA turned on you'll not get a new pool master (a behaviour change). Personally, I don't like the fact that we have "two captains" in case something goes wrong. But, some say they like this behaviour. I'm OK with both, as long as one can choose whatever suits their needs best. In Austin I talked to several people about this. We came up with the idea to have CloudStack check whether XenHA is on or not. If it is, it does the current 4.4+ behaviour (XenHA selects new pool master). When it is not, we do the CloudStack 4.3 behaviour where CloudStack is fully in control. I also talked to Tim Mackey and he wants to help implement this, but he doesn't have much time. The idea is to have someone else join in to code the change and then Tim will be able to help out on a regularly basis should we need in depth knowledge of XenServer or its implementation in CloudStack. Before we kick this off, I'd like to discuss and agree that this is the way forward. Also, if you're interested in joining this effort let me know and I'll kick it off. Regards, Remi --f46d04428ef69b183a05153eafe0--