Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 528DB200CD9 for ; Thu, 3 Aug 2017 11:15:29 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 50CB116B369; Thu, 3 Aug 2017 09:15:29 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id BDB5F16B361 for ; Thu, 3 Aug 2017 11:15:28 +0200 (CEST) Received: (qmail 47904 invoked by uid 500); 3 Aug 2017 09:15:27 -0000 Mailing-List: contact dev-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ignite.apache.org Delivered-To: mailing list dev@ignite.apache.org Received: (qmail 47893 invoked by uid 99); 3 Aug 2017 09:15:27 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Aug 2017 09:15:27 +0000 Received: from mail-lf0-f42.google.com (mail-lf0-f42.google.com [209.85.215.42]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 3E6691A00C7 for ; Thu, 3 Aug 2017 09:15:27 +0000 (UTC) Received: by mail-lf0-f42.google.com with SMTP id o85so3441653lff.3 for ; Thu, 03 Aug 2017 02:15:27 -0700 (PDT) X-Gm-Message-State: AHYfb5hjLyCDI659hnSoZInh/BIzEdDkmoMbkz0HIPLl9BleIGQ30Quz Y/frUf1wt+3y4Lcfu2hS9iPuQIuTzu42 X-Received: by 10.25.145.71 with SMTP id y7mr435017lfj.17.1501751725705; Thu, 03 Aug 2017 02:15:25 -0700 (PDT) MIME-Version: 1.0 Received: by 10.46.83.12 with HTTP; Thu, 3 Aug 2017 02:15:24 -0700 (PDT) In-Reply-To: References: From: Yakov Zhdanov Date: Thu, 3 Aug 2017 12:15:24 +0300 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Cluster auto activation design proposal To: dev@ignite.apache.org Content-Type: multipart/alternative; boundary="94eb2c1cd06a26e8a60555d5d2e0" archived-at: Thu, 03 Aug 2017 09:15:29 -0000 --94eb2c1cd06a26e8a60555d5d2e0 Content-Type: text/plain; charset="UTF-8" > * Based on some sort of policies when the actual cluster topology differs too much from the baseline or when some critical condition happens (e.g., when there are no more backups for a partition) Good point, Alex! I would even go further. If cluster is active and under load and nodes continue joining and leaving then we can have several BT's that are possible to restart on - the main condition is to have all the up to date data partitions. I.e. if you have 4 servers and 3 backups most probably you can have all the data with 2, 3 and, of course, 4 nodes. Makes sense? I would also think of different name. Topology (for me) also implies the version, but here only nodes carrying data are important. How about "restart nodes set"? --Yakov --94eb2c1cd06a26e8a60555d5d2e0--