Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 93CF1D316 for ; Mon, 10 Dec 2012 15:01:09 +0000 (UTC) Received: (qmail 72693 invoked by uid 500); 10 Dec 2012 15:01:07 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 72484 invoked by uid 500); 10 Dec 2012 15:01:04 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 72460 invoked by uid 99); 10 Dec 2012 15:01:03 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 10 Dec 2012 15:01:03 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of edlinuxguru@gmail.com designates 209.85.210.174 as permitted sender) Received: from [209.85.210.174] (HELO mail-ia0-f174.google.com) (209.85.210.174) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 10 Dec 2012 15:00:56 +0000 Received: by mail-ia0-f174.google.com with SMTP id y25so4748406iay.5 for ; Mon, 10 Dec 2012 07:00:35 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=VQSG1oaTfbR0kexzUm7toZnY5/eRUTx9h0E5MTkfRyk=; b=aM7B5U5+2V3Ggol83PwIQnZRc17wztrJM0KfmyEv3eDFHcbmOfCFMBMiFTqvv57ZPk 1W8Ut0Mma/58LJ438eXjSf4FIaDEM0VisxAAWBIevMAOpzGhFVKiZcuy+Erks5EWz1VV iBbpHtV3yvw3SuYlbVHyVFU8B87hjmry8LA29Dq/I9kZLgsWuKhIP3I0Py2bwVTlh9lH 2kIbZS/k1ISbJQFw0idP1/eUmlPP5octB+h4Kbj8Vb5FlMSuqku/sLXVJqrUdQ4hs1d2 NeMHMqngdA8BsgnhZGzukwh1h+xB2LFBNz1dv8jH3Ll6wsjsZkbQ0xJDbV+nursZgERp qPUQ== MIME-Version: 1.0 Received: by 10.50.6.169 with SMTP id c9mr6892841iga.24.1355151629034; Mon, 10 Dec 2012 07:00:29 -0800 (PST) Received: by 10.64.97.106 with HTTP; Mon, 10 Dec 2012 07:00:28 -0800 (PST) In-Reply-To: References: Date: Mon, 10 Dec 2012 10:00:28 -0500 Message-ID: Subject: Re: Virtual Nodes, lots of physical nodes and potentially increasing outage count? From: Edward Capriolo To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=e89a8f64671777247304d080d399 X-Virus-Checked: Checked by ClamAV on apache.org --e89a8f64671777247304d080d399 Content-Type: text/plain; charset=ISO-8859-1 Assuming you need to work with quorum in a non-vnode scenario. That means that if 2 nodes in a row in the ring are down some number of quorum operations will fail with UnavailableException (TimeoutException right after the failures). This is because the for a given range of tokens quorum will be impossible, but quorum will be possible for others. In a vnode world if any two nodes are down, then the intersection of vnode token ranges they have are unavailable. I think it is two sides of the same coin. On Mon, Dec 10, 2012 at 7:41 AM, Richard Low wrote: > Hi Tyler, > > You're right, the math does assume independence which is unlikely to be > accurate. But if you do have correlated failure modes e.g. same power, > racks, DC, etc. then you can still use Cassandra's rack-aware or DC-aware > features to ensure replicas are spread around so your cluster can survive > the correlated failure mode. So I would expect vnodes to improve uptime in > all scenarios, but haven't done the math to prove it. > > Richard. > --e89a8f64671777247304d080d399 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Assuming you need to work with quorum in a non-vnode scenario. That me= ans that if 2 nodes in a row in the ring are down some number of quorum ope= rations will fail with UnavailableException (TimeoutException right after t= he failures). This is because the for a given range of tokens quorum will b= e impossible, but quorum will be possible for others.

In a vnode world if any two nodes are down, =A0then the= intersection of vnode token ranges they have are unavailable.=A0

I think it is two sides of the same coin.=A0

=

On Mon, Dec 10, 2012 at 7:41 AM, Richard Low= <= rlow@acunu.com> wrote:
Hi Tyler,

You're right, the math do= es assume independence which is unlikely to be accurate. =A0But if you do h= ave correlated failure modes e.g. same power, racks, DC, etc. then you can = still use Cassandra's rack-aware or DC-aware features to ensure replica= s are spread around so your cluster can survive the correlated failure mode= . =A0So I would expect vnodes to improve uptime in all scenarios, but haven= 't done the math to prove it.

Richard.

--e89a8f64671777247304d080d399--