Subject: Re: Quorum: killing 1 out of 3 server kills the cluster (?)
From: David Boxenhorn
To: user@cassandra.apache.org
Date: Thu, 9 Dec 2010 18:46:39 +0200

If that is what you want, use CL=ONE.
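
A rough sketch of the counting rule the thread is discussing (illustrative Python only, not Cassandra code; the helper names are made up):

    # How many replicas must answer at each consistency level, and
    # whether a request on a key can still succeed when `down` of its
    # replicas are unreachable.
    def required(cl, rf):
        return {"ONE": 1, "QUORUM": rf // 2 + 1, "ALL": rf}[cl]

    def available(cl, rf, down):
        return rf - down >= required(cl, rf)

    print(available("ONE", 2, 1))     # True  -> CL=ONE keeps working
    print(available("QUORUM", 2, 1))  # False -> UnavailableException

With RF=2, one dead replica of a key already leaves only CL=ONE satisfiable for that key.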

On Thu, Dec 9, 2010 at 6:43 PM, Timo Nentwig <timo.nentwig@toptarif.de> wrote:

On Dec 9, 2010, at 17:39, David Boxenhorn wrote:

> In other words, if you want to use QUORUM, you need to set RF >= 3.
>
> (I know because I had exactly the same problem.)

I naively assumed that if I kill either node that holds N1 (i.e. node 1 or 3), N1 will still remain on another node. Only if both fail do I actually lose data. But apparently this is not how it works...

> On Thu, Dec 9, 2010 at 6:05 PM, Sylvain Lebresne <sylvain@yakaz.com> wrote:
> It's 2 out of the number of replicas, not the number of nodes. At RF=2,
> you have 2 replicas. And since quorum is also 2 with that replication
> factor, you cannot lose a node, otherwise some queries will end up with
> an UnavailableException.
>
> Again, this is not related to the total number of nodes. Even with 200
> nodes, if you use RF=2, you will have some queries that fail (although
> much less often than what you are probably seeing).
>
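
A toy model of the placement Sylvain describes (assuming SimpleStrategy-style placement, where each key is stored on its primary node and the next node clockwise on the ring; illustrative Python, not Cassandra's actual code):

    nodes = ["node1", "node2", "node3"]

    def replicas(primary, rf=2):
        # A key's replicas: its primary node plus the next rf-1 nodes
        # clockwise on the ring.
        return [nodes[(primary + i) % len(nodes)] for i in range(rf)]

    for i in range(len(nodes)):
        print(nodes[i], "->", replicas(i))
    # node1 -> ['node1', 'node2']
    # node2 -> ['node2', 'node3']
    # node3 -> ['node3', 'node1']

Whichever single node you kill is one of the only two replicas for roughly a third of the keys, so QUORUM (2 of 2) fails for those keys, no matter how many nodes the cluster has in total.
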
> On Thu, Dec 9, 2010 at 5:00 PM, Timo Nentwig <timo.nentwig@toptarif.de> wrote:
> >
> > On Dec 9, 2010, at 16:50, Daniel Lundin wrote:
> >
> >> Quorum is really only useful when RF > 2, since for a quorum to
> >> succeed RF/2+1 replicas must be available.
> >
> > 2/2+1 == 2 and I killed 1 of 3, so... I don't get it.
> >
> >> This means for RF = 2, consistency levels QUORUM and ALL yield the same result.
> >>
> >> /d
> >>
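
Tabulating that RF/2 + 1 rule (a quick sketch, same arithmetic as above):

    for rf in (1, 2, 3, 5):
        quorum = rf // 2 + 1
        print(f"RF={rf}: quorum={quorum}, replicas that may be down={rf - quorum}")
    # RF=1: quorum=1, replicas that may be down=0
    # RF=2: quorum=2, replicas that may be down=0   (QUORUM == ALL)
    # RF=3: quorum=2, replicas that may be down=1
    # RF=5: quorum=3, replicas that may be down=2

So RF=3 is the smallest replication factor at which QUORUM survives one dead replica.
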
> >> On Thu, Dec 9, 2010 at 4:40 PM, Timo Nentwig <timo.nentwig@toptarif.de> wrote:
> >>> Hi!
> >>>
> >>> I've 3 servers running (0.7rc1) with a replication_factor of 2 and
> >>> use quorum for writes. But when I shut down one of them,
> >>> UnavailableExceptions are thrown. Why is that? Isn't the point of
> >>> quorum and a fault-tolerant DB that it continues with the remaining
> >>> 2 nodes and redistributes the data to the broken one as soon as it's
> >>> up again?
> >>>
> >>> What may I be doing wrong?
> >>>
> >>> thx
> >>> tcn
> >
> >
>

