From: Riyad Kalla <rkalla@gmail.com>
Date: Mon, 7 Nov 2011 06:03:47 -0700
Subject: Re: Will writes with < ALL consistency eventually propagate?
To: user@cassandra.apache.org

Ah! OK, I was interpreting what you were saying to mean that if my RF was
too high, then the ring would die if I lost one node.

Ultimately what I want (I think) is:

Replication Factor: 5 (aka "all of my nodes")
Consistency Level: 2

Put another way, when I write a value, I want it to exist on two servers
*at least* before I consider that write "successful" enough for my code to
continue, but in the background I would like Cassandra to keep copying that
value around at its leisure until all the ring nodes know about it.

This sounds like what I need. Thanks for pointing me in the right direction.

Best,
Riyad

On Mon, Nov 7, 2011 at 5:47 AM, Anthony Ikeda wrote:

> Riyad, I'm also just getting to know the different settings and values
> myself :)
>
> I believe (and it also depends on your config) CL.ONE should tolerate the
> loss of a node if your RF is 5. Once you increase the CL, then if you
> lose a node the CL may not be met and you will get exceptions returned.
>
> Sent from my iPhone
>
> On 07/11/2011, at 4:32, Riyad Kalla <rkalla@gmail.com> wrote:
>
> Anthony and Jaydeep, thank you for weighing in. I am glad to see that
> they are two different values (makes more sense mentally to me).
>
> Anthony, what you said caught my attention: "to ensure all nodes have a
> copy you may not be able to survive the loss of a single node." -- why
> would this be the case?
>
> I assumed (incorrectly?)
> that a node would simply disappear off the map until I could bring it
> back up again, at which point it would slowly retrieve from other members
> of the ring all the missing values that it didn't get while it was down.
> Is this the wrong understanding?
>
> If forcing a replication factor equal to the number of nodes in my ring
> will cause a hard stop when one node goes down (as I understood your
> comment to mean), it seems to me I should go with a much lower
> replication factor... something along the lines of 3, or roughly
> ceiling(N / 2), and just deal with the latency when one of the nodes has
> to route a request to another server because it doesn't contain the value.
>
> Is there a better way to accomplish what I want, or is keeping the
> replication factor that aggressively high generally a bad thing -- using
> Cassandra in the "wrong" way?
>
> Thank you for the help.
>
> -Riyad
>
> On Sun, Nov 6, 2011 at 11:14 PM, chovatia jaydeep <
> chovatia_jaydeep@yahoo.co.in> wrote:
>
>> Hi Riyad,
>>
>> You can set replication = 5 (the number of replicas) and write with
>> CL = ONE. There is no hard requirement from Cassandra to write with
>> CL = ALL to replicate the data unless you need it. Considering your
>> example: if you write with CL = ONE, your data will still eventually be
>> replicated to all 5 replicas.
>>
>> Thank you,
>> Jaydeep
>> ------------------------------
>> *From:* Riyad Kalla
>> *To:* "user@cassandra.apache.org"
>> *Sent:* Sunday, 6 November 2011 9:50 PM
>> *Subject:* Will writes with < ALL consistency eventually propagate?
>>
>> I am new to Cassandra and was curious about the following scenario...
>>
>> Let's say I have a ring of 5 servers. Ultimately I would like each
>> server to be a full replica of the next (master-master-*).
>>
>> In a presentation I watched today on Cassandra, the presenter mentioned
>> that the ring members will shard data and route your requests to the
>> right host when they come in to a server that doesn't physically contain
>> the value you wanted. To the requesting client this is seamless except
>> for the added latency.
>>
>> If I wanted to avoid the routing and latency and ensure every server had
>> the full data set, would I have to write with a consistency level of ALL
>> and wait for all of those writes to return in my code, or could I write
>> with a CL of 1 or 2 and let the ring propagate the rest of the copies to
>> the other servers in the background after my code has continued
>> executing?
>>
>> I don't mind eventual consistency in my case, but I do (eventually) want
>> all nodes to have all values, and I cannot tell if this is the default
>> behavior, or if sharding is the default and I can only force duplicates
>> onto the other servers explicitly with a CL of ALL.
>>
>> Best,
>> Riyad
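A note for archive readers: the guarantee Riyad is weighing here reduces to simple arithmetic. A read is guaranteed to overlap the most recent write on at least one replica when (write CL) + (read CL) > (replication factor). A minimal sketch in plain Python (no Cassandra required; the function name is illustrative, not a driver API):

```python
def is_strongly_consistent(rf: int, write_cl: int, read_cl: int) -> bool:
    """Reads overlap the latest write on >= 1 replica iff W + R > RF."""
    return write_cl + read_cl > rf

# Riyad's proposed setup: RF=5, writes at CL=2.
# A read at CL=2 can miss the freshest value (2 + 2 <= 5)...
print(is_strongly_consistent(5, 2, 2))   # -> False
# ...but a read at CL=4 is guaranteed to see it (2 + 4 > 5).
print(is_strongly_consistent(5, 2, 4))   # -> True
# QUORUM writes + QUORUM reads (3 + 3 > 5) are the usual middle ground.
print(is_strongly_consistent(5, 3, 3))   # -> True
```

So RF=5 with CL=2 writes is fine for durability and availability, but readers who need the newest value must read at a CL high enough that W + R exceeds RF.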
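Jaydeep's point -- that a CL=ONE write still reaches all RF replicas eventually -- can be illustrated with a toy model. This is only a sketch of the idea (acknowledge after `cl` replicas, deliver to the rest in the background), not Cassandra's actual replication machinery; the class and method names are invented for illustration:

```python
from typing import Dict, List, Tuple

class ToyRing:
    """Toy model: a write acks after `cl` replicas; the rest get it later."""

    def __init__(self, nodes: int):
        self.replicas: List[Dict[str, str]] = [{} for _ in range(nodes)]
        # Deferred deliveries: (replica_index, key, value).
        self.pending: List[Tuple[int, str, str]] = []

    def write(self, key: str, value: str, cl: int) -> bool:
        # Synchronously apply to the first `cl` replicas, queue the rest.
        for i in range(len(self.replicas)):
            if i < cl:
                self.replicas[i][key] = value
            else:
                self.pending.append((i, key, value))
        return True  # "successful" once `cl` replicas have acknowledged

    def anti_entropy(self) -> None:
        # Background propagation (stand-in for hinted handoff / repair).
        for i, key, value in self.pending:
            self.replicas[i][key] = value
        self.pending.clear()

ring = ToyRing(nodes=5)
ring.write("k", "v", cl=2)
print(sum("k" in r for r in ring.replicas))  # 2 replicas right after the write
ring.anti_entropy()
print(sum("k" in r for r in ring.replicas))  # all 5 once propagation runs
```

This mirrors the thread's conclusion: the client unblocks as soon as the consistency level is satisfied, and replication to the remaining replicas happens asynchronously.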