Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of agarwalpranaya@gmail.com
 designates 74.125.82.43 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CAL+ArfVzf=WXDc4JEfcNU8JDifE=oT3pHc0bxxcdmrFKSab41A@mail.gmail.com>
References: 
 <CAK7S3dH0yJuYAzX5JmzqU=BuKHeTUFL8kuf8UW8O3bH=PW3gXg@mail.gmail.com>
	<1428630475.53523.YahooMailAndroidMobile@web192906.mail.sg3.yahoo.com>
	<CAL+ArfVzf=WXDc4JEfcNU8JDifE=oT3pHc0bxxcdmrFKSab41A@mail.gmail.com>
Date: Fri, 10 Apr 2015 10:16:30 -0700
Message-ID: 
 <CABEMVoPv1dSkDvum243O_EnR9uhCKcJ03w1JQf+rvkaJZ=w8kw@mail.gmail.com>
Subject: Re: Cassandra Data Loss
From: Pranay Agarwal <agarwalpranaya@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=f46d044402e4e4bcac051361ecb1

--f46d044402e4e4bcac051361ecb1
Content-Type: text/plain; charset=UTF-8

Thanks Anuj and Jens.

So, my initial assumption is correct that cassandra will *attempt* to
replicate data irrespective of the CL value. However, if those asynchronous
calls to replicate is failed, is there any retrial done by cassandra? Will
a full cluster wider node repair will take care this and guarantee the
*all* data is now replicated RF times?

We really don't care about reading stale data that much, but we really want
the data to be guaranteed to be replicated 3 times, so that we don't loose
the data even if 2 nodes fail. We are doing this heavy write/read as
initial import and ideally I would like keep CL 1 so that client is not
blocked but at the same time we want cassandra to take care in background
or asynchronously and ensure replication.

On Fri, Apr 10, 2015 at 1:02 AM, Jens Rantil <jens.rantil@tink.se> wrote:

> Somewhat related: http://wiki.apache.org/cassandra/ReadRepair states
>
> Range scans are not per-key and do not do read repair.
>
>
> Does "key" in "per-key" refer to "partition key" or "partition+clustering
> key"?
>
> Cheers,
> Jens
>
> On Fri, Apr 10, 2015 at 3:47 AM, Anuj Wadehra <anujw_2003@yahoo.co.in>
> wrote:
>
>> Read repair and repair run as part of maintenance will make it
>> consistent. Read repair is usually done on only 10% of reads. You can tune
>> tune read_repair_chance property of cf to adjust that. Till a row is
>> repaired clients may return stale data if cl=1 is used for reads. I would
>> suggest that u should minimize dropping of mutations by tuning if thats the
>> case rather than fixing it.
>>
>> Thanks
>> Anuj Wadehra
>>
>> Sent from Yahoo Mail on Android
>> <https://overview.mail.yahoo.com/mobile/?.src=Android>
>> ------------------------------
>>   *From*:"Kurtis vel" <kurtisvelarde@gmail.com>
>> *Date*:Fri, 10 Apr, 2015 at 7:09 am
>> *Subject*:Re: Cassandra Data Loss
>>
>> Hi Anuj,
>>
>> Assuming cl=1 and rf=3.
>>
>> Will the data ever be consistent if an asynchronous replication call
>> fails?
>>
>> Is this where read repair comes in handy?
>>
>> thanks
>>
>> On Thu, Apr 9, 2015 at 6:24 PM, Anuj Wadehra <anujw_2003@yahoo.co.in>
>> wrote:
>>
>>> Cl=1 means that client will only block for one response. In case of
>>> writes other 2 replicas will be updated asynchronously and eventually
>>> updated. As you are running heavy load make sure that writes /mutations are
>>> not getting dropped using nodetool tpstats on all nodes. Under heavy loads
>>> Cassandra may drop writes and as these were asynchronous,client wont know
>>> about that.
>>>
>>> if cl=1 for both reads and writes. Some reads may return stale data.If
>>> you need absolute guarantee that reads always return up to date data go for
>>> strong consistency r cf + w cf greater than rf. Eg read at quorum and write
>>> at quorum.
>>>
>>> Thanks
>>> Anuj Wadehra
>>>
>>> Sent from Yahoo Mail on Android
>>> <https://overview.mail.yahoo.com/mobile/?.src=Android>
>>> ------------------------------
>>> *From*:"Pranay Agarwal" <agarwalpranaya@gmail.com>
>>> *Date*:Fri, 10 Apr, 2015 at 6:40 am
>>> *Subject*:Cassandra Data Loss
>>>
>>> Hi All.
>>>
>>>
>>> I am using 20 nodes cassandra cluster with RF=3 and CL=1. We are doing
>>> very write/read heavy operations (total 100k ops/sec).
>>>
>>> I have been assuming all along that all the data will be replicated in 3
>>> different place *irrespective of consistency level *as it's a very
>>> application/driver level config. Is that correct or Cassandra guarantees 3
>>> replica only when I also have CL as 3 as well?
>>>
>>>
>>> Thanks
>>> -Pranay
>>>
>>
>>
>
>
> --
> Jens Rantil
> Backend engineer
> Tink AB
>
> Email: jens.rantil@tink.se
> Phone: +46 708 84 18 32
> Web: www.tink.se
>
> Facebook <https://www.facebook.com/#!/tink.se> Linkedin
> <http://www.linkedin.com/company/2735919?trk=vsrp_companies_res_photo&trkInfo=VSRPsearchId%3A1057023381369207406670%2CVSRPtargetId%3A2735919%2CVSRPcmpt%3Aprimary>
>  Twitter <https://twitter.com/tink>
>

--f46d044402e4e4bcac051361ecb1
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Thanks Anuj and Jens.<div><br></div><div>So, my initial as=
sumption is correct that cassandra will *attempt* to replicate data irrespe=
ctive of the CL value. However, if those asynchronous calls to replicate is=
 failed, is there any retrial done by cassandra? Will a full cluster wider =
node repair will take care this and guarantee the *all* data is now replica=
ted RF times?</div><div><br></div><div>We really don&#39;t care about readi=
ng stale data that much, but we really want the data to be guaranteed to be=
 replicated 3 times, so that we don&#39;t loose the data even if 2 nodes fa=
il. We are doing this heavy write/read as initial import and ideally I woul=
d like keep CL 1 so that client is not blocked but at the same time we want=
 cassandra to take care in background or asynchronously and ensure replicat=
ion.</div></div><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">O=
n Fri, Apr 10, 2015 at 1:02 AM, Jens Rantil <span dir=3D"ltr">&lt;<a href=
=3D"mailto:jens.rantil@tink.se" target=3D"_blank">jens.rantil@tink.se</a>&g=
t;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0=
 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr">Somewha=
t related:=C2=A0<a href=3D"http://wiki.apache.org/cassandra/ReadRepair" tar=
get=3D"_blank">http://wiki.apache.org/cassandra/ReadRepair</a> states<div><=
br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8e=
x;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-styl=
e:solid;padding-left:1ex"><span style=3D"color:rgb(0,0,0);font-family:sans-=
serif;font-size:16px">Range scans are not per-key and do not do read repair=
.</span></blockquote><div><br></div><div>Does &quot;key&quot; in &quot;per-=
key&quot; refer to &quot;partition key&quot; or &quot;partition+clustering =
key&quot;?<br></div><div><br></div><div>Cheers,</div><div>Jens</div></div><=
div class=3D"gmail_extra"><div><div class=3D"h5"><br><div class=3D"gmail_qu=
ote">On Fri, Apr 10, 2015 at 3:47 AM, Anuj Wadehra <span dir=3D"ltr">&lt;<a=
 href=3D"mailto:anujw_2003@yahoo.co.in" target=3D"_blank">anujw_2003@yahoo.=
co.in</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"m=
argin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><table cellsp=
acing=3D"0" cellpadding=3D"0" border=3D"0"><tbody><tr><td valign=3D"top">Re=
ad repair and repair run as part of maintenance will make it consistent. Re=
ad repair is usually done on only 10% of reads. You can tune tune read_repa=
ir_chance property of cf to adjust that. Till a row is repaired clients may=
 return stale data if cl=3D1 is used for reads. I would suggest that u shou=
ld minimize dropping of mutations by tuning if thats the case rather than f=
ixing it.<br><br>Thanks<div>Anuj Wadehra<br><div><span><p><a href=3D"https:=
//overview.mail.yahoo.com/mobile/?.src=3DAndroid" target=3D"_blank">Sent fr=
om Yahoo Mail on Android</a></p> <hr></span><table cellspacing=3D"0" cellpa=
dding=3D"0" border=3D"0"> <tbody> <tr> <td valign=3D"top"> <div style=3D"fo=
nt-family:Roboto,sans-serif;color:#7e7d80"><b>From</b>:&quot;Kurtis vel&quo=
t; &lt;<a href=3D"mailto:kurtisvelarde@gmail.com" target=3D"_blank">kurtisv=
elarde@gmail.com</a>&gt;<br><b>Date</b>:Fri, 10 Apr, 2015 at 7:09
 am<br><b>Subject</b>:Re: Cassandra Data Loss<br><br></div><div><div> <div =
dir=3D"ltr"><div>Hi Anuj,<br clear=3D"none"><br clear=3D"none">Assuming cl=
=3D1 and <span style=3D"background:none repeat scroll 0% 0% yellow">rf</spa=
n>=3D3.<br clear=3D"none"><br clear=3D"none">Will the data ever be consiste=
nt if an asynchronous replication call fails?<br clear=3D"none"><br clear=
=3D"none">Is this where read repair comes in handy?<br clear=3D"none"><br c=
lear=3D"none"></div><div>thanks<br clear=3D"none"></div></div><div><div cla=
ss=3D"gmail_extra"><br clear=3D"none"><div class=3D"gmail_quote">On Thu, Ap=
r 9, 2015 at 6:24 PM, Anuj Wadehra <span dir=3D"ltr">&lt;<a rel=3D"nofollow=
" shape=3D"rect">anujw_2003@yahoo.co.in</a>&gt;</span> wrote:<br clear=3D"n=
one"><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-le=
ft:1px #ccc solid;padding-left:1ex"><table border=3D"0" cellpadding=3D"0" c=
ellspacing=3D"0"><tbody><tr><td colspan=3D"1" rowspan=3D"1" valign=3D"top">=
Cl=3D1 means that client will only block for one response. In case of write=
s other 2 replicas will be updated asynchronously and eventually updated. A=
s you are running heavy load make sure that writes /mutations are not getti=
ng dropped using nodetool tpstats on all nodes. Under heavy loads Cassandra=
 may drop writes and as these were asynchronous,client wont know about that=
.=C2=A0<div><br clear=3D"none"></div><div>if cl=3D1 for both reads and writ=
es. Some reads may return stale data.If you need absolute guarantee that re=
ads always return up to date data go for strong consistency r cf + w cf gre=
ater than rf. Eg read at quorum and write at quorum.</div><div><br clear=3D=
"none"></div><div>Thanks</div><div>Anuj Wadehra<br clear=3D"none"><br clear=
=3D"none"><p><a rel=3D"nofollow" shape=3D"rect" href=3D"https://overview.ma=
il.yahoo.com/mobile/?.src=3DAndroid" target=3D"_blank">Sent from Yahoo Mail=
 on
 Android</a></p> <hr><table border=3D"0" cellpadding=3D"0" cellspacing=3D"0=
"><tbody><tr><td colspan=3D"1" rowspan=3D"1" valign=3D"top"> <div style=3D"=
font-family:Roboto,sans-serif;color:#7e7d80"><b>From</b>:&quot;Pranay Agarw=
al&quot; &lt;<a rel=3D"nofollow" shape=3D"rect">agarwalpranaya@gmail.com</a=
>&gt;<br clear=3D"none"><b>Date</b>:Fri, 10 Apr, 2015 at 6:40 am<br clear=
=3D"none"><b>Subject</b>:Cassandra Data Loss<br clear=3D"none"><br clear=3D=
"none"></div><div><div> <div dir=3D"ltr"><span style=3D"font-size:12.800000=
1907349px">Hi All.</span><div style=3D"font-size:12.8000001907349px"><br cl=
ear=3D"none"></div><div style=3D"font-size:12.8000001907349px"><br clear=3D=
"none"></div><div style=3D"font-size:12.8000001907349px">I am using 20 node=
s cassandra cluster with RF=3D3 and CL=3D1. We are doing very write/read he=
avy operations (total 100k ops/sec).=C2=A0</div><div style=3D"font-size:12.=
8000001907349px"><br clear=3D"none"></div><div style=3D"font-size:12.800000=
1907349px">I have been assuming all along
 that all the data will be replicated in 3 different place=C2=A0<b>irrespec=
tive of consistency level=C2=A0</b>as it&#39;s a very application/driver le=
vel config. Is that correct or Cassandra guarantees 3 replica only when I a=
lso have CL as 3 as well?</div><div style=3D"font-size:12.8000001907349px">=
<br clear=3D"none"></div><div style=3D"font-size:12.8000001907349px"><br cl=
ear=3D"none"></div><div style=3D"font-size:12.8000001907349px">Thanks</div>=
<div style=3D"font-size:12.8000001907349px">-Pranay</div></div></div></div>=
</td></tr></tbody></table></div></td></tr></tbody></table></blockquote></di=
v><br clear=3D"none"></div></div></div></div></td>  </tr>   </tbody>   </ta=
ble></div></div></td></tr></tbody></table></blockquote></div><br><br clear=
=3D"all"><div><br></div></div></div><span class=3D"HOEnZb"><font color=3D"#=
888888">-- <br><div><div dir=3D"ltr"><div>Jens Rantil</div><div>Backend eng=
ineer</div><div>Tink AB</div><div><br></div><div>Email:=C2=A0<a href=3D"mai=
lto:jens.rantil@tink.se" style=3D"color:rgb(17,85,204)" target=3D"_blank">j=
ens.rantil@tink.se</a></div><div>Phone: +46 708 84 18 32</div><div>Web:=C2=
=A0<a href=3D"http://www.tink.se/" style=3D"color:rgb(17,85,204)" target=3D=
"_blank">www.tink.se</a></div><div><br></div><div><a href=3D"https://www.fa=
cebook.com/#!/tink.se" style=3D"color:rgb(17,85,204);font-family:arial;font=
-size:small" target=3D"_blank">Facebook</a><span style=3D"font-family:arial=
;font-size:small">=C2=A0</span><a href=3D"http://www.linkedin.com/company/2=
735919?trk=3Dvsrp_companies_res_photo&amp;trkInfo=3DVSRPsearchId%3A10570233=
81369207406670%2CVSRPtargetId%3A2735919%2CVSRPcmpt%3Aprimary" style=3D"colo=
r:rgb(17,85,204);font-family:arial;font-size:small" target=3D"_blank">Linke=
din</a><span style=3D"font-family:arial;font-size:small">=C2=A0</span><a hr=
ef=3D"https://twitter.com/tink" style=3D"color:rgb(17,85,204);font-family:a=
rial;font-size:small" target=3D"_blank">Twitter</a></div></div></div>
</font></span></div>
</blockquote></div><br></div>

--f46d044402e4e4bcac051361ecb1--