From: Ken Hancock
Date: Wed, 10 Jun 2015 14:31:44 -0400
Subject: Re: Hundreds of sstables after every Repair
To: user@cassandra.apache.org

Perhaps running sstable2json on some of the small sstables would shed some light. I was going to suggest the anticompaction feature of C* 2.1 (which I'm not familiar with), but you're on 2.0.
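Something along these lines, say, just as a rough sketch (the data directory, keyspace/CF names and generation number below are placeholders for whatever 2.0 wrote on your nodes):

    # dump one of the tiny sstables to JSON to see what rows/columns it actually holds
    sstable2json /var/lib/cassandra/data/<keyspace>/<cf>/<keyspace>-<cf>-jb-1234-Data.db | head -100

    # or, if memory serves, -e just enumerates the partition keys it contains
    sstable2json /var/lib/cassandra/data/<keyspace>/<cf>/<keyspace>-<cf>-jb-1234-Data.db -e

If the tiny sstables all hold a handful of columns for the same wide rows, that would tell you something about what repair is streaming.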
On Tue, Jun 9, 2015 at 11:11 AM, Anuj Wadehra wrote:

> We were facing dropped mutations earlier and we increased flush writers.
> Now there are no dropped mutations in tpstats. To repair the damaged
> vnodes / inconsistent data we executed repair -pr on all nodes. Still, we
> see the same problem.
>
> When we analyze the repair logs we see two strange things:
>
> 1. "Out of sync" ranges for CFs which are not being actively
> written/updated while the repair is going on. When we repaired all data
> with repair -pr on all nodes, why is there still out-of-sync data?
>
> 2. For some CFs, the repair logs show that all ranges are consistent, yet
> we still get so many sstables created during repair. When everything is
> in sync, why does repair create tiny sstables to repair data?
>
> Thanks
> Anuj Wadehra
>
> Sent from Yahoo Mail on Android
>
> ------------------------------
> *From*: "Ken Hancock"
> *Date*: Tue, 9 Jun, 2015 at 8:24 pm
> *Subject*: Re: Hundreds of sstables after every Repair
>
> I think this came up recently in another thread. If you're getting large
> numbers of SSTables after repairs, that means that your nodes are
> diverging from the keys that they're supposed to be holding. Likely
> you're dropping mutations. Do a nodetool tpstats on each of your nodes
> and look at the dropped mutation counters. If you're seeing dropped
> messages, my money is on a non-zero FlushWriter "All time blocked" stat,
> which is causing mutations to be dropped.
>
>
> On Tue, Jun 9, 2015 at 10:35 AM, Anuj Wadehra wrote:
>
>> Any suggestions or comments on this one?
>>
>> Thanks
>> Anuj Wadehra
>>
>> Sent from Yahoo Mail on Android
>>
>> ------------------------------
>> *From*: "Anuj Wadehra"
>> *Date*: Sun, 7 Jun, 2015 at 1:54 am
>> *Subject*: Hundreds of sstables after every Repair
>>
>> Hi,
>>
>> We are using 2.0.3 and vnodes. After every repair -pr operation, 50+
>> tiny sstables (<10 KB) get created, and these sstables never get
>> compacted due to the coldness issue. I have raised
>> https://issues.apache.org/jira/browse/CASSANDRA-9146 for this issue but
>> I have been told to upgrade. Until we upgrade to the latest 2.0.x we are
>> stuck. Upgrades take time, testing and planning in production systems :(
>>
>> I have observed that even if vnodes are NOT damaged, hundreds of tiny
>> sstables are created during repair for a wide-row CF. This is beyond my
>> understanding. If everything is consistent, and for the entire repair
>> process Cassandra is saying "Endpoints /x.x.x.x and /x.x.x.y are
>> consistent for <CF>", what is the need to create sstables?
>>
>> Is there any alternative to regular major compaction to deal with this
>> situation?
>>
>>
>> Thanks
>> Anuj Wadehra
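To make the tpstats check above concrete, this is roughly what I'd run on each node (a rough sketch; the pool and message names are from 2.0-era output, so adjust the patterns if yours differ):

    # a non-zero "All time blocked" for FlushWriter, or non-zero dropped
    # MUTATION counts, suggest the write path is falling behind
    nodetool tpstats | grep -i -E 'FlushWriter|MUTATION'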