Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of paulo.motta@chaordicsystems.com
 designates 209.85.160.173 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CADT6p3nK7AemHrnAJPGmjKrzRXXQuhwxrFFNmeJqZdbvUUJe-w@mail.gmail.com>
References: 
 <CADT6p3kDR027EpLhb8SGk-qg6b32jghrVFKfWc6BPBxO-TzBAQ@mail.gmail.com>
 <CAGM0Up8Y7wUos4tVniUmOS-s_LV5Oxta3pE+_Kp6pNehg23nQw@mail.gmail.com>
 <CAEDUwd26dW6YEaaijGQHdJ4fKhSQaHDQp6k1obxrvqQrctxxvQ@mail.gmail.com>
 <CABzeAR4+qSNyLwiBvV0nW8V4cmQ7WwNB_wGzEJGGYq=KqTA8PQ@mail.gmail.com>
 <CADT6p3mFwF6kZaOWh=u7Sq1oziDbjmNrPQPWW1Rhit9QZARokg@mail.gmail.com>
 <CADT6p3nK7AemHrnAJPGmjKrzRXXQuhwxrFFNmeJqZdbvUUJe-w@mail.gmail.com>
From: Paulo Ricardo Motta Gomes <paulo.motta@chaordicsystems.com>
Date: Mon, 30 Jun 2014 22:37:56 -0300
Message-ID: 
 <CAM+WaZjH5vNdpTSir6AA7--Lw4Z_kyp-VhS4spb-Mkq0Ggi2gg@mail.gmail.com>
Subject: Re: nodetool repair -snapshot option?
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=001a11c1edec6d320a04fd17d4b7

--001a11c1edec6d320a04fd17d4b7
Content-Type: text/plain; charset=UTF-8

If you find it useful, I created a tool where you input the node IP,
keyspace, column family, and optionally the number of partitions (default:
32K), and it outputs the list of subranges for that node, CF, partition
size: https://github.com/pauloricardomg/cassandra-list-subranges

So you can basically iterate over the output of that and do subrange repair
for each node and cf, maybe in parallel. :)


On Mon, Jun 30, 2014 at 10:26 PM, Phil Burress <philburresseme@gmail.com>
wrote:

> One last question. Any tips on scripting a subrange repair?
>
>
> On Mon, Jun 30, 2014 at 7:12 PM, Phil Burress <philburresseme@gmail.com>
> wrote:
>
>> We are running repair -pr. We've tried subrange manually and that seems
>> to work ok. I guess we'll go with that going forward. Thanks for all the
>> info!
>>
>>
>> On Mon, Jun 30, 2014 at 6:52 PM, Jaydeep Chovatia <
>> chovatia.jaydeep@gmail.com> wrote:
>>
>>> Are you running full repair or on subset? If you are running full repair
>>> then try running on sub-set of ranges which means less data to worry during
>>> repair and that would help JAVA heap in general. You will have to do
>>> multiple iterations to complete entire range but at-least it will work.
>>>
>>> -jaydeep
>>>
>>>
>>> On Mon, Jun 30, 2014 at 3:22 PM, Robert Coli <rcoli@eventbrite.com>
>>> wrote:
>>>
>>>> On Mon, Jun 30, 2014 at 3:08 PM, Yuki Morishita <mor.yuki@gmail.com>
>>>> wrote:
>>>>
>>>>> Repair uses snapshot option by default since 2.0.2 (see NEWS.txt).
>>>>>
>>>>
>>>> As a general meta comment, the process by which operationally important
>>>> defaults change in Cassandra seems ad-hoc and sub-optimal.
>>>>
>>>> For to record, my view was that this change, which makes repair even
>>>> slower than it previously was, was probably overly optimistic.
>>>>
>>>> It's also weird in that it changes default behavior which has been
>>>> unchanged since the start of Cassandra time and is therefore probably
>>>> automated against. Why was it so critically important to switch to snapshot
>>>> repair that it needed to be shotgunned as a new default in 2.0.2?
>>>>
>>>> =Rob
>>>>
>>>>
>>>
>>>
>>
>


-- 
*Paulo Motta*

Chaordic | *Platform*
*www.chaordic.com.br <http://www.chaordic.com.br/>*
+55 48 3232.3200

--001a11c1edec6d320a04fd17d4b7
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div>If you find it useful, I created a tool where you inp=
ut the node IP, keyspace, column family, and optionally the number of parti=
tions (default: 32K), and it outputs the list of subranges for that node, C=
F, partition size:=C2=A0<a href=3D"https://github.com/pauloricardomg/cassan=
dra-list-subranges">https://github.com/pauloricardomg/cassandra-list-subran=
ges</a></div>

<div><br>So you can basically iterate over the output of that and do subran=
ge repair for each node and cf, maybe in parallel. :)</div></div><div class=
=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Mon, Jun 30, 2014 at=
 10:26 PM, Phil Burress <span dir=3D"ltr">&lt;<a href=3D"mailto:philburress=
eme@gmail.com" target=3D"_blank">philburresseme@gmail.com</a>&gt;</span> wr=
ote:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">One last question. Any tips=
 on scripting a subrange repair?</div><div class=3D"HOEnZb"><div class=3D"h=
5"><div class=3D"gmail_extra">

<br><br><div class=3D"gmail_quote">On Mon, Jun 30, 2014 at 7:12 PM, Phil Bu=
rress <span dir=3D"ltr">&lt;<a href=3D"mailto:philburresseme@gmail.com" tar=
get=3D"_blank">philburresseme@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">We are running repair -pr. =
We&#39;ve tried subrange manually and that seems to work ok. I guess we&#39=
;ll go with that going forward. Thanks for all the info!</div>


<div><div><div class=3D"gmail_extra"><br><br><div class=3D"gmail_quote">
On Mon, Jun 30, 2014 at 6:52 PM, Jaydeep Chovatia <span dir=3D"ltr">&lt;<a =
href=3D"mailto:chovatia.jaydeep@gmail.com" target=3D"_blank">chovatia.jayde=
ep@gmail.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" sty=
le=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">


<div dir=3D"ltr">Are you running full repair or on subset? If you are runni=
ng full repair then try running on sub-set of ranges which means less data =
to worry during repair and that would help JAVA heap in general. You will h=
ave to do multiple iterations to complete entire range but at-least it will=
 work.<span><font color=3D"#888888"><div>


<br></div><div>-jaydeep</div></font></span></div><div><div><div class=3D"gm=
ail_extra"><br><br><div class=3D"gmail_quote">On Mon, Jun 30, 2014 at 3:22 =
PM, Robert Coli <span dir=3D"ltr">&lt;<a href=3D"mailto:rcoli@eventbrite.co=
m" target=3D"_blank">rcoli@eventbrite.com</a>&gt;</span> wrote:<br>


<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_extra">=
<div class=3D"gmail_quote"><div>On Mon, Jun 30, 2014 at 3:08 PM, Yuki Moris=
hita <span dir=3D"ltr">&lt;<a href=3D"mailto:mor.yuki@gmail.com" target=3D"=
_blank">mor.yuki@gmail.com</a>&gt;</span> wrote:<br>


</div><div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.=
8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-st=
yle:solid;padding-left:1ex">Repair uses snapshot option by default since 2.=
0.2 (see NEWS.txt).<br>


</blockquote><div><br></div></div><div><div>As a general meta comment, the =
process by which operationally important defaults change in Cassandra seems=
 ad-hoc and sub-optimal.</div><div><br></div></div><div>For to record, my v=
iew was that this change, which makes repair even slower than it previously=
 was, was probably overly optimistic.<br>


</div><div><br></div><div>It&#39;s also weird in that it changes default be=
havior which has been unchanged since the start of Cassandra time and is th=
erefore probably automated against. Why was it so critically important to s=
witch to snapshot repair that it needed to be shotgunned as a new default i=
n 2.0.2?</div>


<div><br></div><div>=3DRob<br></div><div>=C2=A0</div></div></div></div>
</blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br><br clear=3D"all"><div><br></div>-- <br>=
<div dir=3D"ltr"><div style=3D"background-color:rgb(255,255,255)"><b>Paulo =
Motta</b></div><div style=3D"background-color:rgb(255,255,255)"><br></div><=
div style=3D"font-family:arial,sans-serif;font-size:12.727272033691406px;ba=
ckground-color:rgb(255,255,255)">

<div style=3D"color:rgb(136,136,136);font-size:small;font-family:arial"><sp=
an style=3D"color:rgb(68,68,68)">Chaordic | <i>Platform</i></span><br></div=
><div style=3D"color:rgb(136,136,136);font-size:small;font-family:arial"><u=
><a href=3D"http://www.chaordic.com.br/" style=3D"color:rgb(17,85,204)" tar=
get=3D"_blank"><font color=3D"#444444">www.chaordic.com.br</font></a></u></=
div>

<div style=3D"color:rgb(136,136,136);font-size:small;font-family:arial"><fo=
nt size=3D"1" color=3D"#666666">+55 48 3232.3200</font></div></div></div>
</div>

--001a11c1edec6d320a04fd17d4b7--