From: Jérôme Mainaud <jerome@mainaud.com>
Date: Fri, 19 Aug 2016 14:23:30 +0200
Subject: Re: nodetool repair with -pr and -dc
To: user@cassandra.apache.org

Hi Romain,

Thank you for your answer, I will open a ticket soon.

Best
--
Jérôme Mainaud
jerome@mainaud.com

2016-08-19 12:16 GMT+02:00 Romain Hardouin <romainh_ml@yahoo.fr>:

> Hi Jérôme,
>
> The code in 2.2.6 allows -local and -pr:
> https://github.com/apache/cassandra/blob/cassandra-2.2.6/src/java/org/apache/cassandra/service/StorageService.java#L2899
>
> But... the options validation introduced in CASSANDRA-6455 seems to break
> this feature!
> https://github.com/apache/cassandra/blob/cassandra-2.2.6/src/java/org/apache/cassandra/repair/messages/RepairOption.java#L211
>
> I suggest opening a ticket: https://issues.apache.org/jira/browse/cassandra/
>
> Best,
>
> Romain
>
> On Friday 19 August 2016 at 11:47, Jérôme Mainaud <jerome@mainaud.com> wrote:
>
> Hello,
>
> I've got a repair command with both -pr and -local rejected on a 2.2.6
> cluster.
> The exact command was: nodetool repair --full -par -pr -local -j 4
>
> The message is "You need to run primary range repair on all nodes in the
> cluster."
>
> Reading the code and the previously cited CASSANDRA-7450, it should have
> been accepted.
>
> Has anyone met this error before?
>
> Thanks
>
> --
> Jérôme Mainaud
> jerome@mainaud.com
>
> 2016-08-12 1:14 GMT+02:00 kurt Greaves <kurt@instaclustr.com>:
>
> -D does not do what you think it does. I've quoted the relevant
> documentation from the README:
>
> Multiple Datacenters
>
> If you have multiple datacenters in your ring, then you MUST specify the
> name of the datacenter containing the node you are repairing as part of the
> command-line options (--datacenter=DCNAME). Failure to do so will result in
> only a subset of your data being repaired (approximately
> data/number-of-datacenters). This is because nodetool has no way to
> determine the relevant DC on its own, which in turn means it will use the
> tokens from every ring member in every datacenter.
>
> On 11 August 2016 at 12:24, Paulo Motta <pauloricardomg@gmail.com> wrote:
>
> > if we want to use the -pr option (which I suppose we should, to prevent
> > duplicate checks) in 2.0, then if we run the repair on all nodes in a
> > single DC it should be sufficient and we should not need to run it on
> > all nodes across DCs?
>
> No, because the primary ranges of the nodes in other DCs will be missing
> repair, so you should either run with -pr on all nodes in all DCs, or
> restrict repair to a specific DC with -local (and have duplicate checks).
> Combined -pr and -local are only supported on 2.1.
>
> 2016-08-11 1:29 GMT-03:00 Anishek Agarwal <anishek@gmail.com>:
>
> OK, thanks. So if we want to use the -pr option (which I suppose we
> should, to prevent duplicate checks) in 2.0, then if we run the repair on
> all nodes in a single DC it should be sufficient and we should not need to
> run it on all nodes across DCs?
>
> On Wed, Aug 10, 2016 at 5:01 PM, Paulo Motta <pauloricardomg@gmail.com> wrote:
>
> On 2.0, the repair -pr option is not supported together with -local,
> -hosts, or -dc, since it assumes you need to repair all nodes in all DCs,
> and it will throw an error if you try to run it with nodetool, so perhaps
> there's something wrong with range_repair's option parsing.
>
> On 2.1, support for simultaneous -pr and -local options was added in
> CASSANDRA-7450, so if you need that you can either upgrade to 2.1 or
> backport it to 2.0.
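[Editor's note: Paulo's coverage argument can be seen on a toy ring. The sketch below is a hypothetical illustration in plain Python (made-up tokens and DC layout, not Cassandra code): each node's primary range is the slice of the ring ending at its own token, so running -pr only on the nodes of one DC repairs only that DC's primary ranges and leaves the other DC's primary ranges untouched.]

```python
# Toy illustration of primary ranges on a ring (hypothetical tokens/DCs).
# Each node owns the primary range (previous_token, its_token].
nodes = [
    (0, "DC1"), (25, "DC2"), (50, "DC1"), (75, "DC2"),
]

def primary_ranges(nodes):
    """Map each primary range to the DC of the node that owns it."""
    tokens = sorted(t for t, _ in nodes)
    dc_of = dict(nodes)
    ranges = {}
    for i, t in enumerate(tokens):
        prev = tokens[i - 1]  # index -1 wraps around the ring for the first token
        ranges[(prev, t)] = dc_of[t]
    return ranges

ranges = primary_ranges(nodes)
# Repairing with -pr only on DC1's nodes covers just DC1's primary ranges;
# the ranges owned by DC2's nodes never get repaired:
covered = [r for r, dc in ranges.items() if dc == "DC1"]
missing = [r for r, dc in ranges.items() if dc == "DC2"]
print(covered)  # [(75, 0), (25, 50)]
print(missing)  # [(0, 25), (50, 75)]
```

This is why -pr must run on every node in every DC, or be paired with -local (which 2.0 rejects) at the cost of duplicate anti-entropy work.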
> 2016-08-10 5:20 GMT-03:00 Anishek Agarwal <anishek@gmail.com>:
>
> Hello,
>
> We have a 2.0.17 Cassandra cluster (DC1) with a cross-DC setup with a
> smaller cluster (DC2). After reading various blogs about
> scheduling/running repairs, it looks like it's good to run them with the
> following:
>
> -pr for primary range only
> -st -et for subranges
> -par for parallel
> -dc to make sure we can schedule repairs independently on each datacenter
> we have.
>
> I have configured the above using the repair utility at
> https://github.com/BrianGallew/cassandra_range_repair.git
>
> which leads to the following command:
>
> ./src/range_repair.py -k [keyspace] -c [columnfamily name] -v -H localhost
> -p -D DC1
>
> but it looks like the Merkle tree is being calculated on nodes which are
> part of the other DC2.
>
> Why does this happen? I thought it should only look at the nodes in the
> local cluster. However, with nodetool the -pr option cannot be used with
> -local according to the docs at
> https://docs.datastax.com/en/cassandra/2.0/cassandra/tools/toolsRepair.html
>
> So I may be missing something; can someone help explain this, please?
>
> thanks
> anishek
>
> --
> Kurt Greaves
> kurt@instaclustr.com
> www.instaclustr.com
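[Editor's note: the -st/-et subrange approach Anishek lists is what cassandra_range_repair automates: split each range into smaller steps and issue one bounded repair per step. Below is a minimal sketch of the splitting step only, with a hypothetical helper name and toy token values; ring wraparound and the actual Murmur3 token space are ignored.]

```python
# Sketch of subrange splitting, as range_repair.py does conceptually.
def split_range(start, end, steps):
    """Split the interval (start, end] into `steps` contiguous subranges."""
    width = (end - start) // steps
    bounds = [start + i * width for i in range(steps)] + [end]
    return list(zip(bounds[:-1], bounds[1:]))

# Each subrange then becomes one bounded repair invocation, along the lines of:
#   nodetool repair -par -st <start> -et <end> <keyspace> <table>
for st, et in split_range(0, 100, 4):
    print(f"nodetool repair -par -st {st} -et {et} mykeyspace mytable")
```

Bounding each repair this way keeps Merkle-tree calculation and streaming small per invocation, which is the main reason the tool exists.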