From: aaron morton <aaron@thelastpickle.com>
To: user@cassandra.apache.org
Subject: Re: nodetool status inconsistencies, repair performance and system keyspace compactions
Date: Fri, 5 Apr 2013 22:41:02 +0530

Monitor the repair using nodetool compactionstats to see the Merkle trees being created, and nodetool netstats to see the data streaming.

Also look in the logs for messages from AntiEntropyService.java; those will tell you how long the node waited for each replica to get back to it.
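For example, something along these lines in another terminal will show all three at once (the log path assumes a packaged install, adjust it for yours):

# watch an in-flight repair from another terminal
while true; do
  nodetool compactionstats    # "Validation" compactions are the Merkle tree builds
  nodetool netstats           # streaming of out-of-sync ranges between replicas
  grep AntiEntropyService /var/log/cassandra/system.log | tail -5
  sleep 30
done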
Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 4/04/2013, at 5:42 PM, Ondřej Černoš <cernoso@gmail.com> wrote:

> Hi,
>
> Most of this has been resolved - the FAILED_TO_UNCOMPRESS error was really a
> bug in Cassandra (see https://issues.apache.org/jira/browse/CASSANDRA-5391),
> and the difference in load reporting is a change between 1.2.1 (which reports
> 100% for the 3 replicas / 3 nodes / 2 DCs setup I have) and 1.2.3, which
> reports the fraction. Is this correct?
>
> Anyway, nodetool repair still takes ages to finish, considering only
> megabytes of unchanging data are involved in my test:
>
> [root@host:/etc/puppet] nodetool repair ks
> [2013-04-04 13:26:46,618] Starting repair command #1, repairing 1536 ranges for keyspace ks
> [2013-04-04 13:47:17,007] Repair session 88ebc700-9d1a-11e2-a0a1-05b94e1385c7 for range (-2270395505556181001,-2268004533044804266] finished
> ...
> [2013-04-04 13:47:17,063] Repair session 65d31180-9d1d-11e2-a0a1-05b94e1385c7 for range (1069254279177813908,1070290707448386360] finished
> [2013-04-04 13:47:17,063] Repair command #1 finished
>
> This is the status before the repair (by the way, after this datacenter
> has been bootstrapped from the remote one):
>
> [root@host:/etc/puppet] nodetool status
> Datacenter: us-east
> ===================
> Status=Up/Down
> |/ State=Normal/Leaving/Joining/Moving
> --  Address          Load     Tokens  Owns   Host ID                               Rack
> UN  xxx.xxx.xxx.xxx  5.74 MB  256     17.1%  06ff8328-32a3-4196-a31f-1e0f608d0638  1d
> UN  xxx.xxx.xxx.xxx  5.73 MB  256     15.3%  7a96bf16-e268-433a-9912-a0cf1668184e  1d
> UN  xxx.xxx.xxx.xxx  5.72 MB  256     17.5%  67a68a2a-12a8-459d-9d18-221426646e84  1d
> Datacenter: na-dev
> ==================
> Status=Up/Down
> |/ State=Normal/Leaving/Joining/Moving
> --  Address          Load     Tokens  Owns   Host ID                               Rack
> UN  xxx.xxx.xxx.xxx  5.74 MB  256     16.4%  eb86aaae-ef0d-40aa-9b74-2b9704c77c0a  cmp02
> UN  xxx.xxx.xxx.xxx  5.74 MB  256     17.0%  cd24af74-7f6a-4eaa-814f-62474b4e4df1  cmp01
> UN  xxx.xxx.xxx.xxx  5.74 MB  256     16.7%  1a55cfd4-bb30-4250-b868-a9ae13d81ae1  cmp05
>
> Why does it take 20 minutes to finish? Fortunately the large number of
> compactions I reported in the previous email was not triggered this time.
>
> And is there documentation where I could find the exact semantics of
> repair when vnodes are used (and what -pr means in such a setup) and
> when it is run in a multiple datacenter setup? I still don't quite get it.
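I don't know of a single page that spells it all out, but roughly: -pr restricts the repair to the ranges the node is the primary replica for, so with vnodes that is the 256 ranges each of your nodes owns, and to cover the whole ring you have to run it on every node in every DC. A sketch of that pattern, with placeholder host names:

# primary-range repair on every node, one node at a time, so each token
# range in the cluster is repaired exactly once (host names are placeholders)
for host in dc1-node1 dc1-node2 dc1-node3 dc2-node1 dc2-node2 dc2-node3; do
    ssh "$host" nodetool repair -pr ks
done

Without -pr, a plain repair on one node covers every range that node replicates, which with RF 3 repeats most of the work when you then run it node by node.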
> regards,
> Ondřej Černoš
>
> On Thu, Mar 28, 2013 at 3:30 AM, aaron morton <aaron@thelastpickle.com> wrote:
>> During one of my tests - see this thread in this mailing list:
>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/java-io-IOException-FAILED-TO-UNCOMPRESS-5-exception-when-running-nodetool-rebuild-td7586494.html
>>
>> That thread has been updated, check the bug Ondřej created.
>>
>> How will this perform in production with much bigger data if repair
>> takes 25 minutes on 7MB and 11k compactions were triggered by the
>> repair run?
>>
>> Seems a little odd.
>> See what happens the next time you run repair.
>>
>> Cheers
>>
>> -----------------
>> Aaron Morton
>> Freelance Cassandra Consultant
>> New Zealand
>>
>> @aaronmorton
>> http://www.thelastpickle.com
>>
>> On 27/03/2013, at 2:36 AM, Ondřej Černoš <cernoso@gmail.com> wrote:
>>
>> Hi all,
>>
>> I have 2 DCs with 3 nodes each, RF 3, and I use LOCAL_QUORUM for both
>> reads and writes.
>>
>> Currently I am testing various operational qualities of the setup.
>>
>> During one of my tests - see this thread in this mailing list:
>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/java-io-IOException-FAILED-TO-UNCOMPRESS-5-exception-when-running-nodetool-rebuild-td7586494.html
>> - I ran into this situation:
>>
>> - all nodes have all the data and agree on it:
>>
>> [user@host1-dc1:~] nodetool status
>>
>> Datacenter: na-prod
>> ===================
>> Status=Up/Down
>> |/ State=Normal/Leaving/Joining/Moving
>> --  Address          Load     Tokens  Owns (effective)  Host ID                               Rack
>> UN  XXX.XXX.XXX.XXX  7.74 MB  256     100.0%            0b1f1d79-52af-4d1b-a86d-bf4b65a05c49  cmp17
>> UN  XXX.XXX.XXX.XXX  7.74 MB  256     100.0%            039f206e-da22-44b5-83bd-2513f96ddeac  cmp10
>> UN  XXX.XXX.XXX.XXX  7.72 MB  256     100.0%            007097e9-17e6-43f7-8dfc-37b082a784c4  cmp11
>> Datacenter: us-east
>> ===================
>> Status=Up/Down
>> |/ State=Normal/Leaving/Joining/Moving
>> --  Address          Load     Tokens  Owns (effective)  Host ID                               Rack
>> UN  XXX.XXX.XXX.XXX  7.73 MB  256     100.0%            a336efae-8d9c-4562-8e2a-b766b479ecb4  1d
>> UN  XXX.XXX.XXX.XXX  7.73 MB  256     100.0%            ab1bbf0a-8ddc-4a12-a925-b119bd2de98e  1d
>> UN  XXX.XXX.XXX.XXX  7.73 MB  256     100.0%            f53fd294-16cc-497e-9613-347f07ac3850  1d
>>
>> - only one node disagrees:
>>
>> [user@host1-dc2:~] nodetool status
>> Datacenter: us-east
>> ===================
>> Status=Up/Down
>> |/ State=Normal/Leaving/Joining/Moving
>> --  Address          Load     Tokens  Owns   Host ID                               Rack
>> UN  XXX.XXX.XXX.XXX  7.73 MB  256     17.6%  a336efae-8d9c-4562-8e2a-b766b479ecb4  1d
>> UN  XXX.XXX.XXX.XXX  7.75 MB  256     16.4%  ab1bbf0a-8ddc-4a12-a925-b119bd2de98e  1d
>> UN  XXX.XXX.XXX.XXX  7.73 MB  256     15.7%  f53fd294-16cc-497e-9613-347f07ac3850  1d
>> Datacenter: na-prod
>> ===================
>> Status=Up/Down
>> |/ State=Normal/Leaving/Joining/Moving
>> --  Address          Load     Tokens  Owns   Host ID                               Rack
>> UN  XXX.XXX.XXX.XXX  7.74 MB  256     16.9%  0b1f1d79-52af-4d1b-a86d-bf4b65a05c49  cmp17
>> UN  XXX.XXX.XXX.XXX  7.72 MB  256     17.1%  007097e9-17e6-43f7-8dfc-37b082a784c4  cmp11
>> UN  XXX.XXX.XXX.XXX  7.73 MB  256     16.3%  039f206e-da22-44b5-83bd-2513f96ddeac  cmp10
>>
>> I tried rebuilding the node from scratch and repairing it, with no result -
>> still the same owns stats.
>>
>> The cluster is built on Cassandra 1.2.3 and uses vnodes.
>>
>> On a related note: the data size, as you can see, is really small.
>> The cluster was created by setting up the us-east datacenter,
>> populating it with the dataset, then building the na-prod datacenter
>> and running nodetool rebuild us-east. When I then ran nodetool repair,
>> it took 25 minutes to finish on this small dataset. Is this ok?
>>
>> One other thing I noticed is the number of compactions on the system
>> keyspace:
>>
>> /.../system/schema_columnfamilies/system-schema_columnfamilies-ib-11694-TOC.txt
>> /.../system/schema_columnfamilies/system-schema_columnfamilies-ib-11693-Statistics.db
>>
>> These files are from just after running the repair. Is this ok, considering
>> the dataset is 7 MB and no operations were running against the database
>> during the repair - no reads, no writes, nothing?
>>
>> How will this perform in production with much bigger data, if repair
>> takes 25 minutes on 7 MB and 11k compactions were triggered by the
>> repair run?
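For what it's worth, the generation number in those file names (the 11694 in system-schema_columnfamilies-ib-11694-TOC.txt) goes up by one for every new sstable written for that column family, flushes and compactions alike, so comparing it before and after a repair gives a rough feel for the churn. A quick check, assuming the default packaged data directory:

# live sstables for the CF right now (one Data.db per sstable)
ls /var/lib/cassandra/data/system/schema_columnfamilies/*-Data.db | wc -l

# highest generation written so far (4th dash-separated field of the file name)
ls /var/lib/cassandra/data/system/schema_columnfamilies/ | sort -t- -k4,4n | tail -1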
>>
>> regards,
>>
>> Ondrej Cernos