From: aaron morton <aaron@thelastpickle.com>
To: user@cassandra.apache.org
Subject: Re: Moving to a new cluster
Date: Sun, 25 Sep 2011 22:21:35 +1300

Sounds like it.

A
-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 25/09/2011, at 6:10 PM, Yan Chunlu wrote:

thanks! is that similar problem described in this thread?

http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/nodetool-repair-caused-high-disk-space-usage-td6695542.html

On Sun, Sep 25, 2011 at 11:33 AM, aaron morton <aaron@thelastpickle.com> wrote:
It can result in a lot of data on the node you run repair on, where "a lot" means perhaps 2 or more times more data.

My unscientific approach is to repair one CF at a time so you can watch the disk usage, and to repair the smaller CFs first. After the repair, compact if you need to.
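For example, something along these lines (the host, keyspace and CF names are made up and the default data directory is assumed; adjust for your setup):

    # repair the smaller CFs first, one at a time, watching disk usage between runs
    nodetool -h node1 repair MyKeyspace SmallCF
    du -sh /var/lib/cassandra/data/MyKeyspace
    nodetool -h node1 repair MyKeyspace BigCF
    # compact afterwards if you need to reclaim space
    nodetool -h node1 compact MyKeyspace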

I think the amount of extra data will be related to how out of sync things are, so once you get repair working smoothly it will be less of a problem.

Cheers
    

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton

On 23/09/2011, at 3:04 AM, Yan Chunlu wrote:


hi Aaron:

could you explain more about the issue where repair makes space usage go crazy?

I am planning to upgrade my cluster from 0.7.4 to 0.8.6, because repair never works for me on 0.7.4;
more specifically, CASSANDRA-2280 and CASSANDRA-2156.


from your description, I'm really worried that 0.8.6 might make it worse...

thanks!

On Thu, Sep 22, 2011 at 7:25 AM, aaron morton <aaron@thelastpickle.com> wrote:
How much data is on the nodes in cluster 1, and how much disk space is on cluster 2? Be aware that Cassandra 0.8 has an issue where repair can go crazy and use a lot of space.
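For instance (hypothetical host names, default data directory assumed):

    nodetool -h node1.1 ring        # the Load column shows live data per node in cluster 1
    df -h /var/lib/cassandra        # free disk on each node in cluster 2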

If you are not regularly running repair, I would also repair before the move.

The repair after the copy is a good idea but should technically not be necessary. If you can practice the move, watch the repair to see if much is transferred (check the logs). There is always a small transfer, but if you see data being transferred for several minutes I would investigate.
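One way to keep an eye on that during a practice run, assuming the default log location of the packaged install:

    grep -i streaming /var/log/cassandra/system.log | tail -50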

When you start a repair, it will repair with the other nodes it replicates data with, so you only need to run it on every RF-th node. Start it on one node, watch the logs to see who it talks to, and then start it on the first node it does not talk to. And so on.
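With RF=3 on Philippe's 12-node layout that could look like the sketch below (the keyspace name is made up, and the exact grouping should be confirmed from the logs as described above):

    nodetool -h node2.1 repair MyKeyspace
    nodetool -h node2.4 repair MyKeyspace
    nodetool -h node2.7 repair MyKeyspace
    nodetool -h node2.10 repair MyKeyspace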

Add a snapshot before the cleanup (repair will also snapshot before it runs).

Scrub is not needed unless you are migrating or you have file errors.

If your cluster is online, consider running the cleanup on every RF-th node rather than all at once (e.g. 1, 4, 7, 10, then 2, 5, 8, 11). It will have less impact on clients.
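A rough sketch of that staggered run, including the snapshot suggested above (hypothetical host names, first batch only):

    for n in node2.1 node2.4 node2.7 node2.10; do
        nodetool -h $n snapshot    # safety net before data is removed
        nodetool -h $n cleanup
    done
    # then repeat for node2.2 node2.5 node2.8 node2.11, and node2.3 node2.6 node2.9 node2.12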

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton

On 22/09/2011, at 10:27 AM, Philippe wrote:

Hello,
We're currently running on a 3-node RF=3 cluster. Now that we have a better grip on things, we want to replace it with a 12-node RF=3 cluster of "smaller" servers. So I wonder what the best way to move the data to the new cluster would be. I can afford to stop writing to the current cluster for whatever time is necessary. Has anyone written up something on this subject?

My plan is the following (nodes in cluster 1 are node1.1->1.3, nodes in cluster 2 are node2.1->2.12):
  • stop writing to the current cluster & drain it
  • get a snapshot on each node
  • Since it's RF=3, each node should have all the data, so assuming I set the tokens correctly I would move the snapshot from node1.1 to node2.1, 2.2, 2.3 and 2.4, then node1.2 -> node2.5, 2.6, 2.7, 2.8, etc. This is because the range for node1.1 is now spread across 2.1->2.4 (see the token sketch after this list).
  • Run repair & clean & scrub on each node (more or less in parallel)
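A rough sketch of the token maths behind the token assignment above, assuming the RandomPartitioner and an evenly spaced 12-node ring (this is just the standard i * 2^127 / 12 spacing, not something from this thread):

    # print initial_token for node2.1 .. node2.12
    python -c 'print("\n".join(str(i * (2**127 // 12)) for i in range(12)))'
    # if both rings are evenly spaced and share the same first token, each old node's
    # range splits cleanly across 4 new nodes (node1.1 -> node2.1-2.4, and so on)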
What do you think?
Thanks