From: aaron morton
Subject: Re: Moving to a new cluster
Date: Sun, 25 Sep 2011 16:33:48 +1300
To: user@cassandra.apache.org

It can result in a lot of data on the node you run repair on, where "a lot" means perhaps two or more times more data.

My unscientific approach is to repair one CF at a time so you can watch the disk usage, and to repair the smaller CFs first. After the repair, compact if you need to.
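If it helps, here is a minimal Python sketch of that approach, assuming nodetool is on the PATH; the keyspace, column family names and data directory are made-up placeholders. It repairs one CF at a time, smallest first, and prints free disk space after each so you can watch for runaway growth:

    import shutil
    import subprocess

    HOST = "127.0.0.1"                     # placeholder node address
    KEYSPACE = "MyKeyspace"                # hypothetical keyspace
    CFS_SMALLEST_FIRST = ["Small", "Medium", "Large"]   # hypothetical CFs
    DATA_DIR = "/var/lib/cassandra/data"   # adjust to your data directory

    def free_gb(path):
        return shutil.disk_usage(path).free / (1024 ** 3)

    for cf in CFS_SMALLEST_FIRST:
        print("repairing %s.%s, %.1f GB free" % (KEYSPACE, cf, free_gb(DATA_DIR)))
        # repair a single column family at a time
        subprocess.check_call(["nodetool", "-h", HOST, "repair", KEYSPACE, cf])
        print("finished %s, %.1f GB free" % (cf, free_gb(DATA_DIR)))

    # compact afterwards if you need to reclaim the extra space:
    # subprocess.check_call(["nodetool", "-h", HOST, "compact", KEYSPACE])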

I think the amount of extra data will be related to how out of sync things are, so once you get repair working smoothly it will be less of a problem.

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 23/09/2011, at 3:04 AM, Yan Chunlu wrote:


hi Aaron:

could you explain more about the issue where repair makes space usage go crazy?

I am planning to upgrade my cluster from 0.7.4 to 0.8.6, because repair never works on 0.7.4 for me.
more specifically, CASSANDRA-2280 and CASSANDRA-2156.


From your description, I am really worried that 0.8.6 might make it worse...

thanks!

On Thu, Sep 22, 2011 at 7:25 AM, aaron morton <aaron@thelastpickle.com> wrote:
How much data is on the nodes in cluster 1, and how much disk space is on cluster 2? Be aware that Cassandra 0.8 has an issue where repair can go crazy and use a lot of space.

If you are not regularly running repair, I would also repair before the move.

The repair after the copy is a good idea but should technically not be necessary. If you can practice the move, watch the repair to see if much is transferred (check the logs). There is always a small transfer, but if you see data being transferred for several minutes I would investigate.

When you start a repair it will repair with the other nodes it replicates data with, so you only need to run it on every RFth node. Start it on one node, watch the logs to see which nodes it talks to, and then start it on the first node it does not talk to. And so on.
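As a small illustration only (node names are invented), picking every RFth node around the ring in token order looks like this:

    # pick every RFth node (in ring/token order) to run repair on;
    # together they should touch all replicas
    RF = 3
    ring = ["node%d" % i for i in range(1, 13)]   # hypothetical 12-node ring

    repair_targets = ring[::RF]
    print(repair_targets)   # ['node1', 'node4', 'node7', 'node10']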

Add a snapshot before the cleanup (repair will also snapshot before it runs).

Scrub is not needed unless you are migrating or you have file errors.

If your cluster is online, consider running the cleanup on every RFth node rather than all at once (e.g. 1, 4, 7, 10, then 2, 5, 8, 11). It will have less impact on clients.
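A rough sketch of that staggering, again with invented node names; it also takes a snapshot on each node before the cleanup, as suggested above:

    import subprocess

    RF = 3
    ring = ["node%d" % i for i in range(1, 13)]   # hypothetical 12-node ring

    # RF-spaced batches: [node1,4,7,10], [node2,5,8,11], [node3,6,9,12]
    batches = [ring[offset::RF] for offset in range(RF)]

    for batch in batches:
        for node in batch:
            # snapshot first, then cleanup, one batch at a time
            subprocess.check_call(["nodetool", "-h", node, "snapshot"])
            subprocess.check_call(["nodetool", "-h", node, "cleanup"])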

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 22/09/2011, at 10:27 AM, Philippe wrote:

Hello,
We're currently running on a 3-node RF=3 cluster. Now that we have a better grip on things, we want to replace it with a 12-node RF=3 cluster of "smaller" servers. So I wonder what the best way to move the data to the new cluster would be. I can afford to stop writing to the current cluster for whatever time is necessary. Has anyone written up something on this subject?

My plan is the following (nodes in cluster 1 are node1.1->1.3, nodes in cluster 2 are node2.1->2.12):
  • stop writing to current cluster & drain it
  • get a snapshot on each node
  • Since it's RF=3, each node should have all the data, so assuming I set the tokens correctly I would move the snapshot from node1.1 to node2.1, 2.2, 2.3 and 2.4, then node1.2->node2.5, 2.6, 2.7, 2.8, etc. This is because the range for node1.1 is now spread across 2.1->2.4 (see the sketch after this list)
  • Run repair & cleanup & scrub on each node (more or less in parallel)
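A quick sketch of the fan-out in the third step, using the node names from the plan; the actual copy of the snapshot SSTables is only indicated in a comment, since paths will differ per setup:

    # each old node's snapshot goes to the 4 new nodes that now cover
    # its old token range (3 old nodes -> 12 new nodes, RF=3)
    old_nodes = ["node1.1", "node1.2", "node1.3"]
    new_nodes = ["node2.%d" % i for i in range(1, 13)]
    fanout = len(new_nodes) // len(old_nodes)   # 4 new nodes per old node

    for i, old in enumerate(old_nodes):
        targets = new_nodes[i * fanout:(i + 1) * fanout]
        print("%s -> %s" % (old, ", ".join(targets)))
        # e.g. node1.1 -> node2.1, node2.2, node2.3, node2.4
        # copy the snapshot SSTables from `old` into the data directory on
        # each target before starting Cassandra, then repair and cleanup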
What do you think?
Thanks