Subject: Re: Moving cluster
From: Kais Ahmed <kais@neteck-fr.com>
To: user@cassandra.apache.org
Date: Fri, 19 Apr 2013 17:43:16 +0200

Hello, and thank you for your answers.

The first solution is much easier for me because I use vnodes.

What is the risk of the first solution?

Thank you,


2013/4/18 aaron morton <aaron@thelastpickle.com>
This is roughly the lift-and-shift process I use.

Note that disabling thrift and gossip does not stop an existing repair session, so I often drain and then shut down, and copy the live data dir rather than a snapshot dir.
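
A minimal sketch of that drain-then-copy approach; the service name, data path, and backup mount point are typical defaults assumed here, not details from this thread:

    nodetool drain                    # flush memtables; node stops accepting writes
    sudo service cassandra stop
    # copy the live data dir (not a snapshot dir) to a backup location
    cp -a /var/lib/cassandra/data /mnt/backup/cassandra-data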

Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com


On 19/04/2013, at 4:10 AM, Michael Theroux <mtheroux2@yahoo.com> wrote:

This should work.

Another option is to follow a process similar to what we recently did. We recently and successfully upgraded 12 instances from large to xlarge instances in AWS. I chose not to replace nodes, as restoring data from the ring would have taken significant time and put the cluster under additional load. I also wanted to eliminate the possibility that any issues on the new nodes could be blamed on new configuration or operating system differences. Instead we followed this procedure (omitting some details that are unique to our infrastructure).

For a node being upgraded (a shell sketch of steps 1-6 follows the list):

1) nodetool disablethrift
2) nodetool disablegossip
3) Snapshot the data (nodetool snapshot ...)
4) Back up the snapshot data to EBS (assuming you are on ephemeral storage)
5) Stop cassandra
6) Move the cassandra.yaml configuration file to cassandra.yaml.bak (so a future restart of the instance cannot restart cassandra)
7) Shut down the instance
8) Take an AMI of the instance
9) Start a new instance from the AMI with the desired hardware
10) If you assign the new instance a new IP address, make sure any entries in /etc/hosts and the broadcast_address in cassandra.yaml are updated
11) Attach the volume holding your snapshot backup to the new instance and mount it
12) Restore the snapshot data
13) Restore the cassandra.yaml file
14) Restart cassandra
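
As referenced above, a hedged sketch of steps 1-6 as a shell script; the snapshot tag, backup mount point, service name, and config path are illustrative assumptions rather than details from this thread. Steps 7-12 happen in the EC2 console or CLI rather than on the node itself:

    #!/bin/sh
    set -e
    nodetool disablethrift            # 1) stop serving client requests
    nodetool disablegossip            # 2) stop participating in the ring
    nodetool drain                    # flush memtables (see Aaron's note above)
    nodetool snapshot -t premove      # 3) snapshot all keyspaces, tagged "premove"
    # 4) copy the data (including the snapshot) to an EBS-backed mount
    rsync -a /var/lib/cassandra/data/ /mnt/ebs-backup/cassandra-data/
    sudo service cassandra stop       # 5) stop cassandra
    # 6) park the config so a reboot cannot bring cassandra back up
    sudo mv /etc/cassandra/cassandra.yaml /etc/cassandra/cassandra.yaml.bak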

- I recommend practicing this on a test cluster first
- As you replace nodes with new IP addresses, eventually all your seeds will need to be updated. This is not a big deal until all your seed nodes have been replaced.
- Don't forget about NTP! Make sure it is running on all your new nodes. To be extra careful, I actually deleted the NTP drift file and let NTP recalculate it, because it's a new instance, and it took over an hour to restore our snapshot data... but that may have been overkill.
- If you have the opportunity, depending on your situation, increase max_hint_window_in_ms (an illustrative one-liner follows the list)
- Your details may vary
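
For example, one hypothetical way to widen the hint window before a slow move; the 6-hour value and config path are assumptions (3 hours, 10800000 ms, is the usual default in this era of Cassandra):

    # raise max_hint_window_in_ms from the 3 h default to 6 h (values in ms)
    sudo sed -i 's/^max_hint_window_in_ms:.*/max_hint_window_in_ms: 21600000/' \
        /etc/cassandra/cassandra.yaml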

Thanks,
-Mike

On Apr 18, 2013, at 11:07 AM, Alain RODRIGUEZ wrote:

I would say add your 3 servers at the 3 tokens where you want them, let's say:

{
    "0": {
        "0": 0,
        "1": 56713727820156410577229101238628035242,
        "2": 113427455640312821154458202477256070485
    }
}

or these tokens -1 or +1 if those tokens are already in use. Then just decommission the m1.xlarge nodes. You should be good to go. (The short script below shows where these token values come from.)
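
For reference, those values are simply the RandomPartitioner token range (0 to 2^127) split three ways; a small sketch, assuming GNU bc is available:

    export BC_LINE_LENGTH=0   # GNU bc: don't wrap long numbers
    for i in 0 1 2; do
        # token for node i of a 3-node ring: floor(i * 2^127 / 3)
        echo "node $i: $(echo "$i * 2^127 / 3" | bc)"
    done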



2013/4/18 Kais Ahmed <kais@neteck-fr.com>
Hi,

What is the best practice to move from a cluster of 7 nodes (m1.xlarge) to 3 nodes (hi1.4xlarge)?

Thanks,



