From: Paulo Motta <pauloricardomg@gmail.com>
Date: Wed, 2 Oct 2013 15:49:07 -0300
Subject: Re: Best version to upgrade from 1.1.10 to 1.2.X
To: user@cassandra.apache.org

Hello,

I just started the rolling upgrade procedure from 1.1.10 to 1.2.10. Our
strategy is to simultaneously upgrade one server from each replication
group. So, if we have 6 nodes with RF=2, we will upgrade 3 nodes at a
time (from distinct replication groups).

My question is: do the newly upgraded nodes show as "Down" in the
"nodetool ring" output of the old (1.1.10) nodes? I thought network
compatibility meant that nodes on a newer version could receive traffic
(writes and reads) from nodes on the previous version without problems.

Cheers,

Paulo

2013/9/26 Paulo Motta <pauloricardomg@gmail.com>
> Hello Charles,
>
> Thank you very much for your detailed upgrade report. It'll be very
> helpful during our upgrade operation (even though we'll do a rolling
> production upgrade).
>
> I'll also share our findings during the upgrade here.
>
> Cheers,
>
> Paulo
>
>
> 2013/9/24 Charles Brophy <cbrophy@zulily.com>
>> Hi Paulo,
>>
>> I just completed a migration from 1.1.10 to 1.2.10 and it was
>> surprisingly painless.
>>
>> The course of action that I took:
>> 1) describe cluster - make sure all nodes are on the same schema
>> 2) shut off all maintenance tasks; i.e. make sure no scheduled repair
>> is going to kick off in the middle of what you're doing
>> 3) snapshot - maybe not necessary, but it's so quick it makes no sense
>> to skip this step
>> 4) drain the nodes - I shut down the entire cluster rather than chance
>> any incompatible gossip concerns that might come from a rolling
>> upgrade. I have the luxury of controlling both the providers and
>> consumers of our data, so this wasn't so disruptive for us.
>> 5) Upgrade the nodes, turn them on one by one, and monitor the logs
>> for funny business.
>> 6) nodetool upgradesstables
>> 7) Turn various maintenance tasks back on, etc.
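
(Inline sketch, not from Charles: here is roughly how I read steps 1-7 as
per-node commands. Untested; the service name, log path, and snapshot tag
are placeholders for whatever your install uses, and the ring check at the
end is exactly the experiment I ask about above.)

    # 1) confirm all nodes agree on the schema version
    #    ("describe cluster;" in cassandra-cli on 1.1)
    echo 'describe cluster;' | cassandra-cli -h localhost
    # 3) cheap safety net: snapshots are hard links, so this is fast
    nodetool snapshot -t pre-1.2.10-upgrade
    # 4) flush memtables and stop serving traffic, then stop the node
    nodetool drain
    sudo service cassandra stop
    # ... install 1.2.10 and merge the cassandra.yaml changes ...
    sudo service cassandra start
    # 5) watch for errors or gossip/schema complaints on startup
    tail -f /var/log/cassandra/system.log
    # 6) rewrite the data files in the new on-disk format
    nodetool upgradesstables
    # and, from a node still on 1.1.10, check how the upgraded node
    # gossips: it should show as "Up"
    nodetool -h <old-node-address> ring
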
>>
>> The worst part was managing the yaml/config changes between the
>> versions. It wasn't horrible, but the diff was "noisier" than a more
>> incremental upgrade typically is. A few things I recall that were
>> special:
>> 1) Since you have an existing cluster, you'll probably need to set the
>> default partitioner back to RandomPartitioner in cassandra.yaml. I
>> believe that is outlined in NEWS.
>> 2) I set the initial tokens to be the same as what the nodes held
>> previously.
>> 3) The timeout is now divided into more atomic settings, and you get
>> to decide how (or if) to configure each of them relative to the
>> default.
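
(Also inline: a quick, hypothetical sanity check for those three settings
before starting a node on 1.2. The key names are from the 1.2
cassandra.yaml; the file path is a guess for a package install.)

    # partitioner: 1.2 defaults new clusters to Murmur3Partitioner; an
    #   existing 1.1 cluster must stay on RandomPartitioner
    # initial_token: should match the token this node owned on 1.1
    # *_request_timeout_in_ms: 1.2 splits the old rpc_timeout_in_ms into
    #   per-operation timeouts (read/write/range/truncate + a general one)
    grep -E '^(partitioner|initial_token|[a-z_]*request_timeout_in_ms)' \
        /etc/cassandra/cassandra.yaml
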
>>
>> tldr; I did a standard upgrade and paid careful attention to the
>> NEWS.txt upgrade notices. I did a full cluster restart and NOT a
>> rolling upgrade. It went without a hitch.
>>
>> Charles
>>
>>
>> On Tue, Sep 24, 2013 at 2:33 PM, Paulo Motta
>> <pauloricardomg@gmail.com> wrote:
>>> Cool, sounds fair enough. Thanks for the help, Rob!
>>>
>>> If anyone has upgraded from 1.1.X to 1.2.X, please feel invited to
>>> share any tips on issues you've encountered that are not yet
>>> documented.
>>>
>>> Cheers,
>>>
>>> Paulo
>>>
>>>
>>> 2013/9/24 Robert Coli <rcoli@eventbrite.com>
>>>> On Tue, Sep 24, 2013 at 1:41 PM, Paulo Motta
>>>> <pauloricardomg@gmail.com> wrote:
>>>>> Doesn't the probability of something going wrong increase as the
>>>>> gap between the versions increases? By that reasoning, upgrading
>>>>> from 1.1.10 to 1.2.6 would have less chance of something going
>>>>> wrong than from 1.1.10 to 1.2.9 or 1.2.10.
>>>>
>>>> Sorta, but sorta not.
>>>>
>>>> https://github.com/apache/cassandra/blob/trunk/NEWS.txt
>>>>
>>>> is the canonical source of concerns on upgrade. There are a few
>>>> cases where upgrading to the "root" of X.Y.Z creates issues that do
>>>> not exist if you upgrade to the "head" of that line. AFAIK there
>>>> have been no cases where upgrading to the "head" of a line (where
>>>> that line is mature, like 1.2.10) has created problems which would
>>>> have been avoided by upgrading to the "root" first.
>>>>
>>>>> I'm hoping this reasoning is wrong and I can update directly from
>>>>> 1.1.10 to 1.2.10. :-)
>>>>
>>>> That's what I plan to do when we move to 1.2.X, FWIW.
>>>>
>>>> =Rob

-- 
Paulo Ricardo

-- 
European Master in Distributed Computing
Royal Institute of Technology - KTH
Instituto Superior Técnico - IST
http://paulormg.com