Subject: Re: Bringing a dead node back up after fixing hardware issues
From: Eran Chinthaka Withana
To: user@cassandra.apache.org
Date: Mon, 23 Jul 2012 20:24:42 -0700

Thanks, Brandon, for the answer (and I didn't know driftx = Brandon Williams; thanks for your awesome support on the Cassandra IRC channel).

Increasing the CL is tricky for us right now, as our RF in that datacenter is 2 and the CL is set to ONE. If we change the CL to LOCAL_QUORUM, then we will be in trouble whenever a node goes down. If nothing else works out, I will try increasing the RF to 3 in that datacenter and setting the CL to LOCAL_QUORUM (see the client-side sketch at the end of this mail).

About decommissioning: if the node is already down, there is no way to run that command on it, right? IIUC, decommission has to be run on the node that is being decommissioned.

Coming back to the original question: without touching the CL, can we bring a dead node back up (after fixing it) and somehow tell Cassandra that the node is back, but not to send it read requests until it has all the data?
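For concreteness, here is roughly what the CL change would look like on the client side. This is only a sketch, assuming the pycassa Python client; the keyspace, column family, and host names below are made up for illustration, not from our actual setup:

    import pycassa

    # Hypothetical keyspace/column family/host names, for illustration only.
    pool = pycassa.ConnectionPool('my_keyspace', server_list=['10.0.0.1:9160'])
    cf = pycassa.ColumnFamily(pool, 'my_cf')

    # What we run today: RF=2 in this DC, reads and writes at ONE.
    cf.read_consistency_level = pycassa.ConsistencyLevel.ONE
    cf.write_consistency_level = pycassa.ConsistencyLevel.ONE

    # What we would move to after raising RF to 3 in this DC:
    # LOCAL_QUORUM needs 2 of the 3 local replicas, so a single node
    # being down no longer fails requests, and a replica that is still
    # catching up cannot by itself serve a stale/missing read.
    cf.read_consistency_level = pycassa.ConsistencyLevel.LOCAL_QUORUM
    cf.write_consistency_level = pycassa.ConsistencyLevel.LOCAL_QUORUM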
Thanks,
Eran Chinthaka Withana

On Mon, Jul 23, 2012 at 6:48 PM, Brandon Williams <driftx@gmail.com> wrote:
> On Mon, Jul 23, 2012 at 6:26 PM, Eran Chinthaka Withana wrote:
> > Method 1: I copied the data from all the nodes in that datacenter into
> > the repaired node and brought it back up. But because of the rate of
> > updates happening, the read misses started going up.
>
> That's not really a good method when you scale up and the amount of
> data in the cluster won't fit on a single machine.
>
> > Method 2: I issued a removetoken command for that node's token and let
> > the cluster stream the data to the relevant nodes. At the end of this
> > process, the dead node was not showing up in the ring output. Then I
> > brought the node back up. I was expecting Cassandra to first stream data
> > into the new node (which happens to be the dead node that was in the
> > cluster earlier) and, once that's done, make it serve reads. But in the
> > server log I can see that as soon as the node comes up, it starts
> > serving reads, creating a large number of read misses.
>
> Removetoken is for dead nodes, so the node has no way of locally knowing
> it shouldn't be a cluster member any longer when it starts up. Instead,
> if you had decommissioned it, it would have saved a flag to indicate it
> should bootstrap at the next startup.
>
> > So the question is, what is the best way to bring back a dead node
> > (once its hardware issues are fixed) without impacting read misses?
>
> Increase your consistency level. Run a repair on the node once it's back
> up, unless the repair took longer than gc_grace, in which case you need
> to removetoken it, delete all the data, and bootstrap it back in if you
> don't want anything that was deleted to resurrect.
>
> -Brandon
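For reference, the two recovery paths Brandon describes map roughly to the sequence below. This is only an illustrative sketch: the host addresses, the token, and the data directory paths are placeholders (not values from this thread), and subprocess is used simply to drive nodetool.

    import subprocess

    REPAIRED_NODE = '10.0.0.2'  # placeholder: the node that had the hardware failure
    LIVE_NODE = '10.0.0.1'      # placeholder: any healthy node in the ring
    OLD_TOKEN = '85070591730234615865843651857942052864'  # placeholder token

    def nodetool(host, *args):
        """Run a nodetool subcommand against the given host."""
        subprocess.check_call(['nodetool', '-h', host] + list(args))

    # Case 1: the node came back within gc_grace.
    # Bring it up, then repair it so it pulls the writes it missed while down.
    nodetool(REPAIRED_NODE, 'repair')

    # Case 2: the node was down (or repairing) for longer than gc_grace.
    # Remove its token via a live node, wipe its local data, and let it
    # bootstrap back in as a fresh node so deleted data cannot resurrect:
    #   nodetool(LIVE_NODE, 'removetoken', OLD_TOKEN)
    #   (then, with Cassandra stopped on the repaired node)
    #   rm -rf /var/lib/cassandra/data /var/lib/cassandra/commitlog /var/lib/cassandra/saved_caches
    #   and restart Cassandra with auto_bootstrap enabled so it streams its data back.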