Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy)
From: aaron morton <aaron@thelastpickle.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_7E79054E-C164-4C5A-ADE2-2EDC2BA9FB87"
Message-Id: <1C4330B9-EE2D-46FD-AD53-0654020B43CB@thelastpickle.com>
Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\))
Subject: Re: Recovering from a faulty cassandra node
Date: Fri, 22 Mar 2013 05:58:29 +1300
References: 
 <CAPqEvGFL9M475MvQ9uZzzW+58M0Oi7BG9SWisRqzpJaC3vQ9bg@mail.gmail.com>
 <CD6DE270.23CFA%Dean.Hiller@nrel.gov>
 <CA+VSrLqGG4ObMMf4XT8ENsejP0LPchfk_06KLBN0p6TACqmo9A@mail.gmail.com>
 <CAPqEvGF0608HGxHjDCrqJM+6BjxSCseegdAOcbE4oQEgc2xtRQ@mail.gmail.com>
 <CAPqEvGH+_xB=d8pbKZ4Fy6cBryc5VYQLqgT6gmcY0UZoaVYHiA@mail.gmail.com>
To: user@cassandra.apache.org
In-Reply-To: 
 <CAPqEvGH+_xB=d8pbKZ4Fy6cBryc5VYQLqgT6gmcY0UZoaVYHiA@mail.gmail.com>


--Apple-Mail=_7E79054E-C164-4C5A-ADE2-2EDC2BA9FB87
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=windows-1252

>  Not sure if I needed to change cassandra-topology.properties file on =
the existing nodes.
If you are using the PropertyFileSnitch, all nodes need to have the same =
cassandra-topology.properties file.
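
One way to check this, as a rough sketch only: copy each node's cassandra-topology.properties to one machine and compare fingerprints of the contents. The function names and node labels below are made up for illustration, and ignoring comments, blank lines and line order is my assumption about what "the same" needs to mean here.

```python
import hashlib


def topology_fingerprint(text):
    """Hash the meaningful lines of a cassandra-topology.properties file.

    Comments (lines starting with '#'), blank lines and line order are
    ignored, so files that differ only in layout get the same fingerprint.
    """
    lines = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        lines.append(line)
    normalised = "\n".join(sorted(lines))
    return hashlib.sha256(normalised.encode("utf-8")).hexdigest()


def nodes_out_of_sync(files_by_node):
    """Return the nodes whose file differs from the most common version."""
    prints = {node: topology_fingerprint(text)
              for node, text in files_by_node.items()}
    if not prints:
        return []
    counts = {}
    for fp in prints.values():
        counts[fp] = counts.get(fp, 0) + 1
    majority = max(counts, key=counts.get)
    return sorted(node for node, fp in prints.items() if fp != majority)
```

Calling nodes_out_of_sync with a dict of node label to file contents returns the labels that need fixing; an empty list means every copy matches.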

Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 21/03/2013, at 1:34 AM, Jabbar Azam <ajazam@gmail.com> wrote:

> I've added the node with a different IP address and, after disabling =
the firewall, data is being streamed from the existing nodes to the wiped =
node. I'll do a cleanup, followed by removenode, once it's done.
>=20
> I've also added the new node to the existing nodes' =
cassandra-topology.properties file and restarted them. I also found I =
had iptables switched on, which explains why the wiped node =
couldn't see the cluster. Not sure if I needed to change =
cassandra-topology.properties file on the existing nodes.
>=20
>=20
>=20
>=20
> On 19 March 2013 15:49, Jabbar Azam <ajazam@gmail.com> wrote:
> Do I use removenode before adding the reinstalled node or after?
>=20
>=20
> On 19 March 2013 15:45, Alain RODRIGUEZ <arodrime@gmail.com> wrote:
> In 1.2, you may want to use nodetool removenode if your server is =
broken or unreachable; otherwise I guess nodetool decommission remains the =
right way to remove a node. =
(http://www.datastax.com/docs/1.2/references/nodetool)
>=20
> When this node is out, rm -rf /yourpath/cassandra/* on this server, =
change the configuration if needed (not sure about the auto_bootstrap =
param) and start Cassandra on that node again. It should join the ring =
as a new node.
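
The choice between the two commands can be sketched as below. This is only an illustration: the function name is hypothetical, nothing is executed, and the host ID is whatever 'nodetool status' reports for the dead node (in 1.2, removenode takes that Host ID as its argument).

```python
def removal_command(node_reachable, host_id=None):
    """Pick the nodetool command for taking a node out of the ring.

    Returns a (where_to_run, argv) pair. Nothing is executed here.
    """
    if node_reachable:
        # A live node streams its own data away before it leaves.
        return ("on the node being removed", ["nodetool", "decommission"])
    if not host_id:
        raise ValueError("removenode needs the Host ID from 'nodetool status'")
    # A dead node has to be removed from one of the remaining live nodes.
    return ("on any live node", ["nodetool", "removenode", host_id])
```

Returning the command instead of running it keeps the sketch safe to try; the wipe and restart steps are left out on purpose because the data path depends on the install.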
>=20
> Good luck.
>=20
>=20
> 2013/3/19 Hiller, Dean <Dean.Hiller@nrel.gov>
>=20
> Since you "cleared" out that node, it IS the replacement node.
>=20
> Dean
>=20
> From: Jabbar Azam <ajazam@gmail.com>
> Reply-To: user@cassandra.apache.org
> Date: Tuesday, March 19, 2013 9:29 AM
> To: user@cassandra.apache.org
> Subject: Re: Recovering from a faulty cassandra node
>=20
> Hello Dean.
>=20
> I'm using vnodes so can't specify a token. In addition I can't follow =
the replace node docs because I don't have a replacement node.
>=20
>=20
> On 19 March 2013 15:25, Hiller, Dean <Dean.Hiller@nrel.gov> wrote:
> I have not done this yet, but from all that I have read your best =
option is to follow the replace node documentation, for which I believe you =
need to:
>=20
>=20
>  1.  Have the token be the same BUT add 1 to it so it doesn't think =
it's the same computer
>  2.  Have the bootstrap option set or something so streaming takes =
effect.
>=20
> I would however test all of that in QA to make sure it works, and if =
you have QUORUM reads/writes a good part of that test would be to take =
node X down after your node Y is back in the cluster to make sure =
reads/writes are working on the node you fixed... you just need to make =
sure node X shares one of the token ranges of node Y AND your =
writes/reads are in that token range.
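
The arithmetic behind that test, as a small sketch. The replication factor is not stated anywhere in this thread, so RF=3 below is purely an assumption for illustration. A quorum is floor(RF / 2) + 1 replicas, so with RF=3 and node X down, a QUORUM request for a range that X and Y share needs both remaining replicas, which means Y has to answer.

```python
def quorum(replication_factor):
    """Replicas that must respond for a QUORUM read or write."""
    return replication_factor // 2 + 1


def survives_one_node_down(replication_factor):
    """True if QUORUM can still be met with one replica unavailable."""
    return replication_factor - 1 >= quorum(replication_factor)


# With RF=3 (an assumption), quorum is 2 and one replica may be down.
rf = 3
needed = quorum(rf)
remaining = rf - 1
print(f"RF={rf}: quorum={needed}, replicas left with X down={remaining}")
```

If the QUORUM operations succeed with X down, the repaired node Y must be serving that range correctly, since no replica is left to spare.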
>=20
> Dean
>=20
> From: Jabbar Azam <ajazam@gmail.com>
> Reply-To: user@cassandra.apache.org
> Date: Tuesday, March 19, 2013 8:51 AM
> To: user@cassandra.apache.org
> Subject: Recovering from a faulty cassandra node
>=20
> Hello,
>=20
> I am using Cassandra 1.2.2 on a 4 node test cluster with vnodes. It =
took over a week to insert lots of data into the cluster. Near =
the end of the process one of the nodes had a hardware fault.
>=20
> I have fixed the hardware fault but the file system on that node is =
corrupt, so I'll have to reinstall the OS and Cassandra.
>=20
> I can think of two ways of reintegrating the host into the cluster:
>=20
> 1) Shrink the cluster to three nodes and then add the node back into the cluster
>=20
> 2) Add the node into the cluster without shrinking
>=20
> I'm not sure of the best approach to take and I'm not sure how to =
achieve each step.
>=20
> Can anybody help?
>=20
>=20
> --
> Thanks
>=20
>  Jabbar Azam
>=20


--Apple-Mail=_7E79054E-C164-4C5A-ADE2-2EDC2BA9FB87--