From: aaron morton <aaron@thelastpickle.com>
Subject: Re: Easy way to overload a single node on purpose?
Date: Fri, 17 Jun 2011 19:20:17 +1200
To: user@cassandra.apache.org

The short answer to the problem you saw is: monitor the disk space. Also monitor client-side logs for errors. Running out of commit log space does not stop the node from doing reads, so it can still be considered up.
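As an illustration only (the commitlog path and threshold below are assumptions, not values from this thread), such a disk-space check can be very small:

    import os

    # Point this at commitlog_directory from cassandra.yaml and pick a
    # threshold that matches your own alerting policy; both values here
    # are assumptions for illustration.
    COMMITLOG_PATH = "/var/lib/cassandra/commitlog"
    MIN_FREE_BYTES = 5 * 1024 ** 3   # alert when less than 5 GB is free

    def commitlog_free_bytes(path=COMMITLOG_PATH):
        st = os.statvfs(path)              # Unix only
        return st.f_bavail * st.f_frsize   # bytes available to non-root users

    if __name__ == "__main__":
        if commitlog_free_bytes() < MIN_FREE_BYTES:
            # wire this into whatever alerting you already run (Nagios, munin, ...)
            raise SystemExit("commitlog volume is low on free space")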
One node's view of its own UP-ness is not as important as the other nodes' (or clients') view of it. For example...

A node will appear UP in the ring view of another node if it is participating in gossip messages and its application state is normal. But a node will appear UP in its own view of the ring most of the time (assuming it is not bootstrapping, leaving, etc. and it has joined the ring). This applies even if its gossip service has been disabled.

To a client, a node will appear down if it is not responding to RPC requests. But it could still be part of the cluster, appear UP to other nodes, and be responding to reads and/or writes.

So to monitor that a node is running in some form you can:

- you should be monitoring the TP stats anyway, so you know the node is in some running state
- check that you can connect as a client to each node and do some simple call: either a read/write, or describe_ring() which will execute locally, or describe_schema_versions() which will call all live nodes. A read/write will only verify that the node can act as a coordinator, not that it can read/write itself (a sketch follows this list).
- monitor the other nodes' view of each node using nodetool ring.
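A rough sketch of such a per-node client check, illustrative only: the node list, keyspace, column family, and key below are assumptions, and pycassa is just one convenient Python Thrift client for 0.7.

    import pycassa

    # Per-node liveness probe. The keyspace/column family/node list are
    # assumptions for illustration -- a small dedicated "health" CF keeps
    # this traffic away from real data.
    NODES = ["node1:9160", "node2:9160", "node3:9160"]
    KEYSPACE = "Monitoring"
    COLUMN_FAMILY = "Health"

    def node_responds(node):
        """True if `node` answers a trivial write+read as a coordinator."""
        try:
            # server_list is pinned to a single node so we exercise that
            # node, not whichever node a pool happens to pick.
            pool = pycassa.ConnectionPool(KEYSPACE, server_list=[node], timeout=2)
            cf = pycassa.ColumnFamily(pool, COLUMN_FAMILY)
            cf.insert("ping", {"ts": "1"})   # node can coordinate a write...
            cf.get("ping")                   # ...and a read
            pool.dispose()
            return True
        except Exception:
            # TimedOutException, UnavailableException, connection errors,
            # etc. all count as "not OK" for this check.
            return False

    if __name__ == "__main__":
        for node in NODES:
            print("%s %s" % (node, "OK" if node_responds(node) else "DOWN"))

This only proves the node can answer as a coordinator; describe_ring() or describe_schema_versions() are the no-data alternatives mentioned above.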
Now that I've written that I'm not 100% sold on it, but it will do for now :)

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 17 Jun 2011, at 10:25, Suan Aik Yeo wrote:

> > Having a ping column can work if every key is replicated to every node. It would tell you the cluster is working, sort of. Once the number of nodes is greater than the RF, it tells you a subset of the nodes works.
>
> The way our check works is that each node checks itself, so in this context we're not concerned about whether the cluster is "up", but that each individual node is "up".
>
> So the symptoms I saw, the node actually going "down" etc, were probably due to many different events happening at the time, and will be very hard to recreate?
>
> On Thu, Jun 16, 2011 at 6:16 AM, aaron morton <aaron@thelastpickle.com> wrote:
> >     DEBUG 14:36:55,546 ... timed out
>
> Is logged when the coordinator times out waiting for the replicas to respond; the timeout setting is rpc_timeout in the yaml file. This results in the client getting a TimedOutException.
>
> AFAIK there is no global "everything is good / bad" flag to check, e.g. AFAIK a node will not mark itself down if it runs out of disk space. So you need to monitor the free disk space and alert on that.
>
> Having a ping column can work if every key is replicated to every node. It would tell you the cluster is working, sort of. Once the number of nodes is greater than the RF, it tells you a subset of the nodes works.
>
> If you google around you'll find discussions about monitoring with munin, ganglia, Cloudkick and OpsCenter.
>
> If you install mx4j you can access the JMX metrics via HTTP.
>
> Cheers
>
> -----------------
> Aaron Morton
> Freelance Cassandra Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 16 Jun 2011, at 10:38, Suan Aik Yeo wrote:
>
> > Here's a weird one... what's the best way to get a Cassandra node into a "half-crashed" state?
> >
> > We have a 3-node cluster running 0.7.5. A few days ago this happened organically to node1 - the partition the commitlog was on was 100% full and there was a "No space left on device" error, and after a while, although the cluster and node1 was still up, to the other nodes it was down, and messages like:
> >     DEBUG 14:36:55,546 ... timed out
> > started to show up in its debug logs.
> >
> > We have a tool to indicate to the load balancer that a Cassandra node is down, but it didn't detect it that time. Now I'm having trouble purposefully getting the node back to that state, so that I can try other monitoring methods. I've tried to fill up the commitlog partition with other files, and although I get the "No space left on device" error, the node still doesn't go down and show the other symptoms it showed before.
> >
> > Also, if anyone could recommend a good way for a node itself to detect that it's in such a state I'd be interested in that too. Currently what we're doing is making a "describe_cluster_name()" thrift call, but that still worked when the node was "down". I'm thinking of something like reading/writing to a fixed value in a keyspace as a check... Unfortunately Java-based solutions are out of the question.
> >
> > Thanks,
> > Suan
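For the mx4j option mentioned in the quoted reply above, a rough sketch of polling its HTTP adaptor from a monitoring script. The port (8081 is a common default for Cassandra's mx4j integration) and the MBean name are assumptions to adapt to your own setup:

    import urllib.parse
    import urllib.request

    # Poll the mx4j HTTP adaptor on each node. Port and MBean name are
    # assumptions based on common defaults; adjust to your configuration.
    NODES = ["node1", "node2", "node3"]
    MX4J_PORT = 8081
    MBEAN = "org.apache.cassandra.db:type=StorageService"

    def mx4j_reachable(host, port=MX4J_PORT, mbean=MBEAN, timeout=2):
        """True if the node's mx4j adaptor serves the StorageService MBean page."""
        url = "http://%s:%d/mbean?objectname=%s" % (
            host, port, urllib.parse.quote(mbean))
        try:
            with urllib.request.urlopen(url, timeout=timeout) as resp:
                # resp.read() would return XML; attributes such as
                # UnreachableNodes or OperationMode could be parsed from it
                # for a deeper check than plain reachability.
                return resp.status == 200
        except OSError:
            return False

    if __name__ == "__main__":
        for host in NODES:
            state = "mx4j OK" if mx4j_reachable(host) else "mx4j unreachable"
            print("%s %s" % (host, state))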