Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of chensheng2010@gmail.com
 designates 209.85.220.172 as permitted sender)
MIME-Version: 1.0
In-Reply-To: <BANLkTikpLFCOtVdwF1kT5hMgMdzMv0+21A@mail.gmail.com>
References: <BANLkTinwc1NXVSHs-pVGWxm3W03jueE+FA@mail.gmail.com>
	<BANLkTikpLFCOtVdwF1kT5hMgMdzMv0+21A@mail.gmail.com>
Date: Thu, 28 Apr 2011 16:57:48 +0800
Message-ID: <BANLkTimdQfc9VVLqy19WaC9Ubz6sOqF4og@mail.gmail.com>
Subject: Re: Heavy writes ok for single node, but failed for cluster
From: Sheng Chen <chensheng2010@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=bcaec501c62a69247c04a1f6c0cf

--bcaec501c62a69247c04a1f6c0cf
Content-Type: text/plain; charset=ISO-8859-1

Thank you for your advice. RF >= 2 is a good workaround.
I was using 0.7.4 and have updated to the latest 0.7 branch, which includes
the CASSANDRA-2554 patch.
But it doesn't help. I still get lots of UnavailableExceptions after the
following log entries:

 INFO [GossipTasks:1] 2011-04-28 16:12:17,661 Gossiper.java (line 228)
InetAddress /192.168.125.49 is now dead.
 INFO [GossipStage:1] 2011-04-28 16:12:19,627 Gossiper.java (line 609)
InetAddress /192.168.125.49 is now UP

 INFO [HintedHandoff:1] 2011-04-28 16:13:11,452 HintedHandOffManager.java
(line 304) Started hinted handoff for endpoint /192.168.125.49
 INFO [HintedHandoff:1] 2011-04-28 16:13:11,453 HintedHandOffManager.java
(line 360) Finished hinted handoff of 0 rows to endpoint /192.168.125.49

It seems that gossip failure detection is too sensitive. Is there a
configuration setting to make it less sensitive?
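
For context on what "sensitive" means here: Cassandra's failure detector is
of the accrual kind. It turns the time since a node's last heartbeat into a
suspicion value (phi) and marks the node dead once phi passes a threshold.
The sketch below is only a simplified model that assumes exponentially
distributed heartbeat gaps. The function names and the threshold of 8 are
illustrative assumptions, not Cassandra's actual code or a confirmed setting.

```python
import math


def phi(seconds_since_last_heartbeat, mean_interval):
    """Suspicion level under an exponential model of heartbeat gaps.

    P(gap > t) = exp(-t / mean), and phi is -log10 of that probability.
    """
    return (seconds_since_last_heartbeat / mean_interval) * math.log10(math.e)


def is_convicted(seconds_since_last_heartbeat, mean_interval, threshold=8.0):
    """Return True when the suspicion level exceeds the (assumed) threshold."""
    return phi(seconds_since_last_heartbeat, mean_interval) > threshold


if __name__ == "__main__":
    # Hypothetical numbers: heartbeats arrive about once per second on average.
    for silence in (1.0, 5.0, 18.0, 20.0):
        print(silence, round(phi(silence, 1.0), 2), is_convicted(silence, 1.0))
```

In this model a lower threshold, or a smaller observed mean interval, makes
the detector convict a node after a shorter silence.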


2011/4/27 Sylvain Lebresne <sylvain@datastax.com>

> On Wed, Apr 27, 2011 at 10:32 AM, Sheng Chen <chensheng2010@gmail.com>
> wrote:
> > I succeeded in inserting 1 billion records into a single-node Cassandra:
> >>> bin/stress -d cas01 -o insert -n 1000000000 -c 5 -S 34 -C5 -t 20
> > Inserts finished in about 14 hours at a speed of 20k/sec.
> > But when I added another node, the tests always failed with
> > UnavailableException within an hour.
> >>> bin/stress -d cas01,cas02 -o insert -n 1000000000 -c 5 -S 34 -C5 -t 20
> > Write speed is also 20k/sec because the client is the bottleneck, so the
> > load on each server node should be 50% of the single-node test.
> > Why couldn't they handle it?
> > By default, rf=1, consistency=ONE
> > Some information that may be helpful,
> > 1. no warn/error in the log files; the cluster is still alive after those
> > exceptions
> > 2. the last log entry on both nodes happens to be a compaction-complete
> > message
> > 3. the gossip log shows one node marked dead and then up again in 3 seconds
>
> That's your problem. Since rf=1, when an update for cas02 reaches cas01
> while cas01 has cas02 marked down, cas01 will throw the
> UnavailableException.
>
> Now, it shouldn't have been marked down and I suspect this is due to
> https://issues.apache.org/jira/browse/CASSANDRA-2554
> (you didn't say which version you're using, but I suppose
> this is a 0.7.*).
>
> If you apply this patch or use the current svn 0.7 branch, that should
> hopefully not happen again.
>
> Note that if you had rf >= 2, the node would still have been wrongly marked
> down for 3 seconds, but that would have been transparent to the stress test.
>
> > 4. I set hinted_handoff_enabled: false, but still see lots of handoff
> > logs
>
> What are those saying?
>
> --
> Sylvain
>
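
The rf=1 versus rf>=2 behaviour described in the quoted reply can be
illustrated with a toy model of the coordinator's availability check. This is
a minimal sketch, not Cassandra's implementation: the names
`UnavailableError` and `check_available` are hypothetical, and replica
placement is passed in by hand rather than computed from tokens.

```python
class UnavailableError(Exception):
    """Raised when too few replicas are believed alive (hypothetical name)."""


def check_available(replicas, live_nodes, required_acks=1):
    """Return the live replicas, or raise if fewer than required_acks are live.

    replicas:      nodes that own the key (length equals the replication factor)
    live_nodes:    nodes the coordinator currently considers up
    required_acks: acknowledgements needed (1 models consistency level ONE)
    """
    live = [node for node in replicas if node in live_nodes]
    if len(live) < required_acks:
        raise UnavailableError(
            f"need {required_acks} live replica(s), have {len(live)}"
        )
    return live


if __name__ == "__main__":
    # cas01 is the coordinator and has briefly marked cas02 as down.
    live_nodes = {"cas01"}

    # rf=2: the key lives on both nodes, so the write can go to cas01.
    print(check_available(["cas01", "cas02"], live_nodes))

    # rf=1: the key lives only on cas02, so the write is rejected.
    try:
        check_available(["cas02"], live_nodes)
    except UnavailableError as exc:
        print("unavailable:", exc)
```

With rf=1 a single wrongly convicted node makes every key it owns
unwritable for the duration, which is why the brief dead/UP flap in the
gossip log surfaces as client-side exceptions.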

--bcaec501c62a69247c04a1f6c0cf--