From: Yuan Fang <yuan@kryptoncloud.com>
Date: Thu, 7 Jul 2016 15:34:34 -0700
Subject: Re: Is my cluster normal?
To: user@cassandra.apache.org

Thanks Ben! From the post, it seems they got a similar but slightly better result than I did. Good to know.
I am not sure whether a little fine-tuning of heap memory will help or not.
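(In case it is useful context: on a package install the heap is usually set in conf/cassandra-env.sh. The sizes below are only a hedged sketch of the kind of change I have in mind for a 16 GB node, not something I have tested on this cluster.)

  # conf/cassandra-env.sh -- example sizes only, assuming ~16 GB RAM and 4 cores
  MAX_HEAP_SIZE="8G"     # cap the JVM heap at roughly half of RAM
  HEAP_NEWSIZE="400M"    # young gen; ~100 MB per core is the usual CMS starting point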

On Thu, Jul 7, 2016 at 2:58 PM, Ben Slater <ben.slater@instaclustr.com> wrote:
Hi Yuan,

You might find this blog post a useful comparison:
https://www.instaclustr.com/blog/2016/01/07/multi-data-center-apache-spark-and-apache-cassandra-benchmark/
Although the focus is on Spark and Cassandra and multi-DC, there are also some single-DC benchmarks of m4.xl clusters, plus some discussion of how we went about benchmarking.

Cheers
Ben


On Fri, 8 Jul 2016 at 07:52 Yuan Fang <yuan@kryptoncloud.com> wrote:
Yes, here is my stress test result:
Results:
op rate                   : 12200 [WRITE:12200]
partition rate            : 12200 [WRITE:12200]
row rate                  : 12200 [WRITE:12200]
latency mean              : 16.4 [WRITE:16.4]
latency median            : 7.1 [WRITE:7.1]
latency 95th percentile   : 38.1 [WRITE:38.1]
latency 99th percentile   : 204.3 [WRITE:204.3]
latency 99.9th percentile : 465.9 [WRITE:465.9]
latency max               : 1408.4 [WRITE:1408.4]
Total partitions          : 1000000 [WRITE:1000000]
Total errors              : 0 [WRITE:0]
total gc count            : 0
total gc mb               : 0
total gc time (s)         : 0
avg gc time(ms)           : NaN
stdev gc time(ms)         : 0
Total operation time      : 00:01:21
END

On Thu, Jul 7, 2016 at 2:49 PM, Ryan Svihla <rs@foundev.pro> wrote:
Lots of variables you're leaving out.

It depends on write size, whether you're using logged batches or not, what consistency level, what RF, whether the writes come in bursts, etc. However, that's all somewhat moot for determining "normal"; you really need a baseline, as all those variables end up mattering a huge amount.

I would suggest using cassandra-stress as a baseline and going from there depending on what those numbers say (just pick the defaults).
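For example, with the defaults something along these lines writes a baseline data set and then reads it back (the exact flags vary a little between Cassandra versions and the node address is just a placeholder, so treat this as a sketch):

  cassandra-stress write n=1000000 -node <one of your nodes>
  cassandra-stress read n=1000000 -node <one of your nodes>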

Sent from my iPhone
On Jul 7, 2016, at 4:39 PM, Yuan Fang <yuan@kryptoncloud.com> wrote:

Yes, it is about 8k writes per node (30k cluster-wide spread over 4 nodes works out to roughly 7.5k coordinator writes per node).



On Thu, Jul 7, 2016 at 2:18 PM, daemeon reiydelle <daemeonr@gmail.com> wrote:
Are you saying 7k writes per node, or 30k writes per node?

.......


Daemeon C.M. Reiydelle
USA (+1) 415.501.0198
London (+44) (0) 20 8144 9872


On Thu, Jul 7, 2016 at 2:05 PM, Yuan Fang <yuan@kryptoncloud.com> wrote:
Writes at 30k/second are the main thing.


On Thu, Jul 7, 2016 at 1:51 PM, daemeon reiydelle <daemeonr@gmail.com> wrote:
Assuming you meant 100k, that is likely for something with 16 MB of storage (probably way too small) where the data is more than 64k and hence will not fit into the row cache.
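(For reference, the row cache is off unless it is both given memory in cassandra.yaml and enabled per table; a minimal sketch, assuming 2.1+/3.x syntax and a hypothetical table ks.tbl:)

  # cassandra.yaml -- 0 (the default) disables the row cache entirely
  row_cache_size_in_mb: 0

  -- CQL, per table: cache at most the first 100 rows of each partition
  ALTER TABLE ks.tbl WITH caching = {'keys': 'ALL', 'rows_per_partition': '100'};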

.......


Daemeon C.M. Reiydelle
USA (+1) 415.501.0198
London (+44) (0) 20 8144 9872


On Thu, Jul 7, 2016 at 1:25 PM, Yuan Fang <yuan@kryptoncloud.com> wrote:


I have a cluster of 4 m4.xlarge nodes (4 CPUs, 16 GB memory, and 600 GB SSD EBS).
I can reach cluster-wide write requests of 30k/second and read requests of about 100/second. The cluster OS load is constantly above 10. Are those numbers normal?
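(If more detail would help, these are the kinds of per-node checks I can run; the exact commands below are a hedged sketch, since the nodetool sub-command names differ a little between Cassandra versions:)

  nodetool tpstats          # pending/blocked thread pools, dropped mutations
  nodetool tablestats       # per-table write/read counts and latencies (cfstats on older versions)
  nodetool compactionstats  # whether compactions are backing up
  iostat -x 5               # EBS utilisation and I/O wait behind the OS load number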

Thanks!


Best,

Yuan






--
————————
Ben Slater
Chief Product Officer
Instaclustr: Cassandra + Spark - Managed | Consulting | Support
+61 437 929 798