Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of ynerella999@gmail.com
 designates 209.85.160.46 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CANF7QJS1w4=XYfVDooXpDsvTkDfocUUw8XcUheSUr2fKxKCMdw@mail.gmail.com>
References: 
 <CANF7QJRm4jqJH-T9r9OudU94SwFGHQFRLo6Pqsw9wOE5vpztUw@mail.gmail.com>
	<CAKmMYa-PWAD3ycX-pK1bEPkv7mNsce06Lia6pWK_kZkVuxmHQg@mail.gmail.com>
	<CANF7QJRTxaEpU941TifJJc2RjgZmvPWHR1V5NewymQ2DU_kdEQ@mail.gmail.com>
	<CAOZF2BcNsjVV=qmwLk6UpCxS_oJYwNjF2P_KCDgkCi7wUMMhog@mail.gmail.com>
	<CANF7QJS1w4=XYfVDooXpDsvTkDfocUUw8XcUheSUr2fKxKCMdw@mail.gmail.com>
Date: Wed, 19 Feb 2014 10:02:15 -0800
Message-ID: 
 <CAOZF2BftVmTP7UrFFTCeLW9GQMwYCD1ckU24a0y9xyGrhkY3ew@mail.gmail.com>
Subject: Re: High CPU load on one node in the cluster
From: Yogi Nerella <ynerella999@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=001a11332b6c569ef004f2c630a6

--001a11332b6c569ef004f2c630a6
Content-Type: text/plain; charset=ISO-8859-1

You should start your Cassandra daemon with -verbose:gc (please check
syntax) and then run it in foreground, as Cassandra closes the standard out)
Please see other emails in this forum for getting Garbage Collection
Statistics from Cassandra user mail, or look at any Java specific sites.

Ex:
http://stackoverflow.com/questions/1161647/how-to-redirect-verbose-garbage-collection-output-to-a-file


It depends on what JVM you are running.


On Wed, Feb 19, 2014 at 9:12 AM, Sourabh Agrawal <iitr.sourabh@gmail.com>wrote:

> How do I get that statistic?
>
>
> On Wed, Feb 19, 2014 at 10:34 PM, Yogi Nerella <ynerella999@gmail.com>wrote:
>
>> Could be your -Xmn800M is too low, that is why it is trying garbage
>> collecting very frequently.
>> Do you have any statistics on how much memory it is collecting on every
>> cycle?
>>
>>
>>
>> On Wed, Feb 19, 2014 at 8:47 AM, Sourabh Agrawal <iitr.sourabh@gmail.com>wrote:
>>
>>> Below is CPU usage from top. I don't see any steal. Idle time is pretty
>>> low.
>>>
>>> Cpu(s): 83.3%us, 14.5%sy,  0.0%ni,  0.5%id,  0.0%wa,  0.0%hi,  1.7%si,
>>>  0.0%st
>>>
>>> Any other pointers?
>>>
>>>
>>> On Wed, Feb 19, 2014 at 8:34 PM, Nate McCall <nate@thelastpickle.com>wrote:
>>>
>>>> You may be seeing steal from another tenant on the VM. This article has
>>>> a good explanation:
>>>>
>>>> http://blog.scoutapp.com/articles/2013/07/25/understanding-cpu-steal-time-when-should-you-be-worried
>>>>
>>>> In short, kill the instance and launch a new one. Depending on your
>>>> latency requirements and operational ability to respond, you may want to
>>>> consider paying for dedicated instances.
>>>>
>>>>
>>>> On Wed, Feb 19, 2014 at 2:30 AM, Sourabh Agrawal <
>>>> iitr.sourabh@gmail.com> wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> I am running cassandra 2.0.3 cluster on 4 AWS nodes. memory arguments
>>>>> are the following for each node :
>>>>> -Xms8G -Xmx8G -Xmn800M
>>>>>
>>>>> I am experiencing consistent high loads on one of the nodes. Each node
>>>>> is getting approximately equal number of writes. I tried to have a look at
>>>>> the logs and seems like CMS GC is running every 1-2 seconds.
>>>>>
>>>>> Any pointers on how to debug this?
>>>>>
>>>>> --
>>>>> Sourabh Agrawal
>>>>> Bangalore
>>>>> +91 9945657973
>>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> -----------------
>>>> Nate McCall
>>>> Austin, TX
>>>> @zznate
>>>>
>>>> Co-Founder & Sr. Technical Consultant
>>>> Apache Cassandra Consulting
>>>> http://www.thelastpickle.com
>>>>
>>>
>>>
>>>
>>> --
>>> Sourabh Agrawal
>>> Bangalore
>>> +91 9945657973
>>>
>>
>>
>
>
> --
> Sourabh Agrawal
> Bangalore
> +91 9945657973
>

--001a11332b6c569ef004f2c630a6
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">You should start your Cassandra daemon with -verbose:gc (p=
lease check syntax) and then run it in foreground, as Cassandra closes the =
standard out)<div>Please see other emails in this forum for getting Garbage=
 Collection Statistics from Cassandra user mail, or look at any Java specif=
ic sites.</div>
<div><br></div><div>Ex: =A0<a href=3D"http://stackoverflow.com/questions/11=
61647/how-to-redirect-verbose-garbage-collection-output-to-a-file">http://s=
tackoverflow.com/questions/1161647/how-to-redirect-verbose-garbage-collecti=
on-output-to-a-file</a><br>
<div><br></div><div><br></div><div>It depends on what JVM you are running.<=
/div><div><div><br></div></div></div></div><div class=3D"gmail_extra"><br><=
br><div class=3D"gmail_quote">On Wed, Feb 19, 2014 at 9:12 AM, Sourabh Agra=
wal <span dir=3D"ltr">&lt;<a href=3D"mailto:iitr.sourabh@gmail.com" target=
=3D"_blank">iitr.sourabh@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">How do I get that statistic=
?</div><div class=3D"HOEnZb"><div class=3D"h5"><div class=3D"gmail_extra"><=
br><br><div class=3D"gmail_quote">
On Wed, Feb 19, 2014 at 10:34 PM, Yogi Nerella <span dir=3D"ltr">&lt;<a hre=
f=3D"mailto:ynerella999@gmail.com" target=3D"_blank">ynerella999@gmail.com<=
/a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">Could be your -Xmn800M is t=
oo low, that is why it is trying garbage collecting very frequently. =A0=A0=
<div>
Do you have any statistics on how much memory it is collecting on every cyc=
le? =A0=A0</div>
<div><br></div></div><div><div>
<div class=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Wed, Feb 1=
9, 2014 at 8:47 AM, Sourabh Agrawal <span dir=3D"ltr">&lt;<a href=3D"mailto=
:iitr.sourabh@gmail.com" target=3D"_blank">iitr.sourabh@gmail.com</a>&gt;</=
span> wrote:<br>


<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div>Below is CPU usage fro=
m top. I don&#39;t see any steal. Idle time is pretty low.</div><div><br></=
div>


Cpu(s): 83.3%us, 14.5%sy, =A00.0%ni, =A00.5%id, =A00.0%wa, =A00.0%hi, =A01.=
7%si, =A00.0%st<br><div><br></div><div>
Any other pointers?</div></div><div><div><div class=3D"gmail_extra"><br><br=
><div class=3D"gmail_quote">On Wed, Feb 19, 2014 at 8:34 PM, Nate McCall <s=
pan dir=3D"ltr">&lt;<a href=3D"mailto:nate@thelastpickle.com" target=3D"_bl=
ank">nate@thelastpickle.com</a>&gt;</span> wrote:<br>


<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr">You may be seeing steal fro=
m another tenant on the VM. This article has a good explanation:<div><a hre=
f=3D"http://blog.scoutapp.com/articles/2013/07/25/understanding-cpu-steal-t=
ime-when-should-you-be-worried" target=3D"_blank">http://blog.scoutapp.com/=
articles/2013/07/25/understanding-cpu-steal-time-when-should-you-be-worried=
</a><br>


</div><div><br></div><div>In short, kill the instance and launch a new one.=
 Depending on your latency requirements and operational ability to respond,=
 you may want to consider paying for dedicated instances.</div></div><div c=
lass=3D"gmail_extra">


<div><div>
<br><br><div class=3D"gmail_quote">On Wed, Feb 19, 2014 at 2:30 AM, Sourabh=
 Agrawal <span dir=3D"ltr">&lt;<a href=3D"mailto:iitr.sourabh@gmail.com" ta=
rget=3D"_blank">iitr.sourabh@gmail.com</a>&gt;</span> wrote:<br><blockquote=
 class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc soli=
d;padding-left:1ex">


<div dir=3D"ltr">Hi,<div><br></div><div>I am running cassandra 2.0.3 cluste=
r on 4 AWS nodes. memory arguments are the following for each node :=A0</di=
v><div>-Xms8G -Xmx8G -Xmn800M<br></div><div><br></div><div>I am experiencin=
g consistent high loads on one of the nodes. Each node is getting approxima=
tely equal number of writes. I tried to have a look at the logs and seems l=
ike CMS GC is running every 1-2 seconds.</div>


<div><br></div><div>Any pointers on how to debug this?</div><span><font col=
or=3D"#888888"><div><div><br></div>-- <br><div dir=3D"ltr">Sourabh Agrawal<=
div>Bangalore</div><div><a href=3D"tel:%2B91%209945657973" value=3D"+919945=
657973" target=3D"_blank">+91 9945657973</a></div>


</div>
</div></font></span></div>
</blockquote></div><br><br clear=3D"all"><div><br></div></div></div><span><=
font color=3D"#888888">-- <br><div dir=3D"ltr">-----------------<br>Nate Mc=
Call<br>Austin, TX<br>@zznate<br><br>Co-Founder &amp; Sr. Technical Consult=
ant<br>


Apache Cassandra Consulting<br><a href=3D"http://www.thelastpickle.com" tar=
get=3D"_blank">http://www.thelastpickle.com</a></div>

</font></span></div>
</blockquote></div><br><br clear=3D"all"><div><br></div>-- <br><div dir=3D"=
ltr">Sourabh Agrawal<div>Bangalore</div><div><a href=3D"tel:%2B91%209945657=
973" value=3D"+919945657973" target=3D"_blank">+91 9945657973</a></div></di=
v>
</div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br><br clear=3D"all"><div><br></div>-- <br>=
<div dir=3D"ltr">Sourabh Agrawal<div>Bangalore</div><div><a href=3D"tel:%2B=
91%209945657973" value=3D"+919945657973" target=3D"_blank">+91 9945657973</=
a></div>
</div>
</div>
</div></div></blockquote></div><br></div>

--001a11332b6c569ef004f2c630a6--