Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: neutral (athena.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <BD145ED3-10A8-4C9C-8E33-3E7E1A97C018@gmx.net>
References: <75CABA7A-F053-4084-AF9A-101114C72614@gmx.net>
	<CAB-=z43QYH=TGcz+dpw_6Tkfhc=dV+ESeWbEDs90qbdAnMLEZw@mail.gmail.com>
	<FA5AE330-729E-4945-9DE0-F54513A27668@gmx.net>
	<CAB-=z42iHihR8SVhDrbVpfyJjYSsPCKes1ZoxKvcJe9AXKdaDA@mail.gmail.com>
	<BD145ED3-10A8-4C9C-8E33-3E7E1A97C018@gmx.net>
Date: Mon, 4 Jul 2011 13:18:48 -0400
Message-ID: 
 <CAB-=z420zVfNuJufgQ_E9obpfZAeQ-ojNWZki7Z6hWr4WVrb5A@mail.gmail.com>
Subject: Re: Cassandra memory problem
From: Sebastien Coutu <scoutu@openplaces.org>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=005045015a66788a5c04a7418f29

--005045015a66788a5c04a7418f29
Content-Type: text/plain; charset=ISO-8859-1

Hi Daniel,

Yes we do see it, since I've added the JNA libraries, it takes a bit more
time at that step and locks all the memory. We're using JNA 3.3.0 we've
downloaded from there:

https://github.com/twall/jna#readme

<https://github.com/twall/jna#readme>Our servers currently have 32GB of
memory and we've assigned 12GB of memory to the Cassandra JVM. We're seeing
the following in the logs:

 INFO [main] 2011-06-27 11:43:14,605 AbstractCassandraDaemon.java (line 97)
Heap size: 11811160064/11811160064
 INFO [main] 2011-06-27 11:43:21,272 CLibrary.java (line 106) JNA mlockall
successful
 INFO [main] 2011-06-27 11:43:21,292 DatabaseDescriptor.java (line 121)
Loading settings from
file:/home/hadoop/bin/cassandra/yul01fct/conf/cassandra.yaml
 INFO [main] 2011-06-27 11:43:21,404 DatabaseDescriptor.java (line 181)
DiskAccessMode 'auto' determined to be mmap, indexAccessMode is mmap

On the servers, we're seeing a lot of system memory assigned to cache that
is "reassigned" to used memory when the applications running on the system
really needs it. We're not seeing any swapped memory because we've tweaked
swappiness and every application running on the system. We're monitoring the
performance of that cluster with Ganglia and see the memory "movement" from
the standard graphs produced.

Regards,

SC

On Mon, Jul 4, 2011 at 12:33 PM, Daniel Doubleday
<daniel.doubleday@gmx.net>wrote:

> Hi Sebastian,
>
> one question: do you use jna.jar and do you see JNA mlockall successful in
> your logs.
> There's that wild theory here that our problem might be related to mlockall
> and no swap.
> Maybe the JVM does some realloc stuff and the pinned pages are not cleared
> ...
>
> but that's really only wild guessing.
>
> Also you are saying that on your servers res mem is not > max heap and the
> java process is not swapping?
>
> Thanks,
> Daniel
>
> On Jul 4, 2011, at 6:04 PM, Sebastien Coutu wrote:
>
> It was among one of the issues we had. One of our hosts was using OpenJDK
> and we've switched it to Sun and this part of the issue stabilized. The
> other issues we had were Heap going through the roof and then OOM under
> load.
>
>
> On Mon, Jul 4, 2011 at 11:01 AM, Daniel Doubleday <
> daniel.doubleday@gmx.net> wrote:
>
>> Just to make sure:
>> You were seeing that res mem was more than twice of max java heap and that
>> did change after you tweaked GC settings?
>>
>> Note that I am not having a heap / gc problem. The VM itself thinks
>> everything is golden.
>>
>> On Jul 4, 2011, at 3:41 PM, Sebastien Coutu wrote:
>>
>> We had an issue like that a short while ago here. This was mainly
>> happening under heavy load and we managed to stabilize it by tweaking the
>> Young/Old space ratio of the JVM and by also tweaking the tenuring
>> thresholds/survivor ratios. What kind of load to you have on your systems?
>> Mostly reads, writes?
>>
>> SC
>>
>> On Mon, Jul 4, 2011 at 6:52 AM, Daniel Doubleday <
>> daniel.doubleday@gmx.net> wrote:
>>
>>> Hi all,
>>>
>>> we have a mem problem with cassandra. res goes up without bounds (well
>>> until the os kills the process because we dont have swap)
>>>
>>> I found a thread that's about the same problem but on OpenJDK:
>>>
>>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Very-high-memory-utilization-not-caused-by-mmap-on-sstables-td5840777.html
>>>
>>> We are on Debian with Sun JDK.
>>>
>>> Resident mem is 7.4G while heap is restricted to 3G.
>>>
>>> Anyone else is seeing this with Sun JDK?
>>>
>>> Cheers,
>>> Daniel
>>>
>>> :/home/dd# java -version
>>> java version "1.6.0_24"
>>> Java(TM) SE Runtime Environment (build 1.6.0_24-b07)
>>> Java HotSpot(TM) 64-Bit Server VM (build 19.1-b02, mixed mode)
>>>
>>> :/home/dd# ps aux |grep java
>>> cass     28201  9.5 46.8 372659544 7707172 ?   SLl  May24 5656:21
>>> /usr/bin/java -ea -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42
>>> -Xms3000M -Xmx3000M -Xmn400M ...
>>>
>>>   PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
>>>
>>>
>>> 28201 cass      20   0  355g 7.4g 1.4g S    8 46.9   5656:25 java
>>>
>>>
>>>
>>>
>>
>>
>
>

--005045015a66788a5c04a7418f29
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Hi Daniel,<div><br></div><div>Yes we do see it, since I&#39;ve added the JN=
A libraries, it takes a bit more time at that step and locks all the memory=
. We&#39;re using JNA 3.3.0 we&#39;ve downloaded from there:</div><div><br>
</div><div><meta http-equiv=3D"content-type" content=3D"text/html; charset=
=3Dutf-8"><a href=3D"https://github.com/twall/jna#readme">https://github.co=
m/twall/jna#readme</a></div><div><br></div><div><a href=3D"https://github.c=
om/twall/jna#readme"></a>Our servers currently have 32GB of memory and we&#=
39;ve assigned 12GB of memory to the Cassandra JVM. We&#39;re seeing the fo=
llowing in the logs:</div>
<div><br></div><div><div>=A0INFO [main] 2011-06-27 11:43:14,605 AbstractCas=
sandraDaemon.java (line 97) Heap size: 11811160064/11811160064</div><div>=
=A0INFO [main] 2011-06-27 11:43:21,272 CLibrary.java (line 106) JNA mlockal=
l successful</div>
<div>=A0INFO [main] 2011-06-27 11:43:21,292 DatabaseDescriptor.java (line 1=
21) Loading settings from file:/home/hadoop/bin/cassandra/yul01fct/conf/cas=
sandra.yaml</div><div>=A0INFO [main] 2011-06-27 11:43:21,404 DatabaseDescri=
ptor.java (line 181) DiskAccessMode &#39;auto&#39; determined to be mmap, i=
ndexAccessMode is mmap</div>
<div><br></div><div>On the servers, we&#39;re seeing a lot of system memory=
 assigned to cache that is &quot;reassigned&quot; to used memory when the a=
pplications running on the system really needs it. We&#39;re not seeing any=
 swapped memory because we&#39;ve tweaked swappiness and every application =
running on the system. We&#39;re monitoring the performance of that cluster=
 with Ganglia and see the memory &quot;movement&quot; from the standard gra=
phs produced.</div>
<div><br></div><div>Regards,</div><div><br></div><div>SC</div><br><div clas=
s=3D"gmail_quote">On Mon, Jul 4, 2011 at 12:33 PM, Daniel Doubleday <span d=
ir=3D"ltr">&lt;<a href=3D"mailto:daniel.doubleday@gmx.net">daniel.doubleday=
@gmx.net</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex;"><div style=3D"word-wrap:break-word"><div>Hi=
 Sebastian,</div><div><br></div>one question: do you use jna.jar and do you=
 see=A0JNA mlockall successful in your logs.<div>
There&#39;s that wild theory here that our problem might be related to mloc=
kall and no swap.=A0</div><div>Maybe the JVM does some realloc stuff and th=
e pinned pages are not cleared ...=A0</div><div><br></div><div>but that&#39=
;s really only wild guessing.</div>
<div><br></div><div>Also you are saying that on your servers res mem is not=
 &gt; max heap and the java process is not swapping?</div><div><br></div><d=
iv>Thanks,</div><div>Daniel</div><div><div><font color=3D"#888888"><br></fo=
nt><div>
<div class=3D"im"><div>On Jul 4, 2011, at 6:04 PM, Sebastien Coutu wrote:</=
div><br></div><div><div></div><div class=3D"h5"><blockquote type=3D"cite">I=
t was among one of the issues we had. One of our hosts was using OpenJDK an=
d we&#39;ve switched it to Sun and this part of the issue stabilized. The o=
ther issues we had were Heap going through the roof and then OOM under load=
.<div>

<br><br><div class=3D"gmail_quote">On Mon, Jul 4, 2011 at 11:01 AM, Daniel =
Doubleday <span dir=3D"ltr">&lt;<a href=3D"mailto:daniel.doubleday@gmx.net"=
 target=3D"_blank">daniel.doubleday@gmx.net</a>&gt;</span> wrote:<br><block=
quote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc=
 solid;padding-left:1ex">

<div style=3D"word-wrap:break-word">Just to make sure:=A0<div>You were seei=
ng that res mem was more than twice of max java heap and that did change af=
ter you tweaked GC settings?<div><br></div><div>Note that I am not having a=
 heap / gc problem. The VM itself thinks everything is golden.</div>

<div><div></div><div><div><br><div><div>On Jul 4, 2011, at 3:41 PM, Sebasti=
en Coutu wrote:</div><br><blockquote type=3D"cite">We had an issue like tha=
t a short while ago here. This was mainly happening under heavy load and we=
 managed to stabilize it by tweaking the Young/Old space ratio of the JVM a=
nd by also tweaking the tenuring thresholds/survivor ratios. What kind of l=
oad to you have on your systems? Mostly reads, writes?<div>


<br></div><div>SC<br><div><br><div class=3D"gmail_quote">On Mon, Jul 4, 201=
1 at 6:52 AM, Daniel Doubleday <span dir=3D"ltr">&lt;<a href=3D"mailto:dani=
el.doubleday@gmx.net" target=3D"_blank">daniel.doubleday@gmx.net</a>&gt;</s=
pan> wrote:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">
<div style=3D"word-wrap:break-word">Hi all,<div><br></div><div>we have a me=
m problem with cassandra. res goes up without bounds (well until the os kil=
ls the process because we dont have swap)</div><div><br></div><div>I found =
a thread that&#39;s about the same problem but on OpenJDK:=A0</div>


<div><a href=3D"http://cassandra-user-incubator-apache-org.3065146.n2.nabbl=
e.com/Very-high-memory-utilization-not-caused-by-mmap-on-sstables-td5840777=
.html" target=3D"_blank">http://cassandra-user-incubator-apache-org.3065146=
.n2.nabble.com/Very-high-memory-utilization-not-caused-by-mmap-on-sstables-=
td5840777.html</a></div>


<div><br></div><div>We are on Debian with Sun JDK.</div><div><br></div><div=
>Resident mem is 7.4G while heap is restricted to 3G.</div><div><br></div><=
div>Anyone else is seeing this with Sun JDK?</div><div><br></div><div>

Cheers,</div>
<div>Daniel</div><div><br></div><div><div>:/home/dd# java -version</div><di=
v>java version &quot;1.6.0_24&quot;</div><div>Java(TM) SE Runtime Environme=
nt (build 1.6.0_24-b07)</div><div>Java HotSpot(TM) 64-Bit Server VM (build =
19.1-b02, mixed mode)</div>


</div><div><br></div><div>:/home/dd# ps aux |grep java</div><div>cass =A0 =
=A0 28201 =A09.5 46.8 372659544 7707172 ? =A0 SLl =A0May24 5656:21 /usr/bin=
/java -ea -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=3D42 -Xms3000M =
-Xmx3000M -Xmn400M ...</div>


<div><br></div><div><div>=A0=A0PID USER =A0 =A0 =A0PR =A0NI =A0VIRT =A0RES =
=A0SHR S %CPU %MEM =A0 =A0TIME+ =A0COMMAND =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0=
 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0=
 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0=A0</div>


<div>28201 cass =A0 =A0 =A020 =A0 0 =A0355g 7.4g 1.4g S =A0 =A08 46.9 =A0 5=
656:25 java</div></div><div><br></div><div><br></div><div><br></div></div><=
/blockquote></div><br></div></div>
</blockquote></div><br></div></div></div></div></div></blockquote></div><br=
></div>
</blockquote></div></div></div><br></div></div></div></blockquote></div><br=
></div>

--005045015a66788a5c04a7418f29--