Subject: Re: Memory Usage During Read
From: Sanjeev Kulkarni <sanjeev@locomatix.com>
To: user@cassandra.apache.org
Date: Mon, 9 May 2011 14:44:07 -0700

Hi Adam,
We have been facing some similar issues of late. Wondering if Jonathan's suggestions worked for you.
Thanks!

On Sat, May 7, 2011 at 6:37 PM, Jonathan Ellis <jbellis@gmail.com> wrote:
The live:serialized size ratio depends on what your data looks like
(small columns will be less efficient than large blobs) but using the
rule of thumb of 10x, around 1G * (1 + memtable_flush_writers +
memtable_flush_queue_size).

So the first thing I would do is drop writers and queue to 1 and 1.
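A back-of-the-envelope version of that rule of thumb; the 10x live:serialized ratio and the writer/queue defaults below are assumptions taken from this thread, not exact Cassandra internals:

    // Rough estimate of the heap held by one hot CF's memtables: the active
    // memtable plus anything sitting in the flush pipeline, scaled by the
    // live:serialized ratio.
    public class MemtableHeapEstimate {
        static long estimateMb(long memtableThroughputMb, int flushWriters,
                               int flushQueueSize, int liveToSerializedRatio) {
            long perMemtableMb = memtableThroughputMb * liveToSerializedRatio;
            return perMemtableMb * (1 + flushWriters + flushQueueSize);
        }

        public static void main(String[] args) {
            // 128 MB memtable with assumed defaults of 1 flush writer / 4 queued flushes
            System.out.println(estimateMb(128, 1, 4, 10)); // ~7680 MB against an 8 GB heap
            // after dropping memtable_flush_writers and memtable_flush_queue_size to 1 and 1
            System.out.println(estimateMb(128, 1, 1, 10)); // ~3840 MB
        }
    }

The same arithmetic applies to every column family you have defined, so the totals add up quickly with many CFs.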

Then I would drop the max heap to 1G, memtable size to 8MB so the heap
dump is easier to analyze. Then let it OOM and look at the dump with
http://www.eclipse.org/mat/

On Sat, May 7, 2011 at 3:54 PM, Serediuk, Adam
<Adam.Serediuk@serialssolutions.com> wrote:
> How much memory should a single hot cf with a 128mb memtable take with row and key caching disabled during read?
>
> Because I'm seeing the heap skyrocket from 3.5gb straight to max (regardless of the size; 8gb and 24gb both do the same), at which point the jvm does nothing but full gc and is unable to reclaim any meaningful amount of memory. Cassandra then becomes unusable.
>
> I see the same behavior with smaller memtables, eg 64mb.
>
> This happens well into the read operation and only on a small number of nodes in the cluster (1-4 out of a total of 60 nodes).
>
> Sent from my iPhone
>
> On May 6, 2011, at 22:45, "Jonathan Ellis" <jbellis@gmail.com> wrote:
>
>> You don't GC storm without legitimately having a too-full heap. It's
>> normal to see occasional full GCs from fragmentation, but that will
>> actually compact the heap and everything goes back to normal IF you
>> had space actually freed up.
>>
>> You say you've played w/ memtable size but that would still be my bet.
>> Most people severely underestimate how much space this takes (10x in
>> memory over serialized size), which will bite you when you have lots
>> of CFs defined.
>>
>> Otherwise, force a heap dump after a full GC and take a look to see
>> what's referencing all the memory.
>>
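For anyone who wants that dump on demand rather than waiting for an OOM, a minimal sketch over JMX; the host, port, and output path are placeholders to adjust, and this is a generic HotSpot facility rather than anything Cassandra-specific:

    import java.lang.management.ManagementFactory;
    import javax.management.MBeanServerConnection;
    import javax.management.remote.JMXConnector;
    import javax.management.remote.JMXConnectorFactory;
    import javax.management.remote.JMXServiceURL;
    import com.sun.management.HotSpotDiagnosticMXBean;

    // Connects to a Cassandra node over JMX and asks HotSpot for a dump of
    // live objects only, which forces a full GC before the dump is written.
    public class LiveHeapDump {
        public static void main(String[] args) throws Exception {
            // Placeholder JMX endpoint; point it at the node's JMX port.
            JMXServiceURL url = new JMXServiceURL(
                    "service:jmx:rmi:///jndi/rmi://localhost:8080/jmxrmi");
            JMXConnector connector = JMXConnectorFactory.connect(url);
            try {
                MBeanServerConnection conn = connector.getMBeanServerConnection();
                HotSpotDiagnosticMXBean hotspot = ManagementFactory.newPlatformMXBeanProxy(
                        conn, "com.sun.management:type=HotSpotDiagnostic",
                        HotSpotDiagnosticMXBean.class);
                // live=true => full GC first, then only reachable objects; the file
                // lands on the Cassandra node's filesystem and must not already exist.
                hotspot.dumpHeap("/tmp/cassandra-heap.hprof", true);
            } finally {
                connector.close();
            }
        }
    }

jmap -dump:live,format=b,file=heap.hprof <pid> against the Cassandra process gets you the same dump from the command line, and -XX:+HeapDumpOnOutOfMemoryError will produce one automatically when it finally OOMs.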
>> On Fri, May 6, 2011 at 12:25 PM, Serediuk, Adam
>> <Adam.Serediuk@serialssolutions.com> wrote:
>>> We're troubleshooting a memory usage problem during batch reads. We've spent the last few days profiling and trying different GC settings. The symptoms are that after a certain amount of time during reads, one or more nodes in the cluster will exhibit extreme memory pressure followed by a gc storm. We've tried every possible JVM setting and different GC methods and the issue persists. This is pointing towards something instantiating a lot of objects and keeping references so that they can't be cleaned up.
>>>
>>> Typically nothing is ever logged other than the GC failures; however, just now one of the nodes emitted logs we've never seen before:
>>>
>>> INFO [ScheduledTasks:1] 2011-05-06 15:04:55,085 StorageService.java (line 2218) Unable to reduce heap usage since there are no dirty column families
>>>
>>> We have tried increasing the heap on these nodes to large values, eg 24GB and still run into the same issue. We're running 8GB of heap normally and only one or two nodes will ever exhibit this issue, randomly. We don't use key/row caching and our memtable sizing is 64mb/0.3. Larger or smaller memtables make no difference in avoiding the issue. We're on 0.7.5, mmap, jna and jdk 1.6.0_24
>>>
>>> We've somewhat hit the wall in troubleshooting and any advice is greatly appreciated.
>>>
>>> --
>>> Adam
>>>
>>
>>
>>
>> --
>> Jonathan Ellis
>> Project Chair, Apache Cassandra
>> co-founder of DataStax, the source for professional Cassandra support
>> http://www.datastax.com
>>
>
>



--
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com
