Subject: Re: Cassandra dying when gets many deletes
From: crypto five <cryptofive@gmail.com>
To: Vitalii Tymchyshyn
Cc: user@cassandra.apache.org
Date: Tue, 24 Apr 2012 23:08:03 -0700
I agree with your observations.

On the other hand, I found that ColumnFamily.size() doesn't calculate the object size correctly: it doesn't count the sizes of two Object fields and returns 0 if there are no objects in the columns container.
I increased the initial size variable to 24, which is the size of those two objects (I didn't know what the correct value should be), and Cassandra started calculating the live ratio correctly, increasing the throughput value and flushing memtables.
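
To make the accounting issue concrete, here is a minimal sketch of the pattern described above; it is not the actual Cassandra source, and the class name and the 24-byte constant are illustrative assumptions.

// Illustrative sketch only -- not the real ColumnFamily code. If size() starts
// from 0 and only sums the live columns, a row that carries nothing but a
// row-level delete reports 0 bytes, and the live-ratio math divides by zero.
import java.util.ArrayList;
import java.util.List;

public class SizeAccountingSketch {
    // Assumed overhead of the two object-typed fields (12 bytes each).
    private static final int TWO_OBJECT_FIELDS = 24;

    private final List<byte[]> columns = new ArrayList<byte[]>();

    // Before: an empty columns container (full-row delete) reports 0.
    int sizeBuggy() {
        int size = 0;
        for (byte[] column : columns)
            size += column.length;
        return size;
    }

    // After: start from the fixed field overhead, so the result is never 0.
    int sizePatched() {
        int size = TWO_OBJECT_FIELDS;
        for (byte[] column : columns)
            size += column.length;
        return size;
    }
}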

On Tue, Apr 24, 2012 at 2:00 AM, Vitalii Tymchyshyn <tivv00@gmail.com> wrote:
> Hello.
>
> For me, "there are no dirty column families" in your message suggests it's possibly the same problem.
> The issue is that column families that get only full-row deletes do not get a SINGLE dirty byte accounted, and so can't be picked by the flusher. No ratio can help, simply because it is multiplied by 0. Check your cfstats.
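
A rough sketch of the selection logic being described (illustrative only, not the real flusher code; the class and field names are made up): with zero accounted dirty bytes, multiplying by any live ratio still yields zero, so the column family never becomes a flush candidate.

// Illustrative only: why a zero dirty-byte count defeats any live ratio.
import java.util.List;

class FlushCandidate {
    final String name;
    final long dirtyBytes;   // accounted bytes; 0 for delete-only column families
    final double liveRatio;  // estimated in-memory size per serialized byte

    FlushCandidate(String name, long dirtyBytes, double liveRatio) {
        this.name = name;
        this.dirtyBytes = dirtyBytes;
        this.liveRatio = liveRatio;
    }

    long estimatedHeapUse() {
        return (long) (dirtyBytes * liveRatio); // 0 * anything == 0
    }

    // Picks the biggest consumer, or null -- "no dirty column families".
    static FlushCandidate pickLargest(List<FlushCandidate> candidates) {
        FlushCandidate best = null;
        for (FlushCandidate c : candidates) {
            if (c.estimatedHeapUse() == 0)
                continue; // delete-only CFs are skipped here forever
            if (best == null || c.estimatedHeapUse() > best.estimatedHeapUse())
                best = c;
        }
        return best;
    }
}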

> 24.04.12 09:54, crypto five wrote:
>
> Thank you, Vitalii.
>
> Looking at Jonathan's answer to your patch, I think it's probably not my case. I see that liveRatio is calculated in my case, but the calculations look strange:
>
> WARN [MemoryMeter:1] 2012-04-23 23:29:48,430 Memtable.java (line 181) setting live ratio to maximum of 64 instead of Infinity
> INFO [MemoryMeter:1] 2012-04-23 23:29:48,432 Memtable.java (line 186) CFS(Keyspace='lexems', ColumnFamily='countersCF') liveRatio is 64.0 (just-counted was 64.0). calculation took 63355ms for 0 columns
>
> Looking at the comments in the code ("If it gets higher than 64 something is probably broken."), this looks like it's probably the problem.
> Not sure how to investigate it.
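
The quoted log lines are consistent with the ratio being computed roughly as measured heap bytes divided by serialized ("throughput") bytes; with zero serialized bytes the division yields Infinity, which is then capped at 64. A small sketch of that computation (illustrative only, not the real Memtable code):

// Sketch of the computation behind the quoted log lines (not the real code).
class LiveRatioSketch {
    static final double MAX_LIVE_RATIO = 64.0;

    static double liveRatio(long measuredHeapBytes, long serializedBytes) {
        // With serializedBytes == 0 (e.g. 0 columns counted) this is Infinity.
        double ratio = (double) measuredHeapBytes / serializedBytes;
        if (ratio > MAX_LIVE_RATIO) {
            System.err.println("setting live ratio to maximum of " + MAX_LIVE_RATIO
                    + " instead of " + ratio);
            return MAX_LIVE_RATIO;
        }
        return ratio;
    }

    public static void main(String[] args) {
        System.out.println(liveRatio(128L << 20, 0)); // prints 64.0
    }
}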

> 2012/4/23 Vitalii Tymchyshyn <tivv00@gmail.com>
>
>> See https://issues.apache.org/jira/browse/CASSANDRA-3741
>> I did post a fix there that helped me.
>>
>> 2012/4/24 crypto five <cryptofive@gmail.com>
>>> Hi,
>>>
>>> I have 50 million rows in a column family on a 4 GB RAM box. I allocated 2 GB to Cassandra.
>>> I have a program which traverses this CF and cleans some data there; it generates about 20k delete statements per second.
>>> After about 3 million deletions Cassandra stops responding to queries: it doesn't react to the CLI, nodetool, etc.
>>> I see in the logs that it tries to free some memory but can't, even if I wait a whole day.
>>> Also I see the following in the logs:
>>>
>>> INFO [ScheduledTasks:1] 2012-04-23 18:38:13,333 StorageService.java (line 2647) Unable to reduce heap usage since there are no dirty column families
>>>
>>> When I look at the memory dump I see that memory goes to ConcurrentSkipListMap (10%), HeapByteBuffer (13%), DecoratedKey (6%), int[] (6%), BigInteger (8.2%), ConcurrentSkipListMap$HeadIndex (7.2%), ColumnFamily (6.5%), ThreadSafeSortedColumns (13.7%), long[] (5.9%).
>>>
>>> What can I do to make Cassandra stop dying?
>>> Why can't it free the memory?
>>> Any ideas?
>>>
>>> Thank you.



>> --
>> Best regards,
>> Vitalii Tymchyshyn
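
For scale, a back-of-envelope estimate using the numbers from the quoted thread (the ~300 bytes per retained row tombstone is an assumption for illustration, not a measured figure): if the delete-only memtable is never flushed, a few million row tombstones plus their keys and skip-list nodes can pin a large fraction of a 2 GB heap, which fits the memory-dump breakdown above.

// Back-of-envelope only; the per-tombstone figure is assumed, not measured.
public class TombstoneHeapEstimate {
    public static void main(String[] args) {
        long deletions = 3000000L;                // deletes applied before the node froze
        long bytesPerTombstone = 300L;            // assumed: key bytes + DecoratedKey + map nodes + ColumnFamily
        long heapBytes = 2L * 1024 * 1024 * 1024; // 2 GB allocated to Cassandra

        long used = deletions * bytesPerTombstone;
        System.out.printf("~%d MB of heap, ~%.0f%% of the 2 GB allocation%n",
                used / (1024 * 1024), 100.0 * used / heapBytes);
    }
}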


