From: Eric Stevens <mightye@gmail.com>
Date: Fri, 16 Jan 2015 07:13:53 -0700
Subject: Re: Many really small SSTables
To: user@cassandra.apache.org
Cc: j.kesten@enercast.de

There's another thread going on right now on this list about compactions not happening when they seemingly should. Tyler Hobbs postulates a bug and a workaround for it, so maybe try that out, and if it fixes anything for you, certainly let him know. The bug Tyler postulates is triggered by a write-heavy, zero-read workload, and if you're testing data loading, maybe you're triggering that.

Also, it's probably a long shot, but make sure your SSTable counts haven't gone down since you last looked. If you're load testing your cluster and throwing big bursts of writes at it, you can temporarily fall behind on compaction (accumulating a large number of sstables), and if you stop writes you can catch back up again as a result - maybe you looked at table counts during loading, and at compactionstats during a quiet time.

On Fri, Jan 16, 2015 at 12:37 AM, Jan Kesten <j.kesten@enercast.de> wrote:

> Hi Eric and all,
>
> I almost expected this kind of answer. I did a nodetool compactionstats
> already to see if those sstables are being compacted, but on all nodes
> there are 0 outstanding compactions (right now in the morning, not running
> any tests on this cluster).
>
> The reported read latency is about 1-3 ms on nodes which have many
> sstables (the new high score is ~18k sstables). The 99th percentile is
> about 30-40 micros, with a cell count of about 80-90 (if I got the docs
> right, these are the number of sstables accessed; that changed from 2.0
> to 2.1 I think, as I see this only on the testing cluster).
>
> It looks to me like compactions were not triggered. I tried a nodetool
> compact on one node overnight - but that crashed the entire node.
>
> Roland
>
> Am 15.01.2015 um 19:14 schrieb Eric Stevens:
>
> Yes, many sstables can have a huge negative impact on read performance,
> and will also create memory pressure on that node.
>
> There are a lot of things which can produce this effect, and it strongly
> suggests you're falling behind on compaction in general (check nodetool
> compactionstats; you should have <5 outstanding/pending, preferably 0-1).
> To see whether and how much it is impacting your read performance, check
> nodetool cfstats <keyspace.table> and nodetool cfhistograms <keyspace> <table>.
>
> On Thu, Jan 15, 2015 at 2:11 AM, Roland Etzenhammer <
> r.etzenhammer@t-online.de> wrote:
>
>> Hi,
>>
>> I'm testing around with Cassandra a fair bit, using 2.1.2, which I know
>> has some major issues, but it is a test environment. After some bulk
>> loading, testing with incremental repairs, and running out of heap once,
>> I found that I now have a quite large number of sstables which are
>> really small:
>>
>> <1k              0     0.0%
>> <10k          2780    76.8%
>> <100k         3392    93.7%
>> <1000k        3461    95.6%
>> <10000k       3471    95.9%
>> <100000k      3517    97.1%
>> <1000000k     3596    99.3%
>> all           3621   100.0%
>>
>> 76.8% of all sstables in this particular column family are smaller than
>> 10 kB; 93.7% are smaller than 100 kB.
>>
>> Just for my understanding - does that impact performance? And is there
>> any way to reduce the number of sstables? A full run of nodetool compact
>> runs for a really long time (more than 1 day).
>>
>> Thanks for any input,
>> Roland
>
> --
> Jan Kesten, Systemadministration, enercast GmbH
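A size breakdown like the one Roland posted can be generated straight from a node's data directory. Here is a rough sketch; the default path `/var/lib/cassandra/data` and the `*Data.db` file naming are assumptions based on a common package layout, so point it at your own data directory:

```shell
#!/bin/sh
# Bucket SSTable data files by size, cumulatively, like the histogram
# in this thread (<1k also counts toward <10k, <100k, and so on).
# The default path is an assumption; pass your node's data directory.
sstable_histogram() {
    dir="${1:-/var/lib/cassandra/data}"
    find "$dir" -name '*Data.db' -printf '%s\n' 2>/dev/null | awk '
      { n++ }
      $1 <   1024 { a++ }
      $1 <  10240 { b++ }
      $1 < 102400 { c++ }
      END {
        printf "%-10s%6d\n", "<1k",   a
        printf "%-10s%6d\n", "<10k",  b
        printf "%-10s%6d\n", "<100k", c
        printf "%-10s%6d\n", "all",   n
      }'
}

sstable_histogram "$@"
```

Running this periodically during a load test would also show whether the count is still climbing or compaction is catching back up, per the point above about bursty writes.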
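To watch for falling behind on compaction (the <5 pending, preferably 0-1 guideline quoted above), the pending-task count can be pulled out of `nodetool compactionstats` for a quick check or an alert. A minimal sketch - the "pending tasks: N" output line matches 2.1-era nodetool, but treat the exact format as an assumption and verify against your version:

```shell
#!/bin/sh
# Extract the pending compaction count from `nodetool compactionstats`
# output on stdin; prints -1 if the expected line is missing (format
# is an assumption from 2.1-era nodetool).
parse_pending() {
    awk -F': *' '/^pending tasks/ { print $2; found=1 }
                 END { if (!found) print -1 }'
}

# Warn when a node has more than 5 pending compactions.
# (A -1 parse failure falls through to the OK branch; a real check
# would treat it as an error.)
check_compactions() {
    pending=$(nodetool compactionstats | parse_pending)
    if [ "$pending" -gt 5 ]; then
        echo "WARN: $pending pending compactions"
    else
        echo "OK: $pending pending compactions"
    fi
}
```

Wired into cron or a monitoring agent, this would have flagged the backlog during loading rather than after the fact.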