Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: error (athena.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <5213E718.7030203@gmail.com>
References: <5212A108.9050804@gmail.com>
	<CAKmMYa_tgGc=BWtO0cyBphcnUFaocPd-dGxvndENrCCEtRLjTQ@mail.gmail.com>
	<5212D106.4050700@gmail.com>
	<CA+BDQ7xudiieQDgs5dith23=X-1JO0ETBc6ZHniu4=7eh+cjjQ@mail.gmail.com>
	<5213757D.8030106@gmail.com>
	<CAKmMYa-WvRghtd=L0NsK4yUf7hd6Nbrfq6tYgDPO02AeyzQfWA@mail.gmail.com>
	<5213E718.7030203@gmail.com>
Date: Tue, 20 Aug 2013 18:06:08 -0500
Message-ID: 
 <CAKmMYa94E6d2+qFyPfWe81m82J=E8UEtDJ8o-cTYf34-HFgNnw@mail.gmail.com>
Subject: Re: insert performance (1.2.8)
From: Nate McCall <nate@thelastpickle.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=047d7b2e4d7c23b40204e4691a53

--047d7b2e4d7c23b40204e4691a53
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

Ugh - sorry, I knew Sylvain and Micha=EBl had worked on this recently but i=
t
is only in 2.0 - I could have sworn it got marked for inclusion back into
1.2 but I was wrong:
https://issues.apache.org/jira/browse/CASSANDRA-4693

This is indeed an issue if you don't know the column count before hand (or
had a very large number of them like in your case). Again, apologies, I
would not have recommended that route if I knew it was only in 2.0.

I would be willing to bet you could hit those insert numbers pretty easily
with thrift given the shape of your mutation.


On Tue, Aug 20, 2013 at 5:00 PM, Keith Freeman <8forty@gmail.com> wrote:

>  So I tried inserting prepared statements separately (no batch), and my
> server nodes load definitely dropped significantly.  Throughput from my
> client improved a bit, but only a few %.  I was able to *almost* get 5000
> rows/sec (sort of) by also reducing the rows/insert-thread to 20-50 and
> eliminating all overhead from the timing, i.e. timing only the tight for
> loop of inserts.  But that's still a lot slower than I expected.
>
> I couldn't do batches because the driver doesn't allow prepared statement=
s
> in a batch (QueryBuilder API).  It appears the batch itself could possibl=
y
> be a prepared statement, but since I have 40+ columns on each insert that
> would take some ugly code to build so I haven't tried it yet.
>
> I'm using CL "ONE" on the inserts and RF 2 in my schema.
>
>
> On 08/20/2013 08:04 AM, Nate McCall wrote:
>
> John makes a good point re:prepared statements (I'd increase batch sizes
> again once you did this as well - separate, incremental runs of course so
> you can gauge the effect of each). That should take out some of the
> processing overhead of statement validation in the server (some - that lo=
ad
> spike still seems high though).
>
>  I'd actually be really interested as to what your results were after
> doing so - i've not tried any A/B testing here for prepared statements on
> inserts.
>
>  Given your load is on the server, i'm not sure adding more async
> indirection on the client would buy you too much though.
>
>  Also, at what RF and consistency level are you writing?
>
>
> On Tue, Aug 20, 2013 at 8:56 AM, Keith Freeman <8forty@gmail.com> wrote:
>
>>  Ok, I'll try prepared statements.   But while sending my statements
>> async might speed up my client, it wouldn't improve throughput on the
>> cassandra nodes would it?  They're running at pretty high loads and only
>> about 10% idle, so my concern is that they can't handle the data any
>> faster, so something's wrong on the server side.  I don't really think
>> there's anything on the client side that matters for this problem.
>>
>> Of course I know there are obvious h/w things I can do to improve server
>> performance: SSDs, more RAM, more cores, etc.  But I thought the servers=
 I
>> have would be able to handle more rows/sec than say Mysql, since write
>> speed is supposed to be one of Cassandra's strengths.
>>
>>
>> On 08/19/2013 09:03 PM, John Sanda wrote:
>>
>> I'd suggest using prepared statements that you initialize at application
>> start up and switching to use Session.executeAsync coupled with Google
>> Guava Futures API to get better throughput on the client side.
>>
>>
>> On Mon, Aug 19, 2013 at 10:14 PM, Keith Freeman <8forty@gmail.com> wrote=
:
>>
>>>  Sure, I've tried different numbers for batches and threads, but
>>> generally I'm running 10-30 threads at a time on the client, each sendi=
ng a
>>> batch of 100 insert statements in every call, using the
>>> QueryBuilder.batch() API from the latest datastax java driver, then cal=
ling
>>> the Session.execute() function (synchronous) on the Batch.
>>>
>>> I can't post my code, but my client does this on each iteration:
>>> -- divides up the set of inserts by the number of threads
>>> -- stores the current time
>>> -- tells all the threads to send their inserts
>>> -- then when they've all returned checks the elapsed time
>>>
>>> At about 2000 rows for each iteration, 20 threads with 100 inserts each
>>> finish in about 1 second.  For 4000 rows, 40 threads with 100 inserts e=
ach
>>> finish in about 1.5 - 2 seconds, and as I said all 3 cassandra nodes ha=
ve a
>>> heavy CPU load while the client is hardly loaded.  I've tried with 10
>>> threads and more inserts per batch, or up to 60 threads with fewer, doe=
sn't
>>> seem to make a lot of difference.
>>>
>>>
>>> On 08/19/2013 05:00 PM, Nate McCall wrote:
>>>
>>>  How big are the batch sizes? In other words, how many rows are you
>>> sending per insert operation?
>>>
>>>  Other than the above, not much else to suggest without seeing some
>>> example code (on pastebin, gist or similar, ideally).
>>>
>>> On Mon, Aug 19, 2013 at 5:49 PM, Keith Freeman <8forty@gmail.com> wrote=
:
>>>
>>>> I've got a 3-node cassandra cluster (16G/4-core VMs ESXi v5 on 2.5Ghz
>>>> machines not shared with any other VMs).  I'm inserting time-series da=
ta
>>>> into a single column-family using "wide rows" (timeuuids) and have a 3=
-part
>>>> partition key so my primary key is something like ((a, b, day),
>>>> in-time-uuid), x, y, z).
>>>>
>>>> My java client is feeding rows (about 1k of raw data size each) in
>>>> batches using multiple threads, and the fastest I can get it run relia=
bly
>>>> is about 2000 rows/second.  Even at that speed, all 3 cassandra nodes =
are
>>>> very CPU bound, with loads of 6-9 each (and the client machine is hard=
ly
>>>> breaking a sweat).  I've tried turning off compression in my table whi=
ch
>>>> reduced the loads slightly but not much.  There are no other updates o=
r
>>>> reads occurring, except the datastax opscenter.
>>>>
>>>> I was expecting to be able to insert at least 10k rows/second with thi=
s
>>>> configuration, and after a lot of reading of docs, blogs, and google, =
can't
>>>> really figure out what's slowing my client down.  When I increase the
>>>> insert speed of my client beyond 2000/second, the server responses are=
 just
>>>> too slow and the client falls behind.  I had a single-node Mysql datab=
ase
>>>> that can handle 10k of these data rows/second, so I really feel like I=
'm
>>>> missing something in Cassandra.  Any ideas?
>>>>
>>>>
>>>
>>>
>>
>>
>>  --
>>
>> - John
>>
>>
>>
>
>

--047d7b2e4d7c23b40204e4691a53
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Ugh - sorry, I knew Sylvain and=A0<span style=3D"color:rgb=
(0,0,0);font-family:Arial,FreeSans,Helvetica,sans-serif;font-size:13px;line=
-height:17px;text-align:right">Micha=EBl had worked on this recently but it=
 is only in 2.0 - I could have sworn it got marked for=A0</span>inclusion b=
ack into 1.2 but I was wrong:<div>
<a href=3D"https://issues.apache.org/jira/browse/CASSANDRA-4693">https://is=
sues.apache.org/jira/browse/CASSANDRA-4693</a><br></div><div><br></div><div=
>This is indeed an issue if you don&#39;t know the column count before hand=
 (or had a very large number of them like in your case). Again, apologies, =
I would not have recommended that route if I knew it was only in 2.0.=A0</d=
iv>
<div><br></div><div>I would be willing to bet you could hit those insert nu=
mbers pretty easily with thrift given the shape of your mutation.=A0</div><=
/div><div class=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Tue, =
Aug 20, 2013 at 5:00 PM, Keith Freeman <span dir=3D"ltr">&lt;<a href=3D"mai=
lto:8forty@gmail.com" target=3D"_blank">8forty@gmail.com</a>&gt;</span> wro=
te:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">
 =20
   =20
 =20
  <div text=3D"#000000" bgcolor=3D"#FFFFFF">
    So I tried inserting prepared statements separately (no batch), and
    my server nodes load definitely dropped significantly.=A0 Throughput
    from my client improved a bit, but only a few %.=A0 I was able to
    *almost* get 5000 rows/sec (sort of) by also reducing the
    rows/insert-thread to 20-50 and eliminating all overhead from the
    timing, i.e. timing only the tight for loop of inserts.=A0 But that&#39=
;s
    still a lot slower than I expected.<br>
    <br>
    I couldn&#39;t do batches because the driver doesn&#39;t allow prepared
    statements in a batch (QueryBuilder API).=A0 It appears the batch
    itself could possibly be a prepared statement, but since I have 40+
    columns on each insert that would take some ugly code to build so I
    haven&#39;t tried it yet.<br>
    <br>
    I&#39;m using CL &quot;ONE&quot; on the inserts and RF 2 in my schema.<=
div><div class=3D"h5"><br>
    <br>
    <div>On 08/20/2013 08:04 AM, Nate McCall
      wrote:<br>
    </div>
    <blockquote type=3D"cite">
      <div dir=3D"ltr">John makes a good point re:prepared statements (I=
9;d
        increase batch sizes again once you did this as well - separate,
        incremental runs of course so you can gauge the effect of each).
        That should take out some of the processing overhead of
        statement validation in the server (some - that load spike still
        seems high though).=A0
        <div>
          <br>
        </div>
        <div>I&#39;d actually be really interested as to what your results
          were after doing so - i&#39;ve not tried any A/B testing here for
          prepared statements on inserts.=A0</div>
        <div><br>
        </div>
        <div>Given your load is on the server, i&#39;m not sure adding more
          async indirection on the client would buy you too much
          though.=A0</div>
        <div><br>
        </div>
        <div>Also, at what RF and consistency level are you writing?</div>
      </div>
      <div class=3D"gmail_extra"><br>
        <br>
        <div class=3D"gmail_quote">On Tue, Aug 20, 2013 at 8:56 AM, Keith
          Freeman <span dir=3D"ltr">&lt;<a href=3D"mailto:8forty@gmail.com"=
 target=3D"_blank">8forty@gmail.com</a>&gt;</span>
          wrote:<br>
          <blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;bord=
er-left:1px #ccc solid;padding-left:1ex">
            <div text=3D"#000000" bgcolor=3D"#FFFFFF"> Ok, I&#39;ll try pre=
pared
              statements.=A0=A0 But while sending my statements async might
              speed up my client, it wouldn&#39;t improve throughput on the
              cassandra nodes would it?=A0 They&#39;re running at pretty hi=
gh
              loads and only about 10% idle, so my concern is that they
              can&#39;t handle the data any faster, so something&#39;s wron=
g on
              the server side.=A0 I don&#39;t really think there&#39;s anyt=
hing on
              the client side that matters for this problem.<br>
              <br>
              Of course I know there are obvious h/w things I can do to
              improve server performance: SSDs, more RAM, more cores,
              etc.=A0 But I thought the servers I have would be able to
              handle more rows/sec than say Mysql, since write speed is
              supposed to be one of Cassandra&#39;s strengths.
              <div>
                <div><br>
                  <br>
                  <div>On 08/19/2013 09:03 PM, John Sanda wrote:<br>
                  </div>
                  <blockquote type=3D"cite">
                    <div dir=3D"ltr">I&#39;d suggest using prepared stateme=
nts
                      that you initialize at application start up and
                      switching to use Session.executeAsync coupled with
                      Google Guava Futures API to get better throughput
                      on the client side.</div>
                    <div class=3D"gmail_extra"><br>
                      <br>
                      <div class=3D"gmail_quote">On Mon, Aug 19, 2013 at
                        10:14 PM, Keith Freeman <span dir=3D"ltr">&lt;<a hr=
ef=3D"mailto:8forty@gmail.com" target=3D"_blank">8forty@gmail.com</a>&gt;</=
span>
                        wrote:<br>
                        <blockquote class=3D"gmail_quote" style=3D"margin:0=
 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
                          <div bgcolor=3D"#FFFFFF" text=3D"#000000"> Sure,
                            I&#39;ve tried different numbers for batches an=
d
                            threads, but generally I&#39;m running 10-30
                            threads at a time on the client, each
                            sending a batch of 100 insert statements in
                            every call, using the QueryBuilder.batch()
                            API from the latest datastax java driver,
                            then calling the Session.execute() function
                            (synchronous) on the Batch.<br>
                            <br>
                            I can&#39;t post my code, but my client does
                            this on each iteration:<br>
                            -- divides up the set of inserts by the
                            number of threads<br>
                            -- stores the current time<br>
                            -- tells all the threads to send their
                            inserts<br>
                            -- then when they&#39;ve all returned checks th=
e
                            elapsed time<br>
                            <br>
                            At about 2000 rows for each iteration, 20
                            threads with 100 inserts each finish in
                            about 1 second.=A0 For 4000 rows, 40 threads
                            with 100 inserts each finish in about 1.5 -
                            2 seconds, and as I said all 3 cassandra
                            nodes have a heavy CPU load while the client
                            is hardly loaded.=A0 I&#39;ve tried with 10
                            threads and more inserts per batch, or up to
                            60 threads with fewer, doesn&#39;t seem to make
                            a lot of difference.
                            <div>
                              <div><br>
                                <br>
                                <div>On 08/19/2013 05:00 PM, Nate McCall
                                  wrote:<br>
                                </div>
                                <blockquote type=3D"cite">
                                  <div dir=3D"ltr">
                                    <div class=3D"gmail_extra">How big are
                                      the batch sizes? In other words,
                                      how many rows are you sending per
                                      insert operation?</div>
                                    <div class=3D"gmail_extra"><br>
                                    </div>
                                    <div class=3D"gmail_extra">Other than
                                      the above, not much else to
                                      suggest without seeing some
                                      example code (on pastebin, gist or
                                      similar, ideally).=A0<br>
                                      <br>
                                      <div class=3D"gmail_quote">On Mon,
                                        Aug 19, 2013 at 5:49 PM, Keith
                                        Freeman <span dir=3D"ltr">&lt;<a hr=
ef=3D"mailto:8forty@gmail.com" target=3D"_blank">8forty@gmail.com</a>&gt;</=
span>
                                        wrote:<br>
                                        <blockquote class=3D"gmail_quote" s=
tyle=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> I&#=
39;ve
                                          got a 3-node cassandra cluster
                                          (16G/4-core VMs ESXi v5 on
                                          2.5Ghz machines not shared
                                          with any other VMs). =A0I&#39;m
                                          inserting time-series data
                                          into a single column-family
                                          using &quot;wide rows&quot; (time=
uuids)
                                          and have a 3-part partition
                                          key so my primary key is
                                          something like ((a, b, day),
                                          in-time-uuid), x, y, z).<br>
                                          <br>
                                          My java client is feeding rows
                                          (about 1k of raw data size
                                          each) in batches using
                                          multiple threads, and the
                                          fastest I can get it run
                                          reliably is about 2000
                                          rows/second. =A0Even at that
                                          speed, all 3 cassandra nodes
                                          are very CPU bound, with loads
                                          of 6-9 each (and the client
                                          machine is hardly breaking a
                                          sweat). =A0I&#39;ve tried turning
                                          off compression in my table
                                          which reduced the loads
                                          slightly but not much. =A0There
                                          are no other updates or reads
                                          occurring, except the datastax
                                          opscenter.<br>
                                          <br>
                                          I was expecting to be able to
                                          insert at least 10k
                                          rows/second with this
                                          configuration, and after a lot
                                          of reading of docs, blogs, and
                                          google, can&#39;t really figure
                                          out what&#39;s slowing my client
                                          down. =A0When I increase the
                                          insert speed of my client
                                          beyond 2000/second, the server
                                          responses are just too slow
                                          and the client falls behind.
                                          =A0I had a single-node Mysql
                                          database that can handle 10k
                                          of these data rows/second, so
                                          I really feel like I&#39;m missin=
g
                                          something in Cassandra. =A0Any
                                          ideas?<br>
                                          <br>
                                        </blockquote>
                                      </div>
                                      <br>
                                    </div>
                                  </div>
                                </blockquote>
                                <br>
                              </div>
                            </div>
                          </div>
                        </blockquote>
                      </div>
                      <br>
                      <br clear=3D"all">
                      <div><br>
                      </div>
                      -- <br>
                      <br>
                      - John </div>
                  </blockquote>
                  <br>
                </div>
              </div>
            </div>
          </blockquote>
        </div>
        <br>
      </div>
    </blockquote>
    <br>
  </div></div></div>

</blockquote></div><br></div>

--047d7b2e4d7c23b40204e4691a53--