From: Nate McCall <nate@thelastpickle.com>
To: user@cassandra.apache.org
Date: Tue, 20 Aug 2013 09:43:19 -0500
Subject: Re: insert performance (1.2.8)

Thanks for putting this up - sorry I missed your post the other week. I'd
be really curious to see your results if you added a prepared statement
for those inserts.
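For anyone following along, a minimal sketch of that change with the
2013-era DataStax Java driver might look like the following; the contact
point, keyspace, table, and column names are invented placeholders, not
taken from this thread:

    import com.datastax.driver.core.Cluster;
    import com.datastax.driver.core.PreparedStatement;
    import com.datastax.driver.core.Session;
    import com.datastax.driver.core.utils.UUIDs;

    public class PreparedInsert {
        public static void main(String[] args) {
            Cluster cluster = Cluster.builder()
                    .addContactPoint("127.0.0.1").build();
            Session session = cluster.connect("test_ks");

            // Prepare once at startup: the server parses the CQL a single
            // time, and each execution ships only the bound values.
            PreparedStatement insert = session.prepare(
                "INSERT INTO events (a, b, day, ts, x, y, z) " +
                "VALUES (?, ?, ?, ?, ?, ?, ?)");

            // Per row: bind values and execute (synchronously here).
            session.execute(insert.bind("a1", "b1", "2013-08-20",
                                        UUIDs.timeBased(), 1, 2, 3));
            cluster.shutdown();
        }
    }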
On Tue, Aug 20, 2013 at 9:14 AM, Przemek Maciolek <pmaciolek@gmail.com> wrote:

> I had similar issues (I sent a note to the list a few weeks ago but
> nobody responded). I think there's a serious bottleneck in using wide
> rows with composite keys. I made a trivial benchmark, which you can
> check here: http://pastebin.com/qAcRcqbF - it's written in cql-rb, but
> I ran the test with astyanax/cql3 enabled and the results were the same.
>
> In my case, inserting 10,000 entries took the following times (seconds):
>
> Using composite keys
> Separately: 12.892867
> Batch: 189.731306
>
> That means I got 1000 rows/s when inserting them separately and 52 (!!!)
> rows/s when inserting them in one huge batch.
>
> Using just a partition key and a wide row
> Separately: 11.292507
> Batch: 0.093355
>
> Again, 1000 rows/s when inserting them one by one. But here the batch
> obviously improves things, and I easily got >10,000 rows/s.
>
> Anyone else with similar experiences?
>
> Thanks,
> Przemek
>
> On Tue, Aug 20, 2013 at 4:04 PM, Nate McCall <nate@thelastpickle.com> wrote:
>
>> John makes a good point re: prepared statements (I'd also increase the
>> batch sizes again once you've done this - in separate, incremental
>> runs, of course, so you can gauge the effect of each change). That
>> should take out some of the overhead of statement validation on the
>> server (some of it - that load spike still seems high, though).
>>
>> I'd actually be really interested in your results after doing so - I've
>> not done any A/B testing of prepared statements for inserts.
>>
>> Given that your load is on the server, I'm not sure adding more async
>> indirection on the client would buy you much, though.
>>
>> Also, at what RF and consistency level are you writing?
>>
>> On Tue, Aug 20, 2013 at 8:56 AM, Keith Freeman <8forty@gmail.com> wrote:
>>
>>> Ok, I'll try prepared statements. But while sending my statements
>>> async might speed up my client, it wouldn't improve throughput on the
>>> cassandra nodes, would it? They're running at pretty high loads and
>>> only about 10% idle, so my concern is that they can't handle the data
>>> any faster, i.e. that something's wrong on the server side. I don't
>>> really think anything on the client side matters for this problem.
>>>
>>> Of course I know there are obvious h/w upgrades that would improve
>>> server performance: SSDs, more RAM, more cores, etc. But I thought the
>>> servers I have would handle more rows/sec than, say, MySQL, since
>>> write speed is supposed to be one of Cassandra's strengths.
>>>
>>> On 08/19/2013 09:03 PM, John Sanda wrote:
>>>
>>> I'd suggest using prepared statements that you initialize at
>>> application start-up, and switching to Session.executeAsync coupled
>>> with Google Guava's Futures API to get better throughput on the
>>> client side.
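For reference, John's suggestion might look roughly like the sketch below,
which continues the prepared-statement example near the top of this
message (same placeholder names; error handling is only hinted at).
ResultSetFuture implements Guava's ListenableFuture, so it plugs directly
into Futures.addCallback:

    import com.datastax.driver.core.ResultSet;
    import com.datastax.driver.core.ResultSetFuture;
    import com.google.common.util.concurrent.FutureCallback;
    import com.google.common.util.concurrent.Futures;

    // executeAsync() returns immediately instead of blocking the caller,
    // so one client thread can keep many requests in flight.
    ResultSetFuture future = session.executeAsync(
        insert.bind("a1", "b1", "2013-08-20", UUIDs.timeBased(), 1, 2, 3));

    Futures.addCallback(future, new FutureCallback<ResultSet>() {
        public void onSuccess(ResultSet rs) {
            // write acknowledged at the requested consistency level
        }
        public void onFailure(Throwable t) {
            t.printStackTrace();  // real code would log and/or retry
        }
    });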
>>> On Mon, Aug 19, 2013 at 10:14 PM, Keith Freeman <8forty@gmail.com> wrote:
>>>
>>>> Sure, I've tried different numbers of batches and threads, but
>>>> generally I'm running 10-30 threads at a time on the client, each
>>>> sending a batch of 100 insert statements in every call, using the
>>>> QueryBuilder.batch() API from the latest DataStax Java driver, then
>>>> calling the (synchronous) Session.execute() on the Batch.
>>>>
>>>> I can't post my code, but my client does this on each iteration:
>>>> -- divides the set of inserts up among the threads
>>>> -- stores the current time
>>>> -- tells all the threads to send their inserts
>>>> -- when they've all returned, checks the elapsed time
>>>>
>>>> At about 2000 rows per iteration, 20 threads with 100 inserts each
>>>> finish in about 1 second. For 4000 rows, 40 threads with 100 inserts
>>>> each finish in about 1.5-2 seconds, and as I said, all 3 cassandra
>>>> nodes are under heavy CPU load while the client is hardly loaded.
>>>> I've tried 10 threads with more inserts per batch, and up to 60
>>>> threads with fewer, and it doesn't seem to make much difference.
>>>>
>>>> On 08/19/2013 05:00 PM, Nate McCall wrote:
>>>>
>>>> How big are the batches? In other words, how many rows are you
>>>> sending per insert operation?
>>>>
>>>> Other than the above, there's not much else to suggest without seeing
>>>> some example code (on pastebin, gist, or similar, ideally).
>>>>
>>>> On Mon, Aug 19, 2013 at 5:49 PM, Keith Freeman <8forty@gmail.com> wrote:
>>>>
>>>>> I've got a 3-node cassandra cluster (16GB/4-core ESXi v5 VMs on
>>>>> 2.5GHz machines not shared with any other VMs). I'm inserting
>>>>> time-series data into a single column family using "wide rows"
>>>>> (timeuuids) with a 3-part partition key, so my primary key is
>>>>> something like ((a, b, day), in-time-uuid), x, y, z.
>>>>>
>>>>> My Java client is feeding rows (about 1k of raw data each) in
>>>>> batches using multiple threads, and the fastest I can get it to run
>>>>> reliably is about 2000 rows/second. Even at that speed, all 3
>>>>> cassandra nodes are very CPU-bound, with loads of 6-9 each (while
>>>>> the client machine is hardly breaking a sweat). I've tried turning
>>>>> off compression on my table, which reduced the loads slightly, but
>>>>> not much. There are no other updates or reads occurring, except from
>>>>> DataStax OpsCenter.
>>>>>
>>>>> I was expecting to be able to insert at least 10k rows/second with
>>>>> this configuration, and after a lot of reading of docs, blogs, and
>>>>> Google, I can't really figure out what's slowing my client down.
>>>>> When I push the insert rate of my client beyond 2000/second, the
>>>>> server responses are just too slow and the client falls behind. I
>>>>> had a single-node MySQL database that could handle 10k of these
>>>>> rows/second, so I really feel like I'm missing something in
>>>>> Cassandra. Any ideas?
>>>
>>> --
>>> - John
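For readers trying to reproduce the numbers above, here is one possible
reading of the schema Keith describes, next to the flat wide-row shape
Przemek's benchmark contrasts it with. The column names and types are
guesses, not taken from the thread, and the DDL is shown as it would be
issued through the same Java driver session as in the earlier sketches:

    // One reading of Keith's key ((a, b, day), in-time-uuid): a 3-part
    // composite partition key plus a timeuuid clustering column, so each
    // (a, b, day) combination is one wide row, with x, y, z as regular
    // columns.
    session.execute(
        "CREATE TABLE events (" +
        "  a text, b text, day text, ts timeuuid, x int, y int, z int," +
        "  PRIMARY KEY ((a, b, day), ts))");

    // The flat variant from Przemek's comparison: a single partition key
    // and a wide row clustered only by the timeuuid.
    session.execute(
        "CREATE TABLE events_flat (" +
        "  key text, ts timeuuid, val text," +
        "  PRIMARY KEY (key, ts))");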