Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: error (athena.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <521509B9.5040702@gmail.com>
References: <5212A108.9050804@gmail.com>
	<CAKmMYa_tgGc=BWtO0cyBphcnUFaocPd-dGxvndENrCCEtRLjTQ@mail.gmail.com>
	<5212D106.4050700@gmail.com>
	<CA+BDQ7xudiieQDgs5dith23=X-1JO0ETBc6ZHniu4=7eh+cjjQ@mail.gmail.com>
	<5213757D.8030106@gmail.com>
	<CAKmMYa-WvRghtd=L0NsK4yUf7hd6Nbrfq6tYgDPO02AeyzQfWA@mail.gmail.com>
	<5213E718.7030203@gmail.com>
	<CAKmMYa94E6d2+qFyPfWe81m82J=E8UEtDJ8o-cTYf34-HFgNnw@mail.gmail.com>
	<521411CE.4070302@gmail.com>
	<CAKmMYa9iT4FtoWOBD3bOghBVkya64C=KeB8HtKqZWT0oXCRM-Q@mail.gmail.com>
	<521509B9.5040702@gmail.com>
Date: Wed, 21 Aug 2013 15:16:22 -0500
Message-ID: 
 <CAKmMYa-uK9eb5LABVieFrGeOcxw3eSdO6+Yi=vy-SBTkA9qfCg@mail.gmail.com>
Subject: Re: insert performance (1.2.8)
From: Nate McCall <nate@thelastpickle.com>
To: Cassandra Users <user@cassandra.apache.org>
Content-Type: multipart/alternative; boundary=047d7b621ef2dd95b904e47ad8cb

--047d7b621ef2dd95b904e47ad8cb
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

The only thing I can think to suggest at this point is upping that batch
size - say to 500 and see what happens.

Do you have any monitoring on this cluster? If not, what do you see as the
output of 'nodetool tpstats' while you run this test?


On Wed, Aug 21, 2013 at 1:40 PM, Keith Freeman <8forty@gmail.com> wrote:

>  Building the giant batch string wasn't as bad as I thought, and at first
> I had great(!) results (using "unlogged" batches): 2500 rows/sec (batches
> of 100 in 48 threads) ran very smoothly, and the load on the cassandra
> server nodes averaged about 1.0 or less continuously.
>
> But then I upped it to 5000 rows/sec, and the load on the server nodes
> jumped to a continuous load on all 3 of 8-10 with peaks over 14.  I also
> tried running 2 separate clients at 2500 rows/sec with the same results. =
 I
> don't see any compactions while at this load, so would this likely be the
> result of GC thrashing?
>
> Seems like I'm spending a lot of effort and am still not getting very
> close to being able to insert 10k rows (10M of data each) per second, whi=
ch
> is pretty disappointing.
>
>
> On 08/20/2013 07:16 PM, Nate McCall wrote:
>
> Thrift will allow for more large, free-form batch contstruction. The
> increase will be doing a lot more in the same payload message. Otherwise
> CQL is more efficient.
>
>  If you do build those giant string, yes you should see a performance
> improvement.
>
>
> On Tue, Aug 20, 2013 at 8:03 PM, Keith Freeman <8forty@gmail.com> wrote:
>
>>  Thanks.  Can you tell me why would using thrift would improve
>> performance?
>>
>> Also, if I do try to build those giant strings for a prepared batch
>> statement, should I expect another performance improvement?
>>
>>
>>
>> On 08/20/2013 05:06 PM, Nate McCall wrote:
>>
>> Ugh - sorry, I knew Sylvain and Micha=EBl had worked on this recently bu=
t
>> it is only in 2.0 - I could have sworn it got marked for inclusion back
>> into 1.2 but I was wrong:
>> https://issues.apache.org/jira/browse/CASSANDRA-4693
>>
>>  This is indeed an issue if you don't know the column count before hand
>> (or had a very large number of them like in your case). Again, apologies=
, I
>> would not have recommended that route if I knew it was only in 2.0.
>>
>>  I would be willing to bet you could hit those insert numbers pretty
>> easily with thrift given the shape of your mutation.
>>
>>
>> On Tue, Aug 20, 2013 at 5:00 PM, Keith Freeman <8forty@gmail.com> wrote:
>>
>>>  So I tried inserting prepared statements separately (no batch), and my
>>> server nodes load definitely dropped significantly.  Throughput from my
>>> client improved a bit, but only a few %.  I was able to *almost* get 50=
00
>>> rows/sec (sort of) by also reducing the rows/insert-thread to 20-50 and
>>> eliminating all overhead from the timing, i.e. timing only the tight fo=
r
>>> loop of inserts.  But that's still a lot slower than I expected.
>>>
>>> I couldn't do batches because the driver doesn't allow prepared
>>> statements in a batch (QueryBuilder API).  It appears the batch itself
>>> could possibly be a prepared statement, but since I have 40+ columns on
>>> each insert that would take some ugly code to build so I haven't tried =
it
>>> yet.
>>>
>>> I'm using CL "ONE" on the inserts and RF 2 in my schema.
>>>
>>>
>>> On 08/20/2013 08:04 AM, Nate McCall wrote:
>>>
>>> John makes a good point re:prepared statements (I'd increase batch size=
s
>>> again once you did this as well - separate, incremental runs of course =
so
>>> you can gauge the effect of each). That should take out some of the
>>> processing overhead of statement validation in the server (some - that =
load
>>> spike still seems high though).
>>>
>>>  I'd actually be really interested as to what your results were after
>>> doing so - i've not tried any A/B testing here for prepared statements =
on
>>> inserts.
>>>
>>>  Given your load is on the server, i'm not sure adding more async
>>> indirection on the client would buy you too much though.
>>>
>>>  Also, at what RF and consistency level are you writing?
>>>
>>>
>>> On Tue, Aug 20, 2013 at 8:56 AM, Keith Freeman <8forty@gmail.com> wrote=
:
>>>
>>>>  Ok, I'll try prepared statements.   But while sending my statements
>>>> async might speed up my client, it wouldn't improve throughput on the
>>>> cassandra nodes would it?  They're running at pretty high loads and on=
ly
>>>> about 10% idle, so my concern is that they can't handle the data any
>>>> faster, so something's wrong on the server side.  I don't really think
>>>> there's anything on the client side that matters for this problem.
>>>>
>>>> Of course I know there are obvious h/w things I can do to improve
>>>> server performance: SSDs, more RAM, more cores, etc.  But I thought th=
e
>>>> servers I have would be able to handle more rows/sec than say Mysql, s=
ince
>>>> write speed is supposed to be one of Cassandra's strengths.
>>>>
>>>>
>>>> On 08/19/2013 09:03 PM, John Sanda wrote:
>>>>
>>>> I'd suggest using prepared statements that you initialize at
>>>> application start up and switching to use Session.executeAsync coupled=
 with
>>>> Google Guava Futures API to get better throughput on the client side.
>>>>
>>>>
>>>> On Mon, Aug 19, 2013 at 10:14 PM, Keith Freeman <8forty@gmail.com>wrot=
e:
>>>>
>>>>>  Sure, I've tried different numbers for batches and threads, but
>>>>> generally I'm running 10-30 threads at a time on the client, each sen=
ding a
>>>>> batch of 100 insert statements in every call, using the
>>>>> QueryBuilder.batch() API from the latest datastax java driver, then c=
alling
>>>>> the Session.execute() function (synchronous) on the Batch.
>>>>>
>>>>> I can't post my code, but my client does this on each iteration:
>>>>> -- divides up the set of inserts by the number of threads
>>>>> -- stores the current time
>>>>> -- tells all the threads to send their inserts
>>>>> -- then when they've all returned checks the elapsed time
>>>>>
>>>>> At about 2000 rows for each iteration, 20 threads with 100 inserts
>>>>> each finish in about 1 second.  For 4000 rows, 40 threads with 100 in=
serts
>>>>> each finish in about 1.5 - 2 seconds, and as I said all 3 cassandra n=
odes
>>>>> have a heavy CPU load while the client is hardly loaded.  I've tried =
with
>>>>> 10 threads and more inserts per batch, or up to 60 threads with fewer=
,
>>>>> doesn't seem to make a lot of difference.
>>>>>
>>>>>
>>>>> On 08/19/2013 05:00 PM, Nate McCall wrote:
>>>>>
>>>>>  How big are the batch sizes? In other words, how many rows are you
>>>>> sending per insert operation?
>>>>>
>>>>>  Other than the above, not much else to suggest without seeing some
>>>>> example code (on pastebin, gist or similar, ideally).
>>>>>
>>>>> On Mon, Aug 19, 2013 at 5:49 PM, Keith Freeman <8forty@gmail.com>wrot=
e:
>>>>>
>>>>>> I've got a 3-node cassandra cluster (16G/4-core VMs ESXi v5 on 2.5Gh=
z
>>>>>> machines not shared with any other VMs).  I'm inserting time-series =
data
>>>>>> into a single column-family using "wide rows" (timeuuids) and have a=
 3-part
>>>>>> partition key so my primary key is something like ((a, b, day),
>>>>>> in-time-uuid), x, y, z).
>>>>>>
>>>>>> My java client is feeding rows (about 1k of raw data size each) in
>>>>>> batches using multiple threads, and the fastest I can get it run rel=
iably
>>>>>> is about 2000 rows/second.  Even at that speed, all 3 cassandra node=
s are
>>>>>> very CPU bound, with loads of 6-9 each (and the client machine is ha=
rdly
>>>>>> breaking a sweat).  I've tried turning off compression in my table w=
hich
>>>>>> reduced the loads slightly but not much.  There are no other updates=
 or
>>>>>> reads occurring, except the datastax opscenter.
>>>>>>
>>>>>> I was expecting to be able to insert at least 10k rows/second with
>>>>>> this configuration, and after a lot of reading of docs, blogs, and g=
oogle,
>>>>>> can't really figure out what's slowing my client down.  When I incre=
ase the
>>>>>> insert speed of my client beyond 2000/second, the server responses a=
re just
>>>>>> too slow and the client falls behind.  I had a single-node Mysql dat=
abase
>>>>>> that can handle 10k of these data rows/second, so I really feel like=
 I'm
>>>>>> missing something in Cassandra.  Any ideas?
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>>  --
>>>>
>>>> - John
>>>>
>>>>
>>>>
>>>
>>>
>>
>>
>
>

--047d7b621ef2dd95b904e47ad8cb
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">The only thing I can think to suggest at this point is upp=
ing that batch size - say to 500 and see what happens.=A0<div><br></div><di=
v>Do you have any monitoring on this cluster? If not, what do you see as th=
e output of &#39;nodetool tpstats&#39; while you run this test?</div>
</div><div class=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Wed,=
 Aug 21, 2013 at 1:40 PM, Keith Freeman <span dir=3D"ltr">&lt;<a href=3D"ma=
ilto:8forty@gmail.com" target=3D"_blank">8forty@gmail.com</a>&gt;</span> wr=
ote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">
 =20
   =20
 =20
  <div text=3D"#000000" bgcolor=3D"#FFFFFF">
    Building the giant batch string wasn&#39;t as bad as I thought, and at
    first I had great(!) results (using &quot;unlogged&quot; batches): 2500
    rows/sec (batches of 100 in 48 threads) ran very smoothly, and the
    load on the cassandra server nodes averaged about 1.0 or less
    continuously.<br>
    <br>
    But then I upped it to 5000 rows/sec, and the load on the server
    nodes jumped to a continuous load on all 3 of 8-10 with peaks over
    14.=A0 I also tried running 2 separate clients at 2500 rows/sec with
    the same results.=A0 I don&#39;t see any compactions while at this load=
,
    so would this likely be the result of GC thrashing?<br>
    <br>
    Seems like I&#39;m spending a lot of effort and am still not getting
    very close to being able to insert 10k rows (10M of data each) per
    second, which is pretty disappointing.<div><div class=3D"h5"><br>
    <br>
    <div>On 08/20/2013 07:16 PM, Nate McCall
      wrote:<br>
    </div>
    <blockquote type=3D"cite">
      <div dir=3D"ltr">Thrift will allow for more large, free-form batch
        contstruction. The increase will be doing a lot more in the same
        payload message. Otherwise CQL is more efficient.=A0
        <div><br>
        </div>
        <div>If you do build those giant string, yes you should see a
          performance improvement.=A0</div>
      </div>
      <div class=3D"gmail_extra"><br>
        <br>
        <div class=3D"gmail_quote">On Tue, Aug 20, 2013 at 8:03 PM, Keith
          Freeman <span dir=3D"ltr">&lt;<a href=3D"mailto:8forty@gmail.com"=
 target=3D"_blank">8forty@gmail.com</a>&gt;</span>
          wrote:<br>
          <blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;bord=
er-left:1px #ccc solid;padding-left:1ex">
            <div bgcolor=3D"#FFFFFF" text=3D"#000000"> Thanks.=A0 Can you t=
ell
              me why would using thrift would improve performance?<br>
              <br>
              Also, if I do try to build those giant strings for a
              prepared batch statement, should I expect another
              performance improvement?
              <div>
                <div><br>
                  <br>
                  <br>
                  <div>On 08/20/2013 05:06 PM, Nate McCall wrote:<br>
                  </div>
                  <blockquote type=3D"cite">
                    <div dir=3D"ltr">Ugh - sorry, I knew Sylvain and=A0<spa=
n style=3D"line-height:17px;text-align:right;font-size:13px;font-family:Ari=
al,FreeSans,Helvetica,sans-serif">Micha=EBl

                        had worked on this recently but it is only in
                        2.0 - I could have sworn it got marked for=A0</span=
>inclusion
                      back into 1.2 but I was wrong:
                      <div> <a href=3D"https://issues.apache.org/jira/brows=
e/CASSANDRA-4693" target=3D"_blank">https://issues.apache.org/jira/browse/C=
ASSANDRA-4693</a><br>
                      </div>
                      <div><br>
                      </div>
                      <div>This is indeed an issue if you don&#39;t know th=
e
                        column count before hand (or had a very large
                        number of them like in your case). Again,
                        apologies, I would not have recommended that
                        route if I knew it was only in 2.0.=A0</div>
                      <div><br>
                      </div>
                      <div>I would be willing to bet you could hit those
                        insert numbers pretty easily with thrift given
                        the shape of your mutation.=A0</div>
                    </div>
                    <div class=3D"gmail_extra"><br>
                      <br>
                      <div class=3D"gmail_quote">On Tue, Aug 20, 2013 at
                        5:00 PM, Keith Freeman <span dir=3D"ltr">&lt;<a hre=
f=3D"mailto:8forty@gmail.com" target=3D"_blank">8forty@gmail.com</a>&gt;</s=
pan>
                        wrote:<br>
                        <blockquote class=3D"gmail_quote" style=3D"margin:0=
 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
                          <div text=3D"#000000" bgcolor=3D"#FFFFFF"> So I
                            tried inserting prepared statements
                            separately (no batch), and my server nodes
                            load definitely dropped significantly.=A0
                            Throughput from my client improved a bit,
                            but only a few %.=A0 I was able to *almost*
                            get 5000 rows/sec (sort of) by also reducing
                            the rows/insert-thread to 20-50 and
                            eliminating all overhead from the timing,
                            i.e. timing only the tight for loop of
                            inserts.=A0 But that&#39;s still a lot slower t=
han
                            I expected.<br>
                            <br>
                            I couldn&#39;t do batches because the driver
                            doesn&#39;t allow prepared statements in a batc=
h
                            (QueryBuilder API).=A0 It appears the batch
                            itself could possibly be a prepared
                            statement, but since I have 40+ columns on
                            each insert that would take some ugly code
                            to build so I haven&#39;t tried it yet.<br>
                            <br>
                            I&#39;m using CL &quot;ONE&quot; on the inserts=
 and RF 2
                            in my schema.
                            <div>
                              <div><br>
                                <br>
                                <div>On 08/20/2013 08:04 AM, Nate McCall
                                  wrote:<br>
                                </div>
                                <blockquote type=3D"cite">
                                  <div dir=3D"ltr">John makes a good point
                                    re:prepared statements (I&#39;d increas=
e
                                    batch sizes again once you did this
                                    as well - separate, incremental runs
                                    of course so you can gauge the
                                    effect of each). That should take
                                    out some of the processing overhead
                                    of statement validation in the
                                    server (some - that load spike still
                                    seems high though).=A0
                                    <div> <br>
                                    </div>
                                    <div>I&#39;d actually be really
                                      interested as to what your results
                                      were after doing so - i&#39;ve not
                                      tried any A/B testing here for
                                      prepared statements on inserts.=A0</d=
iv>
                                    <div><br>
                                    </div>
                                    <div>Given your load is on the
                                      server, i&#39;m not sure adding more
                                      async indirection on the client
                                      would buy you too much though.=A0</di=
v>
                                    <div><br>
                                    </div>
                                    <div>Also, at what RF and
                                      consistency level are you writing?</d=
iv>
                                  </div>
                                  <div class=3D"gmail_extra"><br>
                                    <br>
                                    <div class=3D"gmail_quote">On Tue, Aug
                                      20, 2013 at 8:56 AM, Keith Freeman
                                      <span dir=3D"ltr">&lt;<a href=3D"mail=
to:8forty@gmail.com" target=3D"_blank">8forty@gmail.com</a>&gt;</span>
                                      wrote:<br>
                                      <blockquote class=3D"gmail_quote" sty=
le=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
                                        <div text=3D"#000000" bgcolor=3D"#F=
FFFFF"> Ok, I&#39;ll
                                          try prepared statements.=A0=A0 Bu=
t
                                          while sending my statements
                                          async might speed up my
                                          client, it wouldn&#39;t improve
                                          throughput on the cassandra
                                          nodes would it?=A0 They&#39;re
                                          running at pretty high loads
                                          and only about 10% idle, so my
                                          concern is that they can&#39;t
                                          handle the data any faster, so
                                          something&#39;s wrong on the
                                          server side.=A0 I don&#39;t reall=
y
                                          think there&#39;s anything on the
                                          client side that matters for
                                          this problem.<br>
                                          <br>
                                          Of course I know there are
                                          obvious h/w things I can do to
                                          improve server performance:
                                          SSDs, more RAM, more cores,
                                          etc.=A0 But I thought the
                                          servers I have would be able
                                          to handle more rows/sec than
                                          say Mysql, since write speed
                                          is supposed to be one of
                                          Cassandra&#39;s strengths.
                                          <div>
                                            <div><br>
                                              <br>
                                              <div>On 08/19/2013 09:03
                                                PM, John Sanda wrote:<br>
                                              </div>
                                              <blockquote type=3D"cite">
                                                <div dir=3D"ltr">I&#39;d
                                                  suggest using prepared
                                                  statements that you
                                                  initialize at
                                                  application start up
                                                  and switching to use
                                                  Session.executeAsync
                                                  coupled with Google
                                                  Guava Futures API to
                                                  get better throughput
                                                  on the client side.</div>
                                                <div class=3D"gmail_extra">=
<br>
                                                  <br>
                                                  <div class=3D"gmail_quote=
">On
                                                    Mon, Aug 19, 2013 at
                                                    10:14 PM, Keith
                                                    Freeman <span dir=3D"lt=
r">&lt;<a href=3D"mailto:8forty@gmail.com" target=3D"_blank">8forty@gmail.c=
om</a>&gt;</span>
                                                    wrote:<br>
                                                    <blockquote class=3D"gm=
ail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-le=
ft:1ex">
                                                      <div bgcolor=3D"#FFFF=
FF" text=3D"#000000">
                                                        Sure, I&#39;ve trie=
d
                                                        different
                                                        numbers for
                                                        batches and
                                                        threads, but
                                                        generally I&#39;m
                                                        running 10-30
                                                        threads at a
                                                        time on the
                                                        client, each
                                                        sending a batch
                                                        of 100 insert
                                                        statements in
                                                        every call,
                                                        using the
                                                        QueryBuilder.batch(=
)
                                                        API from the
                                                        latest datastax
                                                        java driver,
                                                        then calling the
                                                        Session.execute()
                                                        function
                                                        (synchronous) on
                                                        the Batch.<br>
                                                        <br>
                                                        I can&#39;t post my
                                                        code, but my
                                                        client does this
                                                        on each
                                                        iteration:<br>
                                                        -- divides up
                                                        the set of
                                                        inserts by the
                                                        number of
                                                        threads<br>
                                                        -- stores the
                                                        current time<br>
                                                        -- tells all the
                                                        threads to send
                                                        their inserts<br>
                                                        -- then when
                                                        they&#39;ve all
                                                        returned checks
                                                        the elapsed time<br=
>
                                                        <br>
                                                        At about 2000
                                                        rows for each
                                                        iteration, 20
                                                        threads with 100
                                                        inserts each
                                                        finish in about
                                                        1 second.=A0 For
                                                        4000 rows, 40
                                                        threads with 100
                                                        inserts each
                                                        finish in about
                                                        1.5 - 2 seconds,
                                                        and as I said
                                                        all 3 cassandra
                                                        nodes have a
                                                        heavy CPU load
                                                        while the client
                                                        is hardly
                                                        loaded.=A0 I&#39;ve
                                                        tried with 10
                                                        threads and more
                                                        inserts per
                                                        batch, or up to
                                                        60 threads with
                                                        fewer, doesn&#39;t
                                                        seem to make a
                                                        lot of
                                                        difference.
                                                        <div>
                                                          <div><br>
                                                          <br>
                                                          <div>On
                                                          08/19/2013
                                                          05:00 PM, Nate
                                                          McCall wrote:<br>
                                                          </div>
                                                          <blockquote type=
=3D"cite">
                                                          <div dir=3D"ltr">
                                                          <div class=3D"gma=
il_extra">How

                                                          big are the
                                                          batch sizes?
                                                          In other
                                                          words, how
                                                          many rows are
                                                          you sending
                                                          per insert
                                                          operation?</div>
                                                          <div class=3D"gma=
il_extra"><br>
                                                          </div>
                                                          <div class=3D"gma=
il_extra">Other

                                                          than the
                                                          above, not
                                                          much else to
                                                          suggest
                                                          without seeing
                                                          some example
                                                          code (on
                                                          pastebin, gist
                                                          or similar,
                                                          ideally).=A0<br>
                                                          <br>
                                                          <div class=3D"gma=
il_quote">On

                                                          Mon, Aug 19,
                                                          2013 at 5:49
                                                          PM, Keith
                                                          Freeman <span dir=
=3D"ltr">&lt;<a href=3D"mailto:8forty@gmail.com" target=3D"_blank">8forty@g=
mail.com</a>&gt;</span>
                                                          wrote:<br>
                                                          <blockquote class=
=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padd=
ing-left:1ex">
                                                          I&#39;ve got a
                                                          3-node
                                                          cassandra
                                                          cluster
                                                          (16G/4-core
                                                          VMs ESXi v5 on
                                                          2.5Ghz
                                                          machines not
                                                          shared with
                                                          any other
                                                          VMs). =A0I&#39;m
                                                          inserting
                                                          time-series
                                                          data into a
                                                          single
                                                          column-family
                                                          using &quot;wide
                                                          rows&quot;
                                                          (timeuuids)
                                                          and have a
                                                          3-part
                                                          partition key
                                                          so my primary
                                                          key is
                                                          something like
                                                          ((a, b, day),
                                                          in-time-uuid),
                                                          x, y, z).<br>
                                                          <br>
                                                          My java client
                                                          is feeding
                                                          rows (about 1k
                                                          of raw data
                                                          size each) in
                                                          batches using
                                                          multiple
                                                          threads, and
                                                          the fastest I
                                                          can get it run
                                                          reliably is
                                                          about 2000
                                                          rows/second.
                                                          =A0Even at that
                                                          speed, all 3
                                                          cassandra
                                                          nodes are very
                                                          CPU bound,
                                                          with loads of
                                                          6-9 each (and
                                                          the client
                                                          machine is
                                                          hardly
                                                          breaking a
                                                          sweat). =A0I&#39;=
ve
                                                          tried turning
                                                          off
                                                          compression in
                                                          my table which
                                                          reduced the
                                                          loads slightly
                                                          but not much.
                                                          =A0There are no
                                                          other updates
                                                          or reads
                                                          occurring,
                                                          except the
                                                          datastax
                                                          opscenter.<br>
                                                          <br>
                                                          I was
                                                          expecting to
                                                          be able to
                                                          insert at
                                                          least 10k
                                                          rows/second
                                                          with this
                                                          configuration,
                                                          and after a
                                                          lot of reading
                                                          of docs,
                                                          blogs, and
                                                          google, can&#39;t
                                                          really figure
                                                          out what&#39;s
                                                          slowing my
                                                          client down.
                                                          =A0When I
                                                          increase the
                                                          insert speed
                                                          of my client
                                                          beyond
                                                          2000/second,
                                                          the server
                                                          responses are
                                                          just too slow
                                                          and the client
                                                          falls behind.
                                                          =A0I had a
                                                          single-node
                                                          Mysql database
                                                          that can
                                                          handle 10k of
                                                          these data
                                                          rows/second,
                                                          so I really
                                                          feel like I&#39;m
                                                          missing
                                                          something in
                                                          Cassandra.
                                                          =A0Any ideas?<br>
                                                          <br>
                                                          </blockquote>
                                                          </div>
                                                          <br>
                                                          </div>
                                                          </div>
                                                          </blockquote>
                                                          <br>
                                                          </div>
                                                        </div>
                                                      </div>
                                                    </blockquote>
                                                  </div>
                                                  <br>
                                                  <br clear=3D"all">
                                                  <div><br>
                                                  </div>
                                                  -- <br>
                                                  <br>
                                                  - John </div>
                                              </blockquote>
                                              <br>
                                            </div>
                                          </div>
                                        </div>
                                      </blockquote>
                                    </div>
                                    <br>
                                  </div>
                                </blockquote>
                                <br>
                              </div>
                            </div>
                          </div>
                        </blockquote>
                      </div>
                      <br>
                    </div>
                  </blockquote>
                  <br>
                </div>
              </div>
            </div>
          </blockquote>
        </div>
        <br>
      </div>
    </blockquote>
    <br>
  </div></div></div>

</blockquote></div><br></div>

--047d7b621ef2dd95b904e47ad8cb--