Subject: Re: batch_size_warn_threshold_in_kb
From: Ryan Svihla <rsvihla@datastax.com>
To: user@cassandra.apache.org
Date: Sat, 13 Dec 2014 08:12:15 -0600

Are batches to the same partition key (which results in a single mutation,
and obviously eliminates the primary problem)? Is your client network
and/or CPU bound?
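For concreteness, a same-partition unlogged batch with the DataStax Java
Driver looks roughly like the sketch below. The keyspace, table, and column
types are illustrative assumptions (the (aid, bckt) partition key is
borrowed from Eric's tests later in this thread), not code from anyone's
actual setup:

    import com.datastax.driver.core.{BatchStatement, Cluster}

    object SamePartitionBatch extends App {
      // Illustrative connection details; not from this thread.
      val cluster = Cluster.builder().addContactPoint("127.0.0.1").build()
      val session = cluster.connect("demo_ks")

      val insert = session.prepare(
        "INSERT INTO events (aid, bckt, end, proto) VALUES (?, ?, ?, ?)")

      // Every statement shares the partition key (aid, bckt), so the
      // coordinator can apply the whole batch as a single mutation.
      val batch = new BatchStatement(BatchStatement.Type.UNLOGGED)
      (1 to 100).foreach { end =>
        batch.add(insert.bind("a1", Int.box(1), Int.box(end), "p1"))
      }
      session.execute(batch)
      cluster.close()
    }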
Remember, the coordinator node is _just_ doing what your client is doing
with executeAsync, only now it's dealing with the heap pressure of
compaction and flush writers, while your client is busy writing code.

Not trying to be argumentative, but I talk to the driver writers almost
daily, and I've moved a lot of customers off batches, and every single one
of them sped things up substantially. That experience, plus the theory,
leads me to believe there is a bottleneck on your client.

Final point: the more you grow your cluster, the more the cost of losing
token awareness for all writes in the batch grows.

On Sat, Dec 13, 2014 at 7:32 AM, Eric Stevens <mightye@gmail.com> wrote:

> Jon,
>
> > The really important thing to really take away from Ryan's original
> > post is that batches are not there for performance.
> > tl;dr: you probably don't want batch, you most likely want many async
> > calls
>
> My own rudimentary testing does not bear this out - at least not if you
> mean to say that batches don't offer a performance advantage (vs. this
> just being a happy side effect). Unlogged batches provide a substantial
> improvement on performance for burst writes in my findings.
>
> My test setup:
>
> - Amazon i2.8xl instances in 3 AZs using EC2Snitch
> - Cluster size of 3, RF=3
> - DataStax Java Driver, with token-aware routing, using Prepared
>   Statements vs. Unlogged Batches of Prepared Statements
> - Test client on a separate machine in the same AZ as one of the server
>   nodes
> - Data size: 50,000 records
> - Test runs: 25 (unique data generated before each run)
> - Data written to 5 tables, one table at a time (all 50,000 records go
>   to each table)
> - Timing begins when the first record is written to a table and ends
>   when the last async call completes for that table. Timing is measured
>   independently for each strategy, table, and run.
> - To eliminate bias, order between tables is randomized on each run, and
>   order between single vs. batched execution is randomized on each run.
> - Asynchronicity is tested using three different typical Scala
>   parallelism strategies (a sketch of these appears below):
>   - "traverse" = Futures.traverse(statements).map(_.executeAsync()) -
>     let the Futures system schedule the parallelism it thinks is
>     appropriate
>   - "scatter" = Futures.sequence(statements.map(_.executeAsync())) -
>     create as many async calls as possible at a time, then let the
>     Futures system gather together the results
>   - "parallel" = statements.par.map(_.execute()) - using a parallel
>     collection to initiate as many blocking calls as possible within
>     the default thread pool
> - I kept an eye on compaction throughout, and we never went above 2
>   pending compaction tasks
>
> I know this test is fairly contrived, but it's difficult to dismiss
> throughput differences of this magnitude over several million data
> points. Times are in nanos.
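To make those three strategies concrete, here is a minimal Scala sketch of
how they might map onto the DataStax Java Driver. The bridge from the
driver's Guava ListenableFuture to a Scala Future and the session/statement
names are illustrative assumptions; Eric's actual harness is not shown in
the thread:

    import com.datastax.driver.core.{BoundStatement, ResultSet, Session}
    import com.google.common.util.concurrent.{FutureCallback, Futures, ListenableFuture}
    import scala.concurrent.ExecutionContext.Implicits.global
    import scala.concurrent.{Future, Promise}

    object AsyncStrategies {
      // Bridge the driver's Guava ListenableFuture into a Scala Future.
      def toScala[T](lf: ListenableFuture[T]): Future[T] = {
        val p = Promise[T]()
        Futures.addCallback(lf, new FutureCallback[T] {
          def onSuccess(result: T): Unit = p.success(result)
          def onFailure(t: Throwable): Unit = p.failure(t)
        })
        p.future
      }

      // "scatter": fire every executeAsync up front, then gather results.
      def scatter(session: Session, stmts: Seq[BoundStatement]): Future[Seq[ResultSet]] =
        Future.sequence(stmts.map(s => toScala(session.executeAsync(s))))

      // "traverse": let Future.traverse schedule the async calls.
      def traverse(session: Session, stmts: Seq[BoundStatement]): Future[Seq[ResultSet]] =
        Future.traverse(stmts)(s => toScala(session.executeAsync(s)))

      // "parallel": blocking execute() calls on a parallel collection,
      // bounded by the default fork/join pool.
      def parallel(session: Session, stmts: Seq[BoundStatement]): Seq[ResultSet] =
        stmts.par.map(session.execute(_)).seq
    }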
>
> ==== Execution Results for 25 runs of 50000 records =============
> 25 runs of 50,000 records (3 protos, 5 agents, ~15 per bucket) as single
> statements using strategy scatter
> Total Run Time
>         test3 ((aid, bckt), end, proto) reverse order        = 51,391,100,107
>         test1 ((aid, bckt), proto, end) reverse order        = 52,206,907,605
>         test4 ((aid, bckt), proto, end) no explicit ordering = 53,903,886,095
>         test2 ((aid, bckt), end)                             = 54,613,620,320
>         test5 ((aid, bckt, end))                             = 55,820,739,557
>
> ==== Execution Results for 25 runs of 50000 records =============
> 25 runs of 50,000 records (3 protos, 5 agents, ~15 per bucket) in batches
> of 100 using strategy scatter
> Total Run Time
>         test3 ((aid, bckt), end, proto) reverse order        = 9,199,579,182
>         test4 ((aid, bckt), proto, end) no explicit ordering = 11,661,638,491
>         test2 ((aid, bckt), end)                             = 12,059,853,548
>         test1 ((aid, bckt), proto, end) reverse order        = 12,957,113,345
>         test5 ((aid, bckt, end))                             = 31,166,071,275
>
> ==== Execution Results for 25 runs of 50000 records =============
> 25 runs of 50,000 records (3 protos, 5 agents, ~15 per bucket) as single
> statements using strategy traverse
> Total Run Time
>         test1 ((aid, bckt), proto, end) reverse order        = 52,368,815,408
>         test2 ((aid, bckt), end)                             = 52,676,830,110
>         test4 ((aid, bckt), proto, end) no explicit ordering = 54,096,838,258
>         test5 ((aid, bckt, end))                             = 54,657,464,976
>         test3 ((aid, bckt), end, proto) reverse order        = 55,668,202,827
>
> ==== Execution Results for 25 runs of 50000 records =============
> 25 runs of 50,000 records (3 protos, 5 agents, ~15 per bucket) in batches
> of 100 using strategy traverse
> Total Run Time
>         test3 ((aid, bckt), end, proto) reverse order        = 9,633,141,094
>         test4 ((aid, bckt), proto, end) no explicit ordering = 12,519,381,544
>         test2 ((aid, bckt), end)                             = 12,653,843,637
>         test1 ((aid, bckt), proto, end) reverse order        = 17,644,182,274
>         test5 ((aid, bckt, end))                             = 27,902,501,534
>
> ==== Execution Results for 25 runs of 50000 records =============
> 25 runs of 50,000 records (3 protos, 5 agents, ~15 per bucket) as single
> statements using strategy parallel
> Total Run Time
>         test1 ((aid, bckt), proto, end) reverse order        = 360,523,086,443
>         test3 ((aid, bckt), end, proto) reverse order        = 364,375,212,413
>         test4 ((aid, bckt), proto, end) no explicit ordering = 370,989,615,452
>         test2 ((aid, bckt), end)                             = 378,368,728,469
>         test5 ((aid, bckt, end))                             = 380,737,675,612
>
> ==== Execution Results for 25 runs of 50000 records =============
> 25 runs of 50,000 records (3 protos, 5 agents, ~15 per bucket) in batches
> of 100 using strategy parallel
> Total Run Time
>         test3 ((aid, bckt), end, proto) reverse order        = 20,971,045,814
>         test1 ((aid, bckt), proto, end) reverse order        = 21,379,583,690
>         test4 ((aid, bckt), proto, end) no explicit ordering = 21,505,965,087
>         test2 ((aid, bckt), end)                             = 24,433,580,144
>         test5 ((aid, bckt, end))                             = 37,346,062,553
>
>
> On Fri Dec 12 2014 at 11:00:12 AM Jonathan Haddad <jon@jonhaddad.com>
> wrote:
>
>> The really important thing to really take away from Ryan's original post
>> is that batches are not there for performance.
The only case I consider
>> batches to be useful for is when you absolutely need to know that several
>> tables all get a mutation (via logged batches). The use case for this is
>> when you've got multiple tables that are serving as different views for
>> the data. It is absolutely not going to help you if you're trying to lump
>> queries together to reduce network & server overhead - in fact it'll do
>> the opposite. If you're trying to do that, instead perform many async
>> queries. The overhead of batches in Cassandra is significant and you're
>> going to hit a lot of problems if you use them excessively (timeouts /
>> failures).
>>
>> tl;dr: you probably don't want batch, you most likely want many async
>> calls
>>
>>
>> On Thu Dec 11 2014 at 11:15:00 PM Mohammed Guller <mohammed@glassbeam.com>
>> wrote:
>>
>>> Ryan,
>>>
>>> Thanks for the quick response.
>>>
>>> I did see that jira before posting my question on this list. However, I
>>> didn't see any information about why 5kb+ data will cause instability.
>>> 5kb or even 50kb seems too small. For example, if each mutation is
>>> 1000+ bytes, then with just 5 mutations, you will hit that threshold.
>>>
>>> In addition, Patrick is saying that he does not recommend more than 100
>>> mutations per batch. So why not warn users just on the # of mutations
>>> in a batch?
>>>
>>> Mohammed
>>>
>>> *From:* Ryan Svihla [mailto:rsvihla@datastax.com]
>>> *Sent:* Thursday, December 11, 2014 12:56 PM
>>> *To:* user@cassandra.apache.org
>>> *Subject:* Re: batch_size_warn_threshold_in_kb
>>>
>>> Nothing magic, just put in there based on experience. You can find the
>>> story behind the original recommendation here:
>>>
>>> https://issues.apache.org/jira/browse/CASSANDRA-6487
>>>
>>> Key reasoning for the desire comes from Patrick McFadin:
>>>
>>> "Yes that was in bytes. Just in my own experience, I don't recommend
>>> more than ~100 mutations per batch. Doing some quick math I came up
>>> with 5k as 100 x 50 byte mutations.
>>>
>>> Totally up for debate."
>>>
>>> It's totally changeable; however, it's there in no small part because
>>> so many people confuse the BATCH keyword as a performance optimization.
>>> This helps flag those cases of misuse.
>>>
>>> On Thu, Dec 11, 2014 at 2:43 PM, Mohammed Guller <mohammed@glassbeam.com>
>>> wrote:
>>>
>>> Hi –
>>>
>>> The cassandra.yaml file has a property called
>>> *batch_size_warn_threshold_in_kb*.
>>>
>>> The default size is 5kb and, according to the comments in the yaml
>>> file, it is used to log a WARN on any batch size exceeding this value
>>> in kilobytes. It says caution should be taken on increasing the size
>>> of this threshold as it can lead to node instability.
>>>
>>> Does anybody know the significance of this magic number 5kb? Why would
>>> a higher number (say 10kb) lead to node instability?
>>>
>>> Mohammed
>>>
>>> --
>>> Ryan Svihla
>>> Solution Architect, DataStax

--
Ryan Svihla
Solution Architect
DataStax

DataStax is the fastest, most scalable distributed database technology,
delivering Apache Cassandra to the world's most innovative enterprises.
DataStax is built to be agile, always-on, and predictably scalable to any
size. With more than 500 customers in 45 countries, DataStax is the
database technology and transactional backbone of choice for the world's
most innovative companies such as Netflix, Adobe, Intuit, and eBay.
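For anyone who finds this thread later, the setting under discussion lives
in cassandra.yaml; a minimal excerpt is shown below with the 5 KB default
the thread refers to (the comment wording is a paraphrase of the discussion
above, not the file's exact text):

    # Log a WARN on any batch exceeding this size in kilobytes.
    # Raising it is possible, but it exists largely to flag misuse of
    # BATCH as a performance optimization, which can destabilize nodes.
    batch_size_warn_threshold_in_kb: 5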