Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Message-ID: 
 <1449594547.24150.YahooMailAndroidMobile@web192903.mail.sg3.yahoo.com>
Date: Wed, 9 Dec 2015 01:09:07 +0800
From: Anuj Wadehra <anujw_2003@yahoo.co.in>
Subject: Re: Re: Re: Cassandra Tuning Issue
To: "user@cassandra.apache.org" <user@cassandra.apache.org>,
  "user@cassandra.apache.org" <user@cassandra.apache.org>
In-Reply-To: 
 <CAOxAL61nddkpZ9JrZ18PukaoeQkcEyXn=UwK3DwHRc3UpL3iZA@mail.gmail.com>
MIME-Version: 1.0
Content-Type: multipart/alternative;
 boundary="-1985756734-1664357670-1449594547=:24150"

---1985756734-1664357670-1449594547=:24150
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable

Hi Jerry,

It's great that you got a performance improvement. I also agree with what Graham said. I think you are using an extremely large heap with CMS, and in a very odd ratio: 40G for new gen, leaving only 20G for old gen, seems unreasonable. It's hard to believe that you are getting reasonable GC pauses. Please recheck. I would suggest testing your performance with a much smaller heap, maybe 16G max heap and 4G new gen. Also make sure that you apply all the recommended production settings suggested by DataStax at http://docs.datastax.com/en/cassandra/2.1/cassandra/install/installRecommendSettings.html

Don't worry about wasting your memory: it will be used for OS caching, and you may get even better performance.

Thanks
Anuj

Sent from Yahoo Mail on Android

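The improvement mentioned above came from putting only rows that share a partition key into one batch, as Jack suggests in the quoted thread below. A minimal sketch of that grouping step follows, in plain Java with no driver dependency. The class name `PartitionBatcher`, the method `groupByPartitionKey`, and the "key:payload" row format are all hypothetical and used only for illustration; in a real client each group would be turned into its own batch.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical helper, not part of the DataStax driver.
class PartitionBatcher {

    // Groups rows by partition key, preserving first-seen key order,
    // so that each resulting list can be sent as one single-partition batch.
    static <K, R> Map<K, List<R>> groupByPartitionKey(List<R> rows, Function<R, K> partitionKeyOf) {
        Map<K, List<R>> groups = new LinkedHashMap<>();
        for (R row : rows) {
            groups.computeIfAbsent(partitionKeyOf.apply(row), k -> new ArrayList<>()).add(row);
        }
        return groups;
    }

    public static void main(String[] args) {
        // Toy rows of the form "partitionKey:payload"; the format is an assumption for the demo.
        List<String> rows = List.of("7:a", "9:b", "7:c", "9:d", "3:e");
        Map<String, List<String>> batches =
                groupByPartitionKey(rows, r -> r.substring(0, r.indexOf(':')));
        // Each entry would become one batch in a real client.
        batches.forEach((key, group) -> System.out.println(key + " -> " + group));
    }
}
```

If `id` alone is the partition key of the `teacher` table (the schema is not shown in the thread), every row forms its own group, so single-partition batching amounts to sending the inserts individually.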
From: "Jack Krupansky" <jack.krupansky@gmail.com>
Date: Tue, 8 Dec, 2015 at 8:07 pm
Subject: Re: Re: Re: Cassandra Tuning Issue

Great! Make sure to inform the C* email list as well so that others know.

-- Jack Krupansky

On Tue, Dec 8, 2015 at 7:44 AM, xutom <xutom2006@126.com> wrote:

Dear Jack,
    Thank you very much! We now get much better performance when each batch contains only rows with the same partition key.

jerry

At 2015-12-07 13:08:31, "Jack Krupansky" <jack.krupansky@gmail.com> wrote:

If you combine inserts for multiple partition keys in the same batch, you negate most of the effect of token-aware routing. It's best to insert only rows with the same partition key in a single batch. You also need to set the partition key for routing for the batch.

Also, RF=2 is not recommended since it does not permit quorum operations if a replica node is down. RF=3 is generally more appropriate.

-- Jack Krupansky

On Sun, Dec 6, 2015 at 10:27 PM, xutom <xutom2006@126.com> wrote:

Dear all,
    Thanks for your reply!
    I am now using Apache Cassandra 2.1.1 with JDK 1.7.0_79. My keyspace replication factor is 2, and I have enabled token-aware routing. The GC configuration is the default, for example:

    # GC tuning options
    JVM_OPTS="$JVM_OPTS -XX:+UseParNewGC"
    JVM_OPTS="$JVM_OPTS -XX:+UseConcMarkSweepGC"
    JVM_OPTS="$JVM_OPTS -XX:+CMSParallelRemarkEnabled"

    I checked the GC log (gc.log.0.current) and found only one Full GC. The stop-the-world times are low:

    CMS-initial-mark: 0.2747280 secs
    CMS-remark: 0.3623090 secs

    The insert code in my test client is as follows:

    String content = RandomStringUtils.randomAlphabetic(120);
    cluster = Cluster
            .builder()
            .addContactPoint(this.seedIP)
            .withCredentials("test", "test")
            .withRetryPolicy(DefaultRetryPolicy.INSTANCE)
            .withLoadBalancingPolicy(new TokenAwarePolicy(new DCAwareRoundRobinPolicy()))
            .build();
    session = cluster.connect("demo");
    ......

    PreparedStatement insertPreparedStatement = session.prepare(
            "   INSERT INTO teacher (id, lastname, firstname, city) " +
            "VALUES (?, ?, ?, ?); ");

    BatchStatement batch = new BatchStatement();
    for (; i < max; i+=5) {
        try {
            batch.add(insertPreparedStatement.bind(i, "Entre Nous", "adsfasdfa1", content));
            batch.add(insertPreparedStatement.bind(i+1, "Entre Nous", "adsfasdfa2", content));
            batch.add(insertPreparedStatement.bind(i+2, "Entre Nous", "adsfasdfa3", content));
            batch.add(insertPreparedStatement.bind(i+3, "Entre Nous", "adsfasdfa4", content));
            batch.add(insertPreparedStatement.bind(i+4, "Entre Nous", "adsfasdfa5", content));

//          System.out.println("the is is " + i);
            session.execute(batch);

            thisTimeCount += 5;
        }
    }

At 2015-12-07 00:40:06, "Graham Sanderson" <graham@vast.com> wrote:

What version of C* are you using, and what JVM version? You showed a partial GC config, but if that is still CMS (not G1) then you are going to have insane GC pauses...

Depending on the C* version, are you using on-heap or off-heap memtables, and what type?

Those are the sorts of issues related to fat nodes that I'd be worried about. We run very nicely at 20G total heap and 8G new; the rest of our 128G memory is disk cache/mmap and all of the off-heap stuff, so it doesn't go to waste.

That said, I think Jack is probably on the right path with overloaded coordinators, though you'd still expect to see CPU usage unless your timeouts are too low for the load. In that case the coordinator would be getting no responses in time, and quite possibly the other nodes are just dropping the mutations (since they don't get to them before they know the coordinator would have timed out). I forget the command to check dropped mutations off the top of my head, but you can see it in OpsCenter.

If you have GC problems you'd certainly expect to see GC CPU usage, but depending on how long you run your tests it might take you a little while to run through 40G.

I'm personally not a fan of >32G (ish) heaps, as you can't use compressed oops, and it is also unrealistic for CMS. The word is that G1 is now working OK with C*, especially on newer C* and JDK versions, but that said it takes quite a lot of throughput to require insane quantities of young gen. We are guessing that when we remove all our legacy Thrift batch inserts we will need less. As for 20G total, we actually don't need that much (we dropped from 24G when we moved memtables off heap, and believe we can drop further).

Sent from my iPhone

On Dec 6, 2015, at 9:07 AM, Jack Krupansky <jack.krupansky@gmail.com> wrote:

What replication factor are you using? Even if your writes use CL.ONE, Cassandra will be attempting writes to the replica nodes in the background.

Are your writes "token aware"? If not, the receiving node has the overhead of forwarding the request to the node that owns the token for the partition key.
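
To illustrate the forwarding overhead described above, here is a toy sketch of how a token maps to an owning node. It is a simplified model under stated assumptions, not Cassandra's implementation: the class `ToyTokenRing`, the node names, the token values, and the `toyToken` hash are all made up for the example (real clusters use the configured partitioner, typically Murmur3, and usually virtual nodes).

```java
import java.util.Map;
import java.util.TreeMap;

// Toy model of a token ring; real Cassandra uses partitioner tokens and vnodes.
class ToyTokenRing {
    private final TreeMap<Long, String> ring = new TreeMap<>();

    // Registers a node as owner of the range ending at (and including) this token.
    void addNode(long token, String node) {
        ring.put(token, node);
    }

    // Returns the node owning the given token: the first node whose token is
    // greater than or equal to it, wrapping around to the lowest token.
    String ownerOf(long token) {
        Map.Entry<Long, String> e = ring.ceilingEntry(token);
        return (e != null ? e : ring.firstEntry()).getValue();
    }

    // Stand-in hash for the demo only; it is NOT Cassandra's partitioner.
    static long toyToken(String partitionKey) {
        return Math.floorMod(partitionKey.hashCode(), 300);
    }

    public static void main(String[] args) {
        ToyTokenRing ring = new ToyTokenRing();
        ring.addNode(99, "node-a");
        ring.addNode(199, "node-b");
        ring.addNode(299, "node-c");
        String key = "teacher-42";
        String owner = ring.ownerOf(toyToken(key));
        // A token-aware client sends the write straight to `owner`;
        // any other coordinator would have to forward it, adding a hop.
        System.out.println(key + " -> " + owner);
    }
}
```

A client that knows this mapping can pick the owner as coordinator, which is what a token-aware load-balancing policy does on the driver side.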

For the record, Cassandra is not designed and optimized for so-called "fat nodes". The design focus is "commodity hardware" and "distributed cluster" (typically a dozen or more nodes).

That said, it would be good if we had a rule of thumb for how many simultaneous requests a node can handle, both external requests and inter-node traffic. I think there is an open Jira to enforce a limit on in-flight requests so that nodes don't get overloaded and start failing in the middle of writes, as you seem to be seeing.
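
Until a server-side limit exists, a similar cap can be approximated in the client. The following is a minimal sketch, assuming the client issues requests asynchronously; `InFlightLimiter` is a hypothetical class, not a Cassandra or driver API, and `CompletableFuture` stands in for whatever future type the driver's async call returns.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Semaphore;
import java.util.function.Supplier;

// Hypothetical client-side limiter; not a Cassandra or driver API.
class InFlightLimiter {
    private final Semaphore permits;

    InFlightLimiter(int maxInFlight) {
        this.permits = new Semaphore(maxInFlight);
    }

    // Blocks the caller until a slot is free, then starts the async request
    // and releases the slot when it completes, whether it succeeds or fails.
    <T> CompletableFuture<T> submit(Supplier<CompletableFuture<T>> request) {
        permits.acquireUninterruptibly();
        try {
            return request.get().whenComplete((result, error) -> permits.release());
        } catch (RuntimeException e) {
            permits.release(); // the request never started
            throw e;
        }
    }

    // Number of request slots currently free.
    int available() {
        return permits.availablePermits();
    }

    public static void main(String[] args) {
        InFlightLimiter limiter = new InFlightLimiter(2);
        CompletableFuture<String> pending = new CompletableFuture<>();
        limiter.submit(() -> pending); // occupies one slot until completed
        limiter.submit(() -> CompletableFuture.completedFuture("done")); // frees its slot at once
        System.out.println("free slots: " + limiter.available());
        pending.complete("ok");
        System.out.println("free slots: " + limiter.available());
    }
}
```

With a cap like this, adding more client threads stops increasing the number of outstanding writes once the limit is reached, so the threads wait in the client instead of piling requests onto the coordinators.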

-- Jack Krupansky

On Sun, Dec 6, 2015 at 9:29 AM, jerry <xutom2006@126.com> wrote:

Dear All,

    I have a 4-node Cassandra cluster, and I want to find its maximum performance. I wrote a Java client that batch-inserts data into all 4 Cassandra nodes. When I start fewer than 30 threads in my client application, everything is fine, but when I start more than 80 or 100 threads, there are too many timeout exceptions (such as: Cassandra timeout during write query at consistency ONE (1 replica were required but only 0 acknowledged the write)). No matter how many threads I use, even if I start multiple clients with multiple threads on different computers, the highest throughput I can get is about 60000 - 80000 TPS. By the way, each row I insert into Cassandra is about 130 bytes.
    Each of my 4 Cassandra nodes has:
        CPU: 4*15
        Memory: 512G
        Disk: flash card (only one disk but better than SSD)
    My Cassandra configuration is:
        MAX_HEAP_SIZE: 60G
        NEW_HEAP_SIZE: 40G

    When I insert data into my Cassandra cluster, no node reaches a bottleneck in CPU, memory, or disk. All three main hardware resources are idle. So I think there may be something wrong with the configuration of my Cassandra cluster. Can somebody please help me with my Cassandra tuning? Thanks in advance!

---1985756734-1664357670-1449594547=:24150--