From: Arindam Barua <abarua@247-inc.com>
To: user@cassandra.apache.org
Subject: RE: Config changes to leverage new hardware
Date: Tue, 26 Nov 2013 01:58:29 +0000

 

Here are some calculated 'latency' results reported by cassandra-stress when asked to write 10M rows, i.e.

cassandra-stress -d <ip1>,<ip2> -n 10000000

(we actually had cassandra-stress running in daemon mode for the below tests)
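
(A read pass with the same tool would look something like

    cassandra-stress -d <ip1>,<ip2> -n 10000000 -o read

assuming the -o/--operation flag of the stress tool shipped with 1.1/1.2; flag names and accepted values may differ between versions.)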

 

avg_latency (percentile)                    90           99           99.9         99.99
Write: 8 cores, 32 GB, 3-disk RAID 0        0.002982182  0.003963931  0.004692996  0.004792326
Write: 32 cores, 128 GB, 7-disk RAID 0      0.003157515  0.003763181  0.005184429  0.005441946
Read: 8 cores, 32 GB, 3-disk RAID 0         0.002289879  0.057178021  0.173753058  0.24386912
Read: 32 cores, 128 GB, 7-disk RAID 0       0.002317525  0.010937648  0.013205977  0.014270511

 

The client was another node on the same network with the 8 core, 32 GB RAM specs. I wouldn't expect it to bottleneck, but I can monitor it while generating the load. In general, what would you expect it to bottleneck at?

 

>> Another interesting thing is that the linux disk cache doesn't seem to be growing in spite of a lot of free memory available.

> Things will only get paged in when they are accessed.

Hmm, interesting. I did a test where I just wrote large files to disk, e.g.

dd if=/dev/zero of=bigfile18 bs=1M count=10000

and checked the disk cache, and it increased by exactly the same size as the file written (no reads were done in this case).
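
For anyone who wants to repeat that check, a minimal sketch; it assumes the older procps layout of 'free', where the cached figure is the 7th field of the Mem: line (newer versions report a combined buff/cache column instead):

    free -m | awk 'NR==2 {print "cached before (MB):", $7}'   # page cache before the write
    dd if=/dev/zero of=bigfile18 bs=1M count=10000             # ~10 GB written through the page cache
    free -m | awk 'NR==2 {print "cached after (MB):", $7}'    # should grow by roughly the file size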

 

-----Original Message-----
From: Aaron Morton [mailto:aaron@thelastpickle.com]
Sent: Monday, November 25, 2013 11:55 AM
To: Cassandra User
Subject: Re: Config changes to leverage new hardware

 

> However, for both writes and reads there was virtually no difference in the latencies.

What sort of latency were you getting ?

 

> I’m still not very sure where the curr= ent *write* bottleneck is though.

What numbers are you getting ?

Could the bottleneck be the client ? Can it send writes fast enough to saturate the nodes ?

 

As a rule of thumb you should get 3,000 to 4,000 (non counter) writes per second per core.
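
(Applying that rule of thumb to the two machines, and optimistically counting hyperthreaded cores as full cores, the ceilings would be very roughly 8 x 3,000-4,000 = 24,000-32,000 writes/s on the old hardware and 32 x 3,000-4,000 = 96,000-128,000 writes/s on the new one.)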

 

> Sample iostat data (captured every 10s) for the dedicated disk where commit logs are written is below. Does this seem like a bottleneck?

Does not look too bad.

 

> Another interesting thing is that the linux disk cache doesn't seem to be growing in spite of a lot of free memory available.

Things will only get paged in when they are accessed.

 

Cheers

 

 

-----------------

Aaron Morton

New Zealand

@aaronmorton

 

Co-Founder & Principal Consultant

Apache Cassandra Consulting

http://www.thelastpickle.com

 

On 21/11/2013, at 12:42 pm, Arindam Barua <abarua@247-inc.com> wrote:

 

> Thanks for the suggestions Aaron.

> As a follow up, we ran a bunch of tests with different combinations of these changes on a 2-node ring. The load was generated using cassandra-stress, run with default values to write 30 million rows, and read them back.

> However, for both writes and reads there was virtually no difference in the latencies.

> The different combinations attempted:

> 1. Baseline test with none of the below changes.

> 2. Grabbing the TLAB setting from 1.2

> 3. Moving the commit logs too to the 7 disk RAID 0.

> 4. Increasing the concurrent_read to 32, and concurrent_write to 64

> 5. (3) + (4), i.e. moving commit logs to the RAID + increasing concurrent_read and concurrent_write config to 32 and 64.

> The write latencies were very similar, except that they were ~3x worse for the 99.9th percentile and above for scenario (5).

> The read latencies were also similar, with (3) and (5) being a little worse for the 99.99th percentile.

> Overall, not making any changes, i.e. (1) performed as well or slightly better than any of the other changes.

> Running cassandra-stress on both the old and new hardware without making any config changes, the write performance was very similar, but the new hardware did show ~10x improvement in the read for the 99.9th percentile and higher. After thinking about this, the reason why we were not seeing any difference with our test framework was perhaps the nature of the test, where we write the rows and then immediately read back the rows that were just written. The data is read back from the memtables, and never from the disk/sstables. Hence the new hardware's increased RAM and size of the disk cache or higher number of disks never helps.
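
(One way to make the read phase of a test like this actually touch the sstables instead of the memtables is to flush between the write and read passes, e.g.

    nodetool -h <host> flush <keyspace>

on each node; the keyspace argument is optional, and the keyspace name depends on how the stress tool was run.)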

> I’m still not very sure where the curr= ent *write* bottleneck is though. The new hardware has 32 cores vs 8 cores = of the old hardware. Moving the commit log from a dedicated disk to a 7 RAI= D-0 disk system (where it would be shared by other data though) didn’t make a difference too. (unless the extra c= ontention on the RAID nullified the positive effects of the RAID).

> Sample iostat data (captured every 10s) for the dedicated disk where commit logs are written is below. Does this seem like a bottleneck? When the commit logs are written the await/svctm ratio is high.

> Device:   rrqm/s   wrqm/s    r/s     w/s    rMB/s   wMB/s   avgrq-sz   avgqu-sz   await   svctm   %util
>             0.00     8.09   0.04    8.85     0.00    0.07      15.74       0.00    0.12    0.03    0.02
>             0.00   768.03   0.00    9.49     0.00    3.04     655.41       0.04    4.52    0.33    0.31
>             0.00     8.10   0.04    8.85     0.00    0.07      15.75       0.00    0.12    0.03    0.02
>             0.00   752.65   0.00   10.09     0.00    2.98     604.75       0.03    3.00    0.26    0.26
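
(For anyone wanting to capture the same view: the columns match the extended device report of sysstat's iostat in MB/s on a 10-second interval, i.e. something along the lines of

    iostat -x -m <commitlog-device> 10

where <commitlog-device> is whatever the dedicated commit log disk is called.)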

> Another interesting thing is that the linux disk cache doesn't seem to be growing in spite of a lot of free memory available. The total disk cache used reported by 'free' is less than the size of the sstables written, with over 100 GB unused RAM.

> Even in production, where we have the older hardware running with 32 GB RAM for a long time now, looking at 5 hosts in 1 DC, only 2.5 GB to 8 GB was used for the disk cache. The Cassandra java process uses the 8 GB allocated to it, and at least 10-15 GB on all the hosts is not used at all.

> Thanks,

> Arindam

> From: Aaron Morton [mailto:aaron@thelastpickle.com]

> Sent: Wednesday, November 06, 2013 8:34 PM

> To: Cassandra User

> Subject: Re: Config changes to leverage new hardware

> Running Cassandra 1.1.5 currently, but evaluating to upgrade to 1.2.11 soon.

> You will make more use of the extra memory moving to 1.2 as it moves bloom filters and compression data off heap.

> Also grab the TLAB setting from cassandra-env.sh in v1.2
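
(For reference, the setting being referred to should be the TLAB JVM option in the 1.2 cassandra-env.sh, something like

    JVM_OPTS="$JVM_OPTS -XX:+UseTLAB"

though the exact surrounding JVM flags vary between releases.)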

> As of now, our performance tests (our application specific as well as cassandra-stress) are not showing any significant difference between the two hardware configurations, which is a little disheartening, since the new hardware has a lot more RAM and CPU.

> For reads or writes or both ?

> Writes tend to scale with cores as long as the commit log can keep up.

> Reads improve with disk IO and page cache size when the hot set is in memory.

> Old Hardware: 8 cores (2 quad core), 32 GB RAM, four 1-TB disks (1 disk used for commitlog and 3 disks RAID 0 for data)

> New Hardware: 32 cores (2 8-core with hyperthreading), 128 GB RAM, eight 1-TB disks (1 disk used for commitlog and 7 disks RAID 0 for data)

> Is the disk IO on the commit log volume keeping up ?

> You cranked up the concurrent writers and the commit log may not keep up. You could put the commit log on the same RAID volume to see if that improves writes.
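
(In cassandra.yaml terms that experiment is just repointing the commit log at the data RAID; a sketch with placeholder paths:

    commitlog_directory: /raid0/cassandra/commitlog
    data_file_directories:
        - /raid0/cassandra/data

These are the same two settings involved in scenario (3) from the list above.)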

> The config we tried modifying so far was concurrent_reads to (16 * number of drives) and concurrent_writes to (8 * number of cores) as per

> 256 write threads is a lot. Make sure the commit log can keep up, I would put it back to 32, maybe try 64. Not sure the concurrent list for the commit log will work well with that many threads.

> May want to put the reads down as well.
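
(As cassandra.yaml settings the suggestion works out to something like

    concurrent_writes: 32    # or perhaps 64, rather than the 256 derived from 8 * cores
    concurrent_reads: 32

where the values are just the ones mentioned above, not official recommendations.)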

> It’s easier to tune the system if you = can provide some info on the workload.

> Cheers

> -----------------

> Aaron Morton

> New Zealand

> @aaronmorton

> Co-Founder & Principal Consultant

> Apache Cassandra Consulting

> http://www.thelastpickle.com

> On 7/11/2013, at 12:35 pm, Arindam Barua <abarua@247-inc.com> wrote:

>

>

> We want to upgrade our Cassandra cluster to have newer hardware, and were wondering if anyone has suggestions on Cassandra or linux config changes that will prove to be beneficial.

> As of now, our performance tests (our application specific as well as cassandra-stress) are not showing any significant difference between the two hardware configurations, which is a little disheartening, since the new hardware has a lot more RAM and CPU.

> Old Hardware: 8 cores (2 quad core), 32 GB RAM, four 1-TB disks (1 disk used for commitlog and 3 disks RAID 0 for data)

> New Hardware: 32 cores (2 8-core with hyperthreading), 128 GB RAM, eight 1-TB disks (1 disk used for commitlog and 7 disks RAID 0 for data)

> Most of the cassandra config currently is the default, and we are using the LeveledCompactionStrategy. Default key cache, row cache turned off.

> The config we tried modifying so far was concurrent_reads to (16 * number of drives) and concurrent_writes to (8 * number of cores) as per recommendation in cassandra.yaml, but that didn't make much difference.
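
(On the new hardware those formulas work out to concurrent_writes: 256 (8 * 32 cores, the "256 write threads" figure above) and concurrent_reads: 112 (16 * the 7 data disks).)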

> We were hoping that at least the extra RAM in the new hardware would be used for Linux file caching and hence an improvement in performance would be observed.

> Running Cassandra 1.1.5 currently, but evaluating to upgrade to 1.2.11 soon.

> Thanks,

> Arindam

 
