Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of tompoges@gmail.com designates
 74.125.82.45 as permitted sender)
MIME-Version: 1.0
In-Reply-To: <CD5A3161.21D75%Dean.Hiller@nrel.gov>
References: 
 <CAGj0pLnEnVeSPUDMh=2dRaBrpC+q+tPn_osLqSdv9aWmXrEuaw@mail.gmail.com>
	<CD5A3161.21D75%Dean.Hiller@nrel.gov>
Date: Mon, 4 Mar 2013 18:20:26 +0000
Message-ID: 
 <CAGj0pL=iWLZt1Vhh8k=q9szZ=rHpOnQ7S_pvZgHh+Nk382nvRw@mail.gmail.com>
Subject: Re: Poor read latency
From: Tom Martin <tompoges@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=f46d04374a093c6c0404d71d6908

--f46d04374a093c6c0404d71d6908
Content-Type: text/plain; charset=windows-1252
Content-Transfer-Encoding: quoted-printable

Yeah, I just checked and the heap size 0.75 warning has been appearing.

nodetool info reports:

Heap Memory (MB) : 563.88 / 1014.00
Heap Memory (MB) : 646.01 / 1014.00
Heap Memory (MB) : 639.71 / 1014.00

We have plenty of free memory on each instance.  Do we need bigger
instances or should we just configure each node to have a bigger max heap?


On Mon, Mar 4, 2013 at 6:10 PM, Hiller, Dean <Dean.Hiller@nrel.gov> wrote:

> What is nodetool info say for your memory?  (we hit that one with memory
> near the max and it slowed down our system big time=85still working on
> resolving it too).
>
> Do any logs have the hit 0.75, running compaction OR worse hit 0.85
> running compaction=85.you get that if the above is the case typically.
>
> Dean
>
> From: Tom Martin <tompoges@gmail.com<mailto:tompoges@gmail.com>>
> Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>" <
> user@cassandra.apache.org<mailto:user@cassandra.apache.org>>
> Date: Monday, March 4, 2013 10:31 AM
> To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>" <
> user@cassandra.apache.org<mailto:user@cassandra.apache.org>>
> Subject: Poor read latency
>
> Hi all,
>
> We have a small (3 node) cassandra cluster on aws.  We have a replication
> factor of 3, a read level of local_quorum and are using the ephemeral dis=
k.
>  We're getting pretty poor read performance and quite high read latency i=
n
> cfstats.  For example:
>
> Column Family: AgentHotel
> SSTable count: 4
> Space used (live): 829021175
> Space used (total): 829021175
> Number of Keys (estimate): 2148352
> Memtable Columns Count: 0
> Memtable Data Size: 0
> Memtable Switch Count: 0
> Read Count: 67204
> Read Latency: 23.813 ms.
> Write Count: 0
> Write Latency: NaN ms.
> Pending Tasks: 0
> Bloom Filter False Positives: 50
> Bloom Filter False Ratio: 0.00201
> Bloom Filter Space Used: 7635472
> Compacted row minimum size: 259
> Compacted row maximum size: 4768
> Compacted row mean size: 873
>
> For comparison we have a similar set up in another cluster for an old
> project (hosted on rackspace) where we're getting sub 1ms read latencies.
>  We are using multigets on the client (Hector) but are only requesting ~4=
0
> rows per request on average.
>
> I feel like we should reasonably expect better performance but perhaps I'=
m
> mistaken.  Is there anything super obvious we should be checking out?
>
>

--f46d04374a093c6c0404d71d6908
Content-Type: text/html; charset=windows-1252
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Yeah, I just checked and the heap size 0.75 warning has be=
en appearing.<div><br></div><div style>nodetool info reports:</div><div sty=
le><br></div><div style><div>Heap Memory (MB) : 563.88 / 1014.00</div><div>
Heap Memory (MB) : 646.01 / 1014.00<br></div><div>Heap Memory (MB) : 639.71=
 / 1014.00<br></div><div><br></div><div style>We have plenty of free memory=
 on each instance. =A0Do we need bigger instances or should we just configu=
re each node to have a bigger max heap?</div>
</div></div><div class=3D"gmail_extra"><br><br><div class=3D"gmail_quote">O=
n Mon, Mar 4, 2013 at 6:10 PM, Hiller, Dean <span dir=3D"ltr">&lt;<a href=
=3D"mailto:Dean.Hiller@nrel.gov" target=3D"_blank">Dean.Hiller@nrel.gov</a>=
&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">What is nodetool info say for your memory? =
=A0(we hit that one with memory near the max and it slowed down our system =
big time=85still working on resolving it too).<br>

<br>
Do any logs have the hit 0.75, running compaction OR worse hit 0.85 running=
 compaction=85.you get that if the above is the case typically.<br>
<br>
Dean<br>
<br>
From: Tom Martin &lt;<a href=3D"mailto:tompoges@gmail.com">tompoges@gmail.c=
om</a>&lt;mailto:<a href=3D"mailto:tompoges@gmail.com">tompoges@gmail.com</=
a>&gt;&gt;<br>
Reply-To: &quot;<a href=3D"mailto:user@cassandra.apache.org">user@cassandra=
.apache.org</a>&lt;mailto:<a href=3D"mailto:user@cassandra.apache.org">user=
@cassandra.apache.org</a>&gt;&quot; &lt;<a href=3D"mailto:user@cassandra.ap=
ache.org">user@cassandra.apache.org</a>&lt;mailto:<a href=3D"mailto:user@ca=
ssandra.apache.org">user@cassandra.apache.org</a>&gt;&gt;<br>

Date: Monday, March 4, 2013 10:31 AM<br>
To: &quot;<a href=3D"mailto:user@cassandra.apache.org">user@cassandra.apach=
e.org</a>&lt;mailto:<a href=3D"mailto:user@cassandra.apache.org">user@cassa=
ndra.apache.org</a>&gt;&quot; &lt;<a href=3D"mailto:user@cassandra.apache.o=
rg">user@cassandra.apache.org</a>&lt;mailto:<a href=3D"mailto:user@cassandr=
a.apache.org">user@cassandra.apache.org</a>&gt;&gt;<br>

Subject: Poor read latency<br>
<div class=3D"HOEnZb"><div class=3D"h5"><br>
Hi all,<br>
<br>
We have a small (3 node) cassandra cluster on aws. =A0We have a replication=
 factor of 3, a read level of local_quorum and are using the ephemeral disk=
. =A0We&#39;re getting pretty poor read performance and quite high read lat=
ency in cfstats. =A0For example:<br>

<br>
Column Family: AgentHotel<br>
SSTable count: 4<br>
Space used (live): 829021175<br>
Space used (total): 829021175<br>
Number of Keys (estimate): 2148352<br>
Memtable Columns Count: 0<br>
Memtable Data Size: 0<br>
Memtable Switch Count: 0<br>
Read Count: 67204<br>
Read Latency: 23.813 ms.<br>
Write Count: 0<br>
Write Latency: NaN ms.<br>
Pending Tasks: 0<br>
Bloom Filter False Positives: 50<br>
Bloom Filter False Ratio: 0.00201<br>
Bloom Filter Space Used: 7635472<br>
Compacted row minimum size: 259<br>
Compacted row maximum size: 4768<br>
Compacted row mean size: 873<br>
<br>
For comparison we have a similar set up in another cluster for an old proje=
ct (hosted on rackspace) where we&#39;re getting sub 1ms read latencies. =
=A0We are using multigets on the client (Hector) but are only requesting ~4=
0 rows per request on average.<br>

<br>
I feel like we should reasonably expect better performance but perhaps I=
9;m mistaken. =A0Is there anything super obvious we should be checking out?=
<br>
<br>
</div></div></blockquote></div><br></div>

--f46d04374a093c6c0404d71d6908--