From: Diane Griffith <dfgriffith@gmail.com>
To: user@cassandra.apache.org
Date: Mon, 21 Jul 2014 22:23:13 -0400
Subject: Re: horizontal query scaling issues follow on

So I appreciate all the help so far. Upfront, it is possible the schema and data query pattern could be contributing to the problem. The schema was born out of certain design requirements. If it proves to be part of what makes the scalability crumble, then I hope it will help shape the design requirements.
Anyway, the premise of the question was my struggle with scalability metrics falling apart when going from 2 nodes to 4 nodes for the current schema and query access pattern being modeled:

- 1 node was producing acceptable response times; that seemed to be the consensus
- 2 nodes showed a marked improvement in response times for the query scenario being modeled, which was welcome news
- 4 nodes showed a decrease in performance, and it was not clear why going from 2 to 4 nodes triggered the decrease

Two more items also contributed to the question:

- cassandra-env.sh, where the comments around the example HEAP_NEWSIZE state that it assumes a modern 8-core machine for decent pause times (a rough sketch of that comment follows this list)
- a wiki article I had found, and am trying to relocate, where a person set up very small nodes for the developers on that team and talked through all the parameters that had to be changed from the defaults to get good throughput. It sort of implied the defaults may have been based on a certain sized VM.
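For anyone who doesn't have the file open, the comment I mean reads roughly like this; I'm paraphrasing from memory of a 2.0.x cassandra-env.sh, so the exact wording and example values may differ slightly:

    # MAX_HEAP_SIZE is the total amount of memory dedicated to the Java heap;
    # HEAP_NEWSIZE refers to the size of the young generation. Set both or
    # neither (if you set one, set the other).
    #
    # The example HEAP_NEWSIZE assumes a modern 8-core+ machine for decent
    # pause times. If in doubt, and if you do not particularly want to tweak,
    # go with 100 MB per physical CPU core.
    #
    #MAX_HEAP_SIZE="4G"
    #HEAP_NEWSIZE="800M"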
That was the main driver for those questions. I agree it does not seem correct to boost the values, let alone so high, just to minimize impact in some respects (i.e. not trigger the reads to time out and start over given the retry policy).

So the question really was: are the defaults sized with the assumption of a certain minimal VM size (i.e. the comment in cassandra-env.sh)?

Does that explain where I am coming from better?

My question, despite being naive and ignoring other impacts, still stands: is there a minimal VM size that is more of a sweet spot for Cassandra and the defaults? I get the point that a column family schema, as it relates to the desired queries, can and does impact that answer. I guess what bothered me was that it didn't impact that answer going from 1 node to 2 nodes but started showing up going from 2 nodes to 4 nodes.

I'm building whatever facts I can to support whether the schema and query pattern scale or not. If they do not, then I am trying to pull information from the metrics output by nodetool, or from statements in the Cassandra log files, to support a case to change the design requirements.

Thanks,
Diane


On Mon, Jul 21, 2014 at 8:15 PM, Robert Coli <rcoli@eventbrite.com> wrote:

> On Sun, Jul 20, 2014 at 6:12 PM, Diane Griffith <dfgriffith@gmail.com> wrote:
>
>> I am running tests again across different numbers of client threads and
>> numbers of nodes, but this time I tweaked some of the timeouts configured
>> for the nodes in the cluster. I was able to get better performance on the
>> nodes at 10 client threads by upping 4 timeout values in cassandra.yaml
>> to 240000:
>
> If you have to tune these timeout values, you have probably modeled data
> in such a way that each of your requests is "quite large" or "quite slow".
>
> This is usually, but not always, an indicator that you are Doing It Wrong.
> Massively multithreaded things don't generally like their threads to be
> long-lived, for what should hopefully be obvious reasons.
>
>> I did this because of my interpretation of the cfhistograms output on one
>> of the nodes.
>
> Could you be more specific?
>
>> So 3 questions come to mind:
>>
>> 1. Did I interpret the histogram information correctly in the cassandra
>>    2.0.6 nodetool output? That is, in the two-column read latency output,
>>    the left column (the offset) is the time in milliseconds and the right
>>    column is the number of requests that fell into that bucket range.
>> 2. Was it reasonable for me to boost those 4 timeouts, and just those?
>
> Not really. In 5 years of operating Cassandra, I've never had a problem
> whose solution was to increase these timeouts from their default.
>
>> 3. What are reasonable timeout values for smaller VM sizes (i.e. 8 GB RAM,
>>    4 CPUs)?
>
> As above, I question the premise of this question.
>
> =Rob
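(For reference, the settings under discussion are the request timeouts in cassandra.yaml. I'm not reproducing my exact changes here, and the four I raised to 240000 may not map one-for-one onto this list, but in a stock 2.0.x config the relevant section looks roughly like the following, with what I believe are the shipped defaults:)

    # How long the coordinator should wait for read operations to complete
    read_request_timeout_in_ms: 5000
    # How long the coordinator should wait for seq or index scans to complete
    range_request_timeout_in_ms: 10000
    # How long the coordinator should wait for writes to complete
    write_request_timeout_in_ms: 2000
    # The default timeout for other, miscellaneous operations
    request_timeout_in_ms: 10000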