From: "Jack Krupansky" <jack@basetechnology.com>
To: user@cassandra.apache.org
Subject: Re: horizontal query scaling issues follow on
Date: Fri, 18 Jul 2014 00:19:15 -0400

Sorry I may have confused the discussion by mentioning tokens – I wasn't intending to refer to vnodes or the num_tokens property, but merely referring to the token range of a node and that the partition key hashes to a token value.
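
To make that concrete: CQL's built-in token() function shows the token a given partition key hashes to. A minimal sketch (keyspace, table, and column names here are placeholders, not your actual schema):

    SELECT token(row_key), row_key FROM test_keyspace.test_data LIMIT 5;

Each returned token falls inside the range owned by one node (or one vnode), which is what determines where that row is stored, plus replicas when RF > 1.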
 
The main question is what you use for your primary key, and whether you are using a small number of partition keys and a large number of clustering columns, or whether each row has a unique partition key and no clustering columns.
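
For illustration only (hypothetical table and column names, not your actual schema), the two shapes would look roughly like this in CQL:

    -- Shape 1: a few partition keys, many clustering rows per partition.
    -- Every row sharing a bucket_id lands in the same partition.
    CREATE TABLE test_data_wide (
        bucket_id text,
        row_key   text,
        col1 text, col2 text, col3 text,
        PRIMARY KEY (bucket_id, row_key)
    );

    -- Shape 2: each row is its own partition.
    -- Rows are spread across the cluster by the hash of row_key.
    CREATE TABLE test_data_narrow (
        row_key text,
        col1 text, col2 text, col3 text,
        PRIMARY KEY (row_key)
    );

With shape 1, the 18 million rows can pile into a handful of partitions (and therefore a handful of nodes); with shape 2, they spread across all nodes by token.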
 
-- Jack Krupansky
 
From: Diane Griffith
Sent: Thursday, July 17, 2014 6:21 PM
To: user
Subject: Re: horizontal query scaling issues follow on
 
So do partitions equate to tokens/vnodes?
 
If so, we had configured all cluster nodes/VMs with num_tokens: 256 instead of setting initial_token and assigning ranges. I am still not getting why, in Cassandra 2.0, I would assign my own ranges via initial_token; this was based on the documentation and even this blog item that made it seem right for us to always configure our cluster VMs with num_tokens: 256 in the cassandra.yaml file.
 
Also, in all testing, all VMs were of equal sizing, so one was not more powerful than another.
 
I didn't think I was hitting an I/O wall on the client VM (a separate VM) where we command-line scripted our query calls to the Cassandra cluster. I can break the client call load across VMs, which I tried early on. Happy to verify that again though.
 
So, given that, I was assuming the partitions were such that it wasn't a problem. Is that an incorrect assumption and something to dig into more?
 
Thanks,
Diane


On Thu, Jul 17, 2014 at 3:01 PM, Jack Krupansky <jack@basetechnology.com> wrote:
How many partitions are you spreading those 18 million rows over? That many rows in a single partition will not be a sweet spot for Cassandra. It's not exceeding any hard limit (2 billion), but some internal operations may cache the partition rather than the logical row.
 
And all those rows in a single partition would certainly not be a test of "horizontal scaling" (adding nodes to handle more data – more token values or partitions).
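
If it would help to check how many partitions the data actually spans, one option (placeholder names again, and note this scans the whole table, so it can be slow on 18 million rows) is:

    -- DISTINCT is only allowed on partition key columns
    SELECT DISTINCT row_key FROM test_keyspace.test_data;

nodetool cfstats also reports an estimated key (partition) count per table, which is usually a cheaper way to get the same answer.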
 
-- Jack Krupansky
 
From: Diane Griffith
Sent: Thursday, July 17, 2014 1:33 PM
To: user
Subject: horizontal query scaling issues follow on
 

This is a follow-on re-post to clarify what we are trying to do, providing information that was missing or not clear.

 

Goal: Verify horizontal scaling for random, non-duplicating key reads using the simplest (or minimal) configuration possible.

 

Background:

A couple years ago we did similar performance testing with Cassandra for both read and write performance and found excellent (essentially linear) horizontal scalability. That project got put on hold. We are now moving forward with an operational system and are having scaling problems.

 

During the prior testing (3 years ago) we were using a much older version of Cassandra (0.8 or older), the Thrift API, and Amazon AWS rather than OpenStack VMs. We are now using the latest Cassandra and the CQL interface. We did try moving from OpenStack to AWS/EC2, but that did not materially change our (poor) results.

 

Test Procedure:

  • Inserted 54 million cells in 18 million rows (so 3 cells per row), using randomly generated row keys. That was to be our data control for the test.
  • Spawn a client on a different VM to query 100k rows and do that for 100 reps. Each row key queried is drawn randomly from the set of existing row keys and then not re-used, so all 10 million row queries use a different (valid) row key. This test is a specific use case of our system that we are trying to show will scale (the query shape is sketched below).
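
The query for each request is assumed to be a simple point read along these lines (placeholder names; the bound key is drawn at random from the inserted keys and never re-used):

    -- One point read per randomly selected, non-repeating row key
    SELECT col1, col2, col3
    FROM test_keyspace.test_data
    WHERE row_key = ?;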

Result:

  • 2 nodes performed better than the 1-node test, but 4 nodes showed decreased performance over 2 nodes. So that did not show horizontal scaling.

 

Notes:

  • We have the replication factor set to 1, as we were trying to keep the control test simple to prove out horizontal scaling (see the keyspace sketch after this list).
  • When we tried to add threading to see if it would help, it had interesting side behavior which did not prove out horizontal scaling.
  • We are using CQL rather than the Thrift API, on Cassandra 2.0.6.
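
For reference, "replication factor set to 1" corresponds to a keyspace defined roughly like this (the keyspace name is a placeholder):

    CREATE KEYSPACE test_keyspace
      WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1};

With RF = 1 each row lives on exactly one node, so every read must be served by the single node that owns that key's token range.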

 

 

Does anyone have any feedback on whether either threading or a higher replication factor is necessary to show horizontal scaling of Cassandra, versus the minimal approach of just continuing to add nodes to help throughput?

 

Any suggestions on the minimal configuration necessary to show scaling for our query use case: 100k requests for random, non-repeating keys constantly coming in over a period of time?


Thanks,

Diane

 