Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of ben@instaclustr.com
 designates 209.85.192.176 as permitted sender)
From: Ben Bromhead <ben@instaclustr.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_352A35A3-A87A-402B-8D85-D588019E7093"
Message-Id: <F0A98D08-7482-499E-BE2E-0F585D0531A6@instaclustr.com>
Mime-Version: 1.0 (Mac OS X Mail 6.5 \(1508\))
Subject: Re: autoscaling cassandra cluster
Date: Wed, 21 May 2014 10:08:34 -0700
References: 
 <CAPqEvGHif6CX_ksNPsE7O5bsg81=dp-e004DTrLtNmoqNwXBHg@mail.gmail.com>
 <CADfhF1EXDLxmnfYuHFYsxcg7xhnmAWskQh196EjhkqpkPWV2yg@mail.gmail.com>
 <CAPqEvGFax1OFXRFBQG7rzH1CpxRA_t7m8Z-NS67nX-5tauaSVA@mail.gmail.com>
 <CAKNNrnUptJKJgowjVYRc=sLiLeVasXgXcPY0jKPf7AVtwTVR9w@mail.gmail.com>
 <CAPqEvGHe=Mupg9SGjZHoAt+ygJ0S9Fa5k5hXzcpdiN2n=H6Mkg@mail.gmail.com>
 <C1A6918E-2E76-4731-8817-2772D5CDC430@opencore.io>
To: user@cassandra.apache.org
In-Reply-To: <C1A6918E-2E76-4731-8817-2772D5CDC430@opencore.io>


--Apple-Mail=_352A35A3-A87A-402B-8D85-D588019E7093
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=windows-1252

The mechanics for it are simple compared to figuring out when to scale, =
especially when you want to be scaling before peak load on your cluster =
(adding and removing nodes puts additional load on your cluster).
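
As a rough illustration of "scaling before peak load", here is a minimal
sketch. It is not Scryer's algorithm or anything we run in production.
It assumes you keep hourly load samples per day, and every name, capacity
figure, and threshold in it is made up for the example. It forecasts the
load a couple of hours ahead from the same hour on previous days, then
works out how many nodes to add or remove now:

```python
import math
from statistics import mean


def forecast_load(history, hour, days=3):
    """Seasonal-naive forecast: mean load at `hour` over the last `days` days.

    `history` is a list of per-day lists, each with 24 hourly load samples
    (hypothetical layout, e.g. requests per second).
    """
    recent = history[-days:]
    return mean(day[hour % 24] for day in recent)


def nodes_needed(load, per_node_capacity, headroom=0.3, min_nodes=3):
    """Nodes required to serve `load` while keeping `headroom` spare."""
    usable = per_node_capacity * (1.0 - headroom)
    return max(min_nodes, math.ceil(load / usable))


def plan(history, current_hour, current_nodes, per_node_capacity,
         lead_hours=2):
    """Node-count change to start now for the load expected later.

    `lead_hours` is an assumed allowance for bootstrap/streaming time.
    """
    expected = forecast_load(history, current_hour + lead_hours)
    target = nodes_needed(expected, per_node_capacity)
    return target - current_nodes
```

A positive result from plan() means "start adding that many nodes now", a
negative one means nodes could be decommissioned. The lead time matters
because a joining node streams data from its peers, so you want that
finished before the peak arrives.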

We are currently building our own in-house solution for this for our =
customers. If you want to have a go at it yourself, this is a good =
starting point:

=
http://techblog.netflix.com/2013/11/scryer-netflixs-predictive-auto-scalin=
g.html
=
http://techblog.netflix.com/2013/12/scryer-netflixs-predictive-auto-scalin=
g.html

Most of this is fairly specific to Netflix, but an interesting read =
nonetheless.

DataStax OpsCenter also provides capacity planning and forecasting, and =
can give you an easy set of metrics to base your scaling decisions on.

=
http://www.datastax.com/what-we-offer/products-services/datastax-opscenter=
=20

Ben Bromhead
Instaclustr | www.instaclustr.com | @instaclustr | +61 415 936 359


On 21/05/2014, at 7:51 AM, James Horey <jlh@opencore.io> wrote:

> If you're interested and/or need some Cassandra Docker images, let me =
know and I'll shoot you a link.
>=20
> James
>=20
> Sent from my iPhone
>=20
> On May 21, 2014, at 10:19 AM, Jabbar Azam <ajazam@gmail.com> wrote:
>=20
>> That sounds interesting. I was thinking of using CoreOS with Docker =
containers for the business logic, frontend and Cassandra. I'll also =
have a look at cassandra-mesos.
>>=20
>> Thanks
>>=20
>> Jabbar Azam
>>=20
>> On 21 May 2014 14:04, "Panagiotis Garefalakis" <pangaref@gmail.com> =
wrote:
>> I agree with Prem, but recently someone posted a promising project =
called cassandra-mesos to this list.=20
>> https://github.com/mesosphere/cassandra-mesos
>> One of its goals is to make scaling easier.=20
>> I don't have any personal opinion yet but maybe you could give it a =
try.
>>=20
>> Regards,
>> Panagiotis
>>=20
>>=20
>>=20
>> On Wed, May 21, 2014 at 3:49 PM, Jabbar Azam <ajazam@gmail.com> =
wrote:
>> Hello Prem,
>>=20
>> I'm trying to find out whether people are autoscaling up and down =
automatically, not manually. I'm also interested in whether they are =
using a cloud based solution and creating and destroying instances.=20
>>=20
>> I've found the following regarding GCE =
https://cloud.google.com/developers/articles/auto-scaling-on-the-google-cl=
oud-platform and how instances can be created and destroyed.=20
>>=20
>>  I
>>=20
>>=20
>> Thanks
>>=20
>> Jabbar Azam
>>=20
>>=20
>> On 21 May 2014 13:09, Prem Yadav <ipremyadav@gmail.com> wrote:
>> Hi Jabbar,
>> With vnodes, scaling up should not be a problem. You could just add =
machines with the cluster/seed/datacenter conf and they should join the =
cluster.
>> Scaling down has to be manual where you drain the node and =
decommission it.
>>=20
>> thanks,
>> Prem
>>=20
>>=20
>>=20
>> On Wed, May 21, 2014 at 12:35 PM, Jabbar Azam <ajazam@gmail.com> =
wrote:
>> Hello,
>>=20
>> Has anybody got a cassandra cluster which autoscales depending on =
load or times of the day?
>>=20
>> I've seen the documentation on the datastax website and that only =
mentioned adding and removing nodes, unless I've missed something.
>>=20
>> I want to know how to do this for the Google Compute Engine. This =
isn't for a production system but a test system (multiple nodes) where I =
want to learn. I'm not sure how to check the performance of the cluster, =
whether I use one performance metric or a mix of performance metrics and =
then invoke a script to add or remove nodes from the cluster.
>>=20
>> I'd be interested to know whether people out there are autoscaling =
cassandra on demand.
>>=20
>> Thanks
>>=20
>> Jabbar Azam
>>=20
>>=20
>>=20


--Apple-Mail=_352A35A3-A87A-402B-8D85-D588019E7093--