From: aaron morton <aaron@thelastpickle.com>
To: user@cassandra.apache.org
Subject: Re: how large cassandra could scale when it need to do manual operation?
Date: Sat, 9 Jul 2011 16:57:37 -0700

> about the decommission problem, here is the link:
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/how-to-decommission-two-slow-nodes-td5078455.html

The key part of that post is "and since the second node was under heavy load, and not enough ram, it was busy GCing and worked horribly slow".

> maybe I was misunderstanding the replication factor, doesn't RF=3 mean I could lose two nodes and still have one available (with 100% of the keys), once Nodes >= 3?

When you start losing replicas, the Consistency Level (CL) you use dictates whether the cluster is still up for 100% of the keys. See http://thelastpickle.com/2011/06/13/Down-For-Me/

> I have a strong urge to set RF to a very high value...

As Chris said, 3 is about normal; it means the QUORUM CL is only 2 nodes.
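To put rough numbers on that, here is a small sketch of the quorum arithmetic; it assumes nothing beyond the standard formula that a quorum is floor(RF/2) + 1 replicas, and the code is illustrative only:

    def quorum(rf):
        # A quorum is a strict majority of the RF replicas that hold a key.
        return rf // 2 + 1

    for rf in (1, 2, 3, 5):
        q = quorum(rf)
        # At CL QUORUM, reads/writes for a key keep working as long as
        # no more than rf - q of its replicas are down.
        print "RF=%d: QUORUM is %d replicas, tolerates %d down" % (rf, q, rf - q)

So raising RF on its own mostly raises the quorum size; at QUORUM it only buys roughly RF minus quorum-size extra failures per key.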
> I am also trying to deploy cassandra across two datacenters (with 20ms latency).

Look up LOCAL_QUORUM in the wiki.
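To make that concrete: with NetworkTopologyStrategy the replication factor is set per datacenter, and LOCAL_QUORUM only waits for a quorum of the replicas in the coordinator's own DC, so the 20ms WAN hop stays off the request path. A sketch only; the {DC1: 3, DC2: 3} placement is an assumed example, not anything from this thread:

    def quorum(n):
        return n // 2 + 1

    # Assumed per-datacenter replica counts, e.g. NetworkTopologyStrategy
    # configured with DC1:3, DC2:3.
    replicas = {"DC1": 3, "DC2": 3}

    total = sum(replicas.values())
    print "QUORUM       : %d of %d replicas, counted across both DCs" % (quorum(total), total)
    for dc, rf in sorted(replicas.items()):
        print "LOCAL_QUORUM : %d of %d replicas, all inside %s" % (quorum(rf), rf, dc)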
Hope that helps.

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 9 Jul 2011, at 02:01, Chris Goffinet wrote:

> As mentioned by Aaron, yes we run hundreds of Cassandra nodes across multiple clusters. We run with RF of 2 and 3 (most common).
>
> We use commodity hardware and see failure all the time at this scale. We've never had 3 nodes in the same replica set fail all at once. We mitigate risk by being rack diverse, using different vendors for our hard drives, designing workflows to make sure machines get serviced in certain time windows, and running an extensive automated burn-in process (disk, memory, drives) so we don't roll out nodes/clusters that could fail right away.
>
> On Sat, Jul 9, 2011 at 12:17 AM, Yan Chunlu <springrider@gmail.com> wrote:
> thank you very much for the reply, which gives me more confidence in cassandra.
> I will try the automation tools; the examples you've listed seem quite promising!
>
> about the decommission problem, here is the link:
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/how-to-decommission-two-slow-nodes-td5078455.html
> I am also trying to deploy cassandra across two datacenters (with 20ms latency), so I am worried that the network latency will make it even worse.
>
> maybe I was misunderstanding the replication factor, doesn't RF=3 mean I could lose two nodes and still have one available (with 100% of the keys), once Nodes >= 3? besides, I am not sure what twitter's RF setting is, but it is possible to lose 3 nodes at the same time (facebook once lost photos because their RAID broke, though that rarely happens). I have a strong urge to set RF to a very high value...
>
> Thanks!
>
> On Sat, Jul 9, 2011 at 5:22 AM, aaron morton <aaron@thelastpickle.com> wrote:
> AFAIK Facebook Cassandra and Apache Cassandra diverged paths a long time ago. Twitter is a vocal supporter with a large Apache Cassandra install, e.g. "Twitter currently runs a couple hundred Cassandra nodes across a half dozen clusters." http://www.datastax.com/2011/06/chris-goffinet-of-twitter-to-speak-at-cassandra-sf-2011
>
> If you are working with a 3 node cluster, removing/rebuilding/whatever one node will affect 33% of your capacity. When you scale up, the contribution from each individual node goes down, and the impact of one node going down is less. Problems that happen with a few nodes will go away at scale, to be replaced by a whole set of new ones.
>
>> 1): the load balance needs to be manually performed on every node, according to:
>
> Yes
>
>> 2): when adding new nodes, need to perform node repair and cleanup on every node
>
> You only need to run cleanup, see http://wiki.apache.org/cassandra/Operations#Bootstrap
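That cleanup step is easy to script. A hypothetical helper along those lines, with made-up host names, assuming passwordless SSH and nodetool on each node's PATH:

    #!/usr/bin/env python
    # Run "nodetool cleanup" on every existing node after a new node has
    # bootstrapped, so data for ranges that moved away gets dropped.
    import subprocess

    NODES = ["cass1.example.com", "cass2.example.com", "cass3.example.com"]

    for host in NODES:
        print "running cleanup on %s" % host
        subprocess.check_call(["ssh", host, "nodetool", "-h", "localhost", "cleanup"])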
>> 3) when decommissioning a node, there is a chance that it slows down the entire cluster. (not sure why but I saw people ask around about it.) and the only way to fix it is to shut down the entire cluster, rsync the data, and start all nodes without the decommissioned one.

> I cannot remember any specific cases where decommission requires a full cluster stop, do you have a link? With regard to slowing down, the decommission process will stream data from the node you are removing onto the other nodes; this can slow down the target node (I think it's more intelligent now about what is moved). This will be exaggerated in a 3 node cluster, as you are removing 33% of the processing and adding some (temporary) extra load to the remaining nodes.
>
>> after all, I think there is a lot of human work to do to maintain the cluster, which makes it impossible to scale to thousands of nodes,
>
> Automation, Automation, Automation is the only way to go.
>
> Chef, Puppet, CF Engine for general config and deployment; Cloud Kick, munin, ganglia etc for monitoring. And Ops Centre (http://www.datastax.com/products/opscenter) for cassandra specific management.
>
>> I am totally wrong about all of this, currently I am serving 1 million pv every day with Cassandra and it makes me feel unsafe, I am afraid one day one node crash will corrupt the data and the whole cluster will go wrong....
>
> With RF 3 and a 3 node cluster you have room to lose one node and the cluster will be up for 100% of the keys. While better than having to worry about *the* database server, it's still entry level fault tolerance. With RF 3 in a 6 node cluster you can lose up to 2 nodes and still be up for 100% of the keys.
>
> Is there something you are specifically concerned about with your current installation?
>
> Cheers
>
> -----------------
> Aaron Morton
> Freelance Cassandra Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 8 Jul 2011, at 08:50, Yan Chunlu wrote:
>
>> hi, all:
>> I am curious about how large Cassandra can scale?
>>
>> from the information I can get, the largest usage is at facebook, which is about 150 nodes. in the mean time they are using 2000+ nodes of Hadoop, and yahoo is even using 4000 nodes of Hadoop.
>>
>> I don't understand why that is the situation; I only have a little knowledge of Cassandra and no knowledge of Hadoop at all.
>>
>> currently I am using cassandra with 3 nodes and having problems bringing one back after it got out of sync; the problems I encountered make me worry about how cassandra could scale out:
>>
>> 1): the load balance needs to be manually performed on every node, according to:
>>
>>     def tokens(nodes):
>>         for x in xrange(nodes):
>>             print 2 ** 127 / nodes * x
>>
>> 2): when adding new nodes, need to perform node repair and cleanup on every node
>>
>> 3) when decommissioning a node, there is a chance that it slows down the entire cluster. (not sure why but I saw people ask around about it.) and the only way to fix it is to shut down the entire cluster, rsync the data, and start all nodes without the decommissioned one.
>>
>> after all, I think there is a lot of human work to do to maintain the cluster, which makes it impossible to scale to thousands of nodes, but I hope I am totally wrong about all of this. currently I am serving 1 million pv every day with Cassandra and it makes me feel unsafe; I am afraid one day one node crash will corrupt the data and the whole cluster will go wrong....
>>
>> on the contrary, a relational database makes me feel safe but it does not scale well.
>>
>> thanks for any guidance here.
>
> --
> Charles
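For reference, the tokens() script quoted in item 1) of the original message, in a runnable form that also shows where the numbers end up. The six-node count is just an example, and the values assume the RandomPartitioner's 0..2**127 token space:

    # Evenly spaced initial tokens for a RandomPartitioner ring.
    def tokens(nodes):
        return [2 ** 127 / nodes * x for x in xrange(nodes)]

    for i, t in enumerate(tokens(6)):
        # Each value is set as initial_token in that node's cassandra.yaml
        # before it first starts, or applied to a running node with
        # "nodetool move <token>".
        print "node %d: initial_token = %d" % (i, t)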