From: aaron morton <aaron@thelastpickle.com>
Subject: Re: Adding nodes in 1.2 with vnodes requires huge disks
Date: Mon, 29 Apr 2013 21:24:36 +1200
To: user@cassandra.apache.org

Is this understanding correct: "we had a 12 node cluster with 256 vnodes on each node (upgraded from 1.1); we added two additional nodes that streamed so much data (600+GB, when the other nodes held 150-200GB) during the joining phase that they filled their local disks and had to be killed"?

Can you raise a ticket on https://issues.apache.org/jira/browse/CASSANDRA and update the thread with the ticket number?

Can you show the output from nodetool status so we can get a feel for the ring?
Can you include the logs from one of the nodes that failed to join?
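A minimal sketch of how to capture all of that, assuming a default package-style install (the log path in particular is an assumption and may differ on your boxes):

    # Ring overview: per-node state, load, and token count
    nodetool status

    # Streaming progress as seen from the joining node
    nodetool netstats

    # Recent streaming/bootstrap activity in the log
    grep -iE 'stream|bootstrap' /var/log/cassandra/system.log | tail -100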
Thanks

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 29/04/2013, at 10:01 AM, John Watson <john@disqus.com> wrote:

> On Sun, Apr 28, 2013 at 2:19 PM, aaron morton <aaron@thelastpickle.com> wrote:
>> We're going to try running a shuffle before adding a new node again... maybe that will help
>
> I don't think it will hurt, but I doubt it will help.
>
> We had to bail on shuffle since we need to add capacity ASAP, not in 20 days.
>
>>> It seems when new nodes join, they are streamed *all* sstables in the cluster.
>
> How many nodes did you join, and what was num_tokens set to?
> Did you notice streaming from all nodes (in the logs), or are you saying this in response to the cluster load increasing?
>
> We were only adding 2 nodes at the time (planning to add a total of 12). We started with a cluster of 12, but are now at 11 since one node entered some weird state when one of the new nodes ran out of disk space.
> num_tokens is set to 256 on all nodes.
> Yes, nearly all of the current nodes were streaming to the new ones (which was great until disk space became an issue).
>
>>> The purple line machine, I just stopped the joining process because the main cluster was dropping mutation messages at this point on a few nodes (and it still had dozens of sstables to stream.)
>
> Which were the new nodes?
> Can you show the output from nodetool status?
>
> The new nodes are the purple and gray lines above all the others.
>
> nodetool status doesn't show joining nodes. I think I saw a bug already filed for this, but I can't seem to find it.
>
> Cheers
>
> -----------------
> Aaron Morton
> Freelance Cassandra Consultant
> New Zealand
>
> @aaronmorton
> http://www.thelastpickle.com
>
> On 27/04/2013, at 9:35 AM, Bryan Talbot <btalbot@aeriagames.com> wrote:
>
>> I believe that "nodetool rebuild" is used to add a new datacenter, not just a new host to an existing cluster. Is that what you ran to add the node?
>>
>> -Bryan
>>
>> On Fri, Apr 26, 2013 at 1:27 PM, John Watson <john@disqus.com> wrote:
>> Small relief that we're not the only ones who've had this issue.
>>
>> We're going to try running a shuffle before adding a new node again... maybe that will help.
>>
>> - John
>>
>> On Fri, Apr 26, 2013 at 5:07 AM, Francisco Nogueira Calmon Sobral <fsobral@igcorp.com.br> wrote:
>> I am using the same version and observed something similar.
>>
>> I added a new node, but the instructions from Datastax did not work for me. I then ran "nodetool rebuild" on the new node. After the command finished, the node held twice the load of the other nodes. Even after I ran "nodetool cleanup" on the older nodes, the situation was the same.
>>
>> The problem only seemed to disappear when "nodetool repair" was applied to all nodes.
>>
>> Regards,
>> Francisco Sobral.
>>
>> On Apr 25, 2013, at 4:57 PM, John Watson <john@disqus.com> wrote:
>>
>>> After finally upgrading to 1.2.3 from 1.1.9, enabling vnodes, and running upgradesstables, I figured it would be safe to start adding nodes to the cluster. Guess not?
>>>
>>> It seems when new nodes join, they are streamed *all* sstables in the cluster.
>>>
>>> https://dl.dropbox.com/s/bampemkvlfck2dt/Screen%20Shot%202013-04-25%20at%2012.35.24%20PM.png
>>>
>>> The gray line machine ran out of disk space, and for some reason this cascaded into errors across the cluster about 'no host id' when other nodes tried to store hints for it (even though it hadn't joined yet).
>>> The purple line machine, I just stopped the joining process because the main cluster was dropping mutation messages at this point on a few nodes (and it still had dozens of sstables left to stream).
>>>
>>> I followed this: http://www.datastax.com/docs/1.2/operations/add_replace_nodes
>>>
>>> Is there something missing in that documentation?
>>>
>>> Thanks,
>>>
>>> John
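For anyone retracing this thread later: a sketch of the plain single-datacenter bootstrap path the discussion keeps circling back to, assuming vnodes are already enabled cluster-wide (as Bryan notes above, nodetool rebuild is for standing up a new datacenter, not for adding a node to an existing one):

    # cassandra.yaml on the joining node:
    #   num_tokens: 256        # match the existing nodes
    #   initial_token:         # leave unset when vnodes are in use
    #   auto_bootstrap: true   # the default; the node streams its ranges while joining
    # seeds should list existing nodes only, never the joining node itself.

    # Once the new node has finished joining, reclaim the ranges it took
    # over by running this on each pre-existing node:
    nodetool cleanup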
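And since both failed joins in this thread ended with a full disk, a hedged sketch of what to keep an eye on while a node is joining (the data directory path is an assumption; adjust for your layout):

    # Poll streaming progress and free disk space during the join
    while sleep 60; do
        nodetool netstats
        df -h /var/lib/cassandra
    done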