Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy includes SPF record at
 spf.trusted-forwarder.org)
MIME-Version: 1.0
In-Reply-To: 
 <CABNBnCjdO5jEqbujCo70qhDTOWm78nzzeePB5oV525-9UwTcbQ@mail.gmail.com>
References: 
 <CAJh8kFSwGBWtBaUDTfyC+ba7sA=pn1qW=vXAqVJWBkxHYUOqMA@mail.gmail.com>
	<CABNBnCjdO5jEqbujCo70qhDTOWm78nzzeePB5oV525-9UwTcbQ@mail.gmail.com>
Date: Thu, 21 Nov 2013 17:43:35 +0200
Message-ID: 
 <CAJh8kFQCH9SaVoFz6Sp_pyPBpV8R8HCaJq1OOFojQCJK_hVQrw@mail.gmail.com>
Subject: Re: Simple test of adding a node causes data loss
From: Tamar Rosen <tamar@correlor.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=047d7b33d414b901e604ebb1c20c

--047d7b33d414b901e604ebb1c20c
Content-Type: text/plain; charset=ISO-8859-1

This worked, thanks.


On Thu, Nov 21, 2013 at 5:20 PM, Julien Campan <julien.campan@gmail.com>wrote:

> Hi,
>
> You said : Adjusted cassandra.yaml as above except that for seeds put both
> server addresses
>
> If you put the new node into the seeds list, your node will not bootstrap.
> This could explain why you see only the half of your data.
>
> Can you check in system.log ?
>
>
> By the way, you can retry without adding the new server address in the
> seeds list and normally  should work :)
>
> You should add your new node into the seeds list only after the bootstrap
> operation.
>
>
>
> Julien Campan
>
>
>
>
> 2013/11/21 Tamar Rosen <tamar@correlor.com>
>
>> Hi,
>>
>> We are testing the process of adding a node to a cluster using a simple
>> procedure, and seeing data loss.
>>
>> System: Ubuntu 12.04 on AWS
>> Version: Cassandra + dsc 1.2.10
>>
>> Here is what we did:
>> Created 2 new m1.large instances
>> Installed Java
>> Installed Cassandra 1.2.10 (the version we are using in our production
>> system)
>>
>> In server1:
>> Adjusted cassandra.yaml
>>   comment out the initial_token
>>   uncomment num_tokens: 256
>>   changed "seeds" to the address of this server
>>   changed listen_address to the address of this server
>>   changed rpc_address to 0.0.0.0
>>   changed practitioner to org.apache.cassandra.dht.RandomPartitioner
>> made sure cassandra is not running
>> sudo rm -rf /var/lib/cassandra/*
>> started cassandra
>> connected via cqlsh
>> Created a new keyspace with replication factor 1
>> Created a new table
>> Populated the table with 4000 row of simple data using cql copy command
>> cqlsh> select count(*) - returns 4000
>> nodetool status shows a single server at this point (using vnodes)
>>
>> In server2:
>> made sure cassandra is not running
>> sudo rm -rf /var/lib/cassandra/*
>> Adjusted cassandra.yaml as above except that for seeds put both server
>> addresses
>> started cassandra
>> waited a couple of min
>>
>> What we found:
>> nodetool status on either server shows two servers, each with appox 50%
>> (but not exactly)
>> cqlsh>select count(*) - return 1870 (on either server)
>> This process was repeated 3 times. each time the number was a bit
>> different, but ~2000
>>
>> Notes
>> Replication factor is 1.
>> No nodetool cleanup was run
>>
>> We have successfully added nodes in the past, but not since we moved to
>> using vnodes
>> THIS WAS A TEST. CLEAN MACHINES, SIMPLE DATA - What are we doing wrong?
>>
>> Thanks,
>>
>> Tamar Rosen
>> Senior Data Architect
>> Correlor.com
>>
>>
>>
>>
>>
>>
>>
>
>

--047d7b33d414b901e604ebb1c20c
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div>This worked, thanks. <br></div><br></div><div class=
=3D"gmail_extra"><br><br><div class=3D"gmail_quote">On Thu, Nov 21, 2013 at=
 5:20 PM, Julien Campan <span dir=3D"ltr">&lt;<a href=3D"mailto:julien.camp=
an@gmail.com" target=3D"_blank">julien.campan@gmail.com</a>&gt;</span> wrot=
e:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div>

<p class=3D"MsoNormal" style=3D"margin-bottom:12pt;line-height:normal"><spa=
n style=3D"font-size:12pt;font-family:&quot;Times New Roman&quot;,&quot;ser=
if&quot;" lang=3D"EN-US">Hi,</span></p>

<p class=3D"MsoNormal" style=3D"margin-bottom:12pt;line-height:normal"><spa=
n style=3D"font-size:12pt;font-family:&quot;Times New Roman&quot;,&quot;ser=
if&quot;" lang=3D"EN-US">You said : Adjusted cassandra.yaml as above except=
 that for seeds put both
server addresses</span></p>

<p class=3D"MsoNormal" style=3D"margin-bottom:0.0001pt;line-height:normal">=
<span style=3D"font-size:12pt;font-family:&quot;Times New Roman&quot;,&quot=
;serif&quot;" lang=3D"EN-US">If you put the new node into the seeds list, y=
our node will not bootstrap.
This could explain why you see only the half of your data.</span></p>

<p class=3D"MsoNormal" style=3D"margin-bottom:0.0001pt;line-height:normal">=
<span style=3D"font-size:12pt;font-family:&quot;Times New Roman&quot;,&quot=
;serif&quot;" lang=3D"EN-US">Can you check in system.log ? </span></p>

<p class=3D"MsoNormal" style=3D"margin-bottom:12pt;line-height:normal"><spa=
n style=3D"font-size:12pt;font-family:&quot;Times New Roman&quot;,&quot;ser=
if&quot;" lang=3D"EN-US"><br>
By the way, you can retry without adding the new server address in the seed=
s list
and normally=A0 should work :)</span></p>

<p class=3D"MsoNormal" style=3D"margin-bottom:0.0001pt;line-height:normal">=
<span style=3D"font-size:12pt;font-family:&quot;Times New Roman&quot;,&quot=
;serif&quot;" lang=3D"EN-US">You should add your new node into the seeds li=
st only after the bootstrap
operation.</span></p><span class=3D"HOEnZb"><font color=3D"#888888">

<p class=3D"MsoNormal" style=3D"margin-bottom:0.0001pt;line-height:normal">=
<span style=3D"font-size:12pt;font-family:&quot;Times New Roman&quot;,&quot=
;serif&quot;" lang=3D"EN-US">=A0</span></p>

<p class=3D"MsoNormal" style=3D"margin-bottom:0.0001pt;line-height:normal">=
<span style=3D"font-size:12pt;font-family:&quot;Times New Roman&quot;,&quot=
;serif&quot;">Julien
Campan</span></p>

<p class=3D"MsoNormal">=A0</p>

</font></span></div><div><div class=3D"h5"><div class=3D"gmail_extra"><br><=
br><div class=3D"gmail_quote">2013/11/21 Tamar Rosen <span dir=3D"ltr">&lt;=
<a href=3D"mailto:tamar@correlor.com" target=3D"_blank">tamar@correlor.com<=
/a>&gt;</span><br>

<blockquote class=3D"gmail_quote" style=3D"margin:0pt 0pt 0pt 0.8ex;border-=
left:1px solid rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div>Hi,=
 <br><br>We are testing the process of adding a node to a cluster using a s=
imple procedure, and seeing data loss. <br>


<br></div><div>System: Ubuntu 12.04 on AWS<br>Version: Cassandra + dsc 1.2.=
10<br>
<br>Here is what we did:<br></div><div>Created 2 new m1.large instances<br>=
</div><div>Installed Java <br></div><div>Installed Cassandra 1.2.10 (the ve=
rsion we are using in our production system)<br></div><div><br>In server1:<=
br>


Adjusted cassandra.yaml<br></div><div>=A0 comment out the initial_token <br=
></div><div>=A0 uncomment num_tokens: 256<br></div><div>=A0 changed &quot;s=
eeds&quot; to the address of this server<br></div><div>=A0 changed listen_a=
ddress to the address of this server<br>


</div><div>=A0 changed rpc_address to 0.0.0.0<br></div><div>=A0 changed pra=
ctitioner to org.apache.cassandra.dht.RandomPartitioner<br></div><div><div>=
made sure cassandra is not running<br></div>sudo rm -rf /var/lib/cassandra/=
*<br>


started cassandra<br></div><div>connected via cqlsh<br></div><div>Created a=
 new keyspace with replication factor 1<br></div><div>Created a new table<b=
r></div><div>Populated the table with 4000 row of simple data using cql cop=
y command<br>


</div><div>cqlsh&gt; select count(*) - returns 4000<br>nodetool status show=
s a single server at this point (using vnodes) <br></div><div><br></div><di=
v>In server2:<br></div><div>made sure cassandra is not running<br></div>


<div>sudo rm -rf /var/lib/cassandra/*<br></div><div>Adjusted cassandra.yaml=
 as above except that for seeds put both server addresses<br></div><div>sta=
rted cassandra<br></div><div>waited a couple of min<br><br></div><div>


What we found:<br>
nodetool status on either server shows two servers, each with appox 50% (bu=
t not exactly)<br></div><div>cqlsh&gt;select count(*) - return 1870 (on eit=
her server)<br></div><div>This process was repeated 3 times. each time the =
number was a bit different, but ~2000<br>


<br></div><div>Notes <br>Replication factor is 1.<br></div><div>No nodetool=
 cleanup was run<br></div><div><br></div><div>We have successfully added no=
des in the past, but not since we moved to using vnodes<br>THIS WAS A TEST.=
 CLEAN MACHINES, SIMPLE DATA - What are we doing wrong?<br>


<br></div><div>Thanks, <br><br></div><div>Tamar Rosen<br></div><div>Senior =
Data Architect<br></div><div>Correlor.com<br></div><div><br></div><div><br>=
<br></div><div><br></div><div>=A0 <br></div><div>=A0<br></div></div>
</blockquote></div><br></div></div></div></div>
</blockquote></div><br></div>

--047d7b33d414b901e604ebb1c20c--