Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
From: aaron morton <aaron@thelastpickle.com>
Mime-Version: 1.0 (Apple Message framework v1244.3)
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_1E069596-30B8-47C1-B126-E0B96B1F30D9"
Subject: Re: Bootstrapping
Date: Thu, 11 Aug 2011 13:59:38 +1200
In-Reply-To: <8E8C8608-59F7-43F7-BEF1-94751F99DFAE@gmail.com>
To: user@cassandra.apache.org
References: <8E8C8608-59F7-43F7-BEF1-94751F99DFAE@gmail.com>
Message-Id: <C2597AF1-8E83-478C-A532-4A3767686431@thelastpickle.com>


--Apple-Mail=_1E069596-30B8-47C1-B126-E0B96B1F30D9
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=us-ascii

First, upgrade from 0.7.5 if possible. This is as good a reason as any:
https://github.com/apache/cassandra/blob/cassandra-0.7.8/CHANGES.txt#L58

Can you copy the SSTables off the node and then just bring them back? It
will be *a lot* faster than using nodetool repair. (Drain the node first
to clear the commit log.) Or, if you have a spare machine, perform a
rolling migration.

If at all possible I would try to do it as an upgrade, as described
above. It will be much, much easier.
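
As a rough sketch of the copy-off / copy-back approach above: the paths,
the use of rsync, and the helper names here are my assumptions, not
anything Cassandra prescribes. Stopping and starting Cassandra is left
out because it depends on how you installed it.

```python
import subprocess


def copy_and_restore_plan(host, data_dir, backup_dir):
    """Return the commands for each phase as lists of argv lists.

    host, data_dir and backup_dir are placeholders for your own setup.
    """
    return {
        # Flush memtables and stop accepting writes, so nothing is left
        # in the commit log when the node goes down.
        "before_shutdown": [["nodetool", "-h", host, "drain"]],
        # With Cassandra stopped, copy the SSTables off the data partition.
        "after_shutdown": [["rsync", "-a", data_dir + "/", backup_dir + "/"]],
        # After the filesystem work, copy them back before restarting.
        "before_restart": [["rsync", "-a", backup_dir + "/", data_dir + "/"]],
    }


def run_phase(plan, phase, dry_run=True):
    """Print each command in a phase, and execute it unless dry_run is set."""
    for argv in plan[phase]:
        print(" ".join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)


# Hypothetical values, adjust for your own node.
plan = copy_and_restore_plan(
    "localhost", "/var/lib/cassandra/data", "/mnt/backup/cassandra-data"
)
run_phase(plan, "before_shutdown")  # dry run: only prints the command
```

Keeping the plan as data makes it easy to review the commands for each
of the 15 nodes before anything is actually run.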

If you plan to turn a node off and clear its data you should remove the
node's token from the ring. You can either use nodetool decommission,
which will distribute the data around the ring, or turn it off and then
use nodetool removetoken, which will not.
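
To make the difference between the two options concrete, here is a small
sketch that builds the nodetool command line for each. The function name
and the host names are made up for illustration; only the decommission
and removetoken subcommands and the -h option come from nodetool itself.

```python
def ring_removal_command(method, host, token=None):
    """Build the nodetool argv for taking a token out of the ring."""
    if method == "decommission":
        # Run against the node that is leaving, while it is still up.
        return ["nodetool", "-h", host, "decommission"]
    if method == "removetoken":
        # Run against a live node after the departing node is shut down.
        if token is None:
            raise ValueError("removetoken needs the token of the dead node")
        return ["nodetool", "-h", host, "removetoken", str(token)]
    raise ValueError("unknown method: " + method)


# Hypothetical hosts and token.
leaving = ring_removal_command("decommission", "node07.example.com")
dead = ring_removal_command("removetoken", "node01.example.com", token=12345)
```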

> 1. I realize this will put a heavier I/O load on the replication nodes
> to AntiCompact the CF's, but what kind of load does this put on the JVM.
> Are there any gotchas I should be aware of to prevent long gc times or
> OOM exceptions on the replication nodes.
We don't have the AntiCompaction step any more. If your app is stable I
would assume the repair process would be too. Do your normal repair
processes complete OK?

> 3. Documentation at http://wiki.apache.org/cassandra/Operations says
> that the thrift port is not active on the bootstrapping node during the
> streaming process. What is the process that brings the node up-to-date
> with mutations that occurred during the time of the bootstrap? Maybe
> it's only reads that are disabled and writes are allowed?

Thrift is the connection the client uses; disabling it means clients
cannot write to it. The node will announce its intention to take
ownership of a token range in the ring when the bootstrap starts. From
that point on other nodes will include it in write requests but not read
requests. During that time your data is replicated to RF+1 nodes.

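A toy model of that write/read split, in case it helps. This is not
Cassandra's code: it assumes a simple ring where the replicas for a key
are the next RF nodes clockwise from the key's token, and the node names,
tokens and function names are invented for the example.

```python
from bisect import bisect_left


def replicas(ring, key_token, rf):
    """Return the rf nodes owning key_token, walking the ring clockwise.

    ring maps token -> node name. The first replica is the node with the
    smallest token >= key_token, wrapping around to the lowest token.
    """
    tokens = sorted(ring)
    start = bisect_left(tokens, key_token) % len(tokens)
    count = min(rf, len(tokens))
    return [ring[tokens[(start + i) % len(tokens)]] for i in range(count)]


def read_targets(ring, key_token, rf):
    """Reads only go to the current owners, never to the joining node."""
    return replicas(ring, key_token, rf)


def write_targets(ring, joining, key_token, rf):
    """Writes go to the current owners plus any node that will own the key."""
    natural = replicas(ring, key_token, rf)
    future = replicas({**ring, **joining}, key_token, rf)
    pending = [node for node in future if node not in natural]
    return natural + pending


ring = {0: "A", 25: "B", 50: "C", 75: "D"}
joining = {60: "E"}  # node E is bootstrapping at token 60

reads = read_targets(ring, 55, 3)
writes = write_targets(ring, joining, 55, 3)
unaffected = write_targets(ring, joining, 65, 3)
```

In this example a key at token 55 is read from D, A and B but written to
D, A, B and E, which is the RF+1 case. A key at token 65 falls outside
the range E is taking over, so it is still written to D, A and B only.
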
> 4. What happens if schema changes (add/drop column families) occur in
> the cluster while the bootstrap is in progress?
They will be distributed to the node when it comes back. Until it gets
the new updates it will log ERRORs for mutations to non-existent CFs.
Best advice is do not make those changes.

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 11 Aug 2011, at 09:54, Chad Johnson wrote:

> Hi,
>
> I have a 15 node cluster with a RF=3 running version 0.7.5. I am
> planning to perform some filesystem maintenance on each of the nodes.
> The filesystem happens to be on the partition holding the keyspace data.
> The maintenance means that all the SSTables for our keyspace will be
> destroyed. Rather than backup all the data to a backup disk and restore,
> my plan was to bring the node down, perform the maintenance, keep the
> original initial_token, set auto_bootstrap to true and let Cassandra
> repopulate the data through the streaming process. Nodes in the cluster
> will have a load of about 250 to 300GB
>
> I have a couple questions regarding bootstrapping and the streaming
> process.
>
> 1. I realize this will put a heavier I/O load on the replication nodes
> to AntiCompact the CF's, but what kind of load does this put on the JVM.
> Are there any gotchas I should be aware of to prevent long gc times or
> OOM exceptions on the replication nodes.
> 2. If the initial_token is not changed, is it correct to assume that
> anticompaction will occur only on the replication nodes and not
> throughout the cluster as the key space has not been modified.
> 3. Documentation at http://wiki.apache.org/cassandra/Operations says
> that the thrift port is not active on the bootstrapping node during the
> streaming process. What is the process that brings the node up-to-date
> with mutations that occurred during the time of the bootstrap? Maybe
> it's only reads that are disabled and writes are allowed?
> 4. What happens if schema changes (add/drop column families) occur in
> the cluster while the bootstrap is in progress?
>
> Thanks for your help
>
> Chad


--Apple-Mail=_1E069596-30B8-47C1-B126-E0B96B1F30D9--