Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 22201D0FF for ; Tue, 8 Jan 2013 16:42:04 +0000 (UTC) Received: (qmail 30505 invoked by uid 500); 8 Jan 2013 16:42:01 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 30483 invoked by uid 500); 8 Jan 2013 16:42:01 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 30475 invoked by uid 99); 8 Jan 2013 16:42:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Jan 2013 16:42:01 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of edlinuxguru@gmail.com designates 209.85.223.181 as permitted sender) Received: from [209.85.223.181] (HELO mail-ie0-f181.google.com) (209.85.223.181) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Jan 2013 16:41:54 +0000 Received: by mail-ie0-f181.google.com with SMTP id 16so765517iea.12 for ; Tue, 08 Jan 2013 08:41:34 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:content-type; bh=npkcO1Tv9k27OvQU0e98gNH58dX38/iwtrpk1LjArG8=; b=RoKZmfUqEk39cpRkd6Xnb3ccPbtw1dqys1fwpuVv6dGwC034lQhwozKYEp1Lgxv5t9 GT+es4Fp01HQyVEl/2QeP/MTyP+rjGcyIwdtUHDQdeUZ9m1YfZEF73cuLkOvYxlDWPgE fm77QXJJIobru5drUn+oI3QjU3ILyAZgo59uMJN/mfRd0e+MsZfx0Xg3VuRZlByXd5Qe x7fBoDJQNARKk4rWnr29uzLb/Bu3h0iFjtiwr3itI+4byNoqUi8Bm8dWrSlHmidHcdQ5 aUerTIy6LRtcaU4rby9Q991/F+FcbX47a4cinyLlcbQ1UKsLQjZXT3bLcqtihLRaz807 1A+Q== MIME-Version: 1.0 X-Received: by 10.50.7.204 with SMTP id l12mr9843857iga.103.1357663294065; Tue, 08 Jan 2013 08:41:34 -0800 (PST) Received: by 10.64.97.162 with HTTP; Tue, 8 Jan 2013 08:41:33 -0800 (PST) In-Reply-To: <13546_1357660668_50EC41FB_13546_6082_1_AEE40020481AB74EAADC798FC2BC7C4A01B14995CD9F@THSONEA01CMS03P.one.grp> References: <12205_1357648060_50EC10BB_12205_167_1_AEE40020481AB74EAADC798FC2BC7C4A01B14995C524@THSONEA01CMS03P.one.grp> <13546_1357660668_50EC41FB_13546_6082_1_AEE40020481AB74EAADC798FC2BC7C4A01B14995CD9F@THSONEA01CMS03P.one.grp> Date: Tue, 8 Jan 2013 11:41:33 -0500 Message-ID: Subject: Re: about validity of recipe "A node join using external data copy methods" From: Edward Capriolo To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=f46d04446b0b5e0c7004d2c99e4c X-Virus-Checked: Checked by ClamAV on apache.org --f46d04446b0b5e0c7004d2c99e4c Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable It has been true since about 0.8. in the old days ANTI-COMPACTION stunk and many weird errors would cause node joins to have to be retried N times. Now node moves/joins seem to work near 100% of the time (in 1.0.7) they are also very fast and efficient. If you want to move a node to new hardware you can do it with rsync, but I would not use the technique for growing the cluster. It is error prone, and ends up being more work. On Tue, Jan 8, 2013 at 10:57 AM, DE VITO Dominique < dominique.devito@thalesgroup.com> wrote: > " Now streaming is very efficient rarely fails and there is no need to > do it this way anymore" > > > > I guess it's true in v1.2. > > Is it true also in v1.1 ? > > > > Thanks. > > > > Dominique > > > > > > *De :* Edward Capriolo [mailto:edlinuxguru@gmail.com] > *Envoy=E9 :* mardi 8 janvier 2013 16:01 > *=C0 :* user@cassandra.apache.org > *Objet :* Re: about validity of recipe "A node join using external data > copy methods" > > > > Basically this recipe is from the old days when we had anti-compaction. > Now streaming is very efficient rarely fails and there is no need to do i= t > this way anymore. This recipe will be abolished from the second edition. = It > still likely works except when using counters. > > > > Edward > > > > On Tue, Jan 8, 2013 at 7:27 AM, DE VITO Dominique < > dominique.devito@thalesgroup.com> wrote: > > Hi, > > > > Edward Capriolo described in his Cassandra book a faster way [1] to start > new nodes if the cluster size doubles, from N to 2 *N. > > > > It's about splitting in 2 parts each token range taken in charge, after > the split, with 2 nodes: the existing one, and a new one. And for startin= g > a new node, one needs to: > > - copy the data records from the corresponding node (without the "system" > records) > > - start the new node with "auto_bootstrap: false" > > > > This raises 2 questions: > > > > A) is this recipe still valid with v1.1 and v1.2 ? > > > > B) do we still need to start the new node with "auto_bootstrap: false" ? > > My guess is "yes" as the happening of the bootstrap phase is not recorded > into the data records. > > > > Thanks. > > > > Dominique > > > > [1] see recipe "A node join using external data copy methods", page 165 > > > --f46d04446b0b5e0c7004d2c99e4c Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
It has been true since about 0.8. in the old days ANTI-COMPACTION stun= k and many weird errors would cause node joins to have to be retried N time= s.

Now node moves/joins seem to work near 100%= of the time (in 1.0.7) they are also very fast and efficient.=A0

If you want to move a node to new hardware you can do i= t with rsync, but I would not use the technique for growing the cluster. It= is error prone, and ends up being more work.

On Tue, Jan 8, 2013 at 10:57 AM, DE VITO Dominique <dominiq= ue.devito@thalesgroup.com> wrote:

" Now streami= ng is very efficient rarely fails and there is no need to do it this way anymore"

=A0

I guess it's true in v1.2.

Is it true also in v1.1 ?

=A0

Thanks.

=A0

Dominique

=A0

=A0

De=A0: Edward = Capriolo [mailto:edlinuxg= uru@gmail.com]
Envoy=E9=A0: mardi 8 janvier 2013 16:01
=C0=A0: user@cassandra.apache.org
Objet=A0: Re: about validity of recipe "A node join using external data copy methods"

=A0

Basically this recipe is from the old days when we h= ad anti-compaction. Now streaming is very efficient rarely fails and there is = no need to do it this way anymore. This recipe will be abolished from the seco= nd edition. It still likely works except when using counters.

=A0

Edward

=A0

On Tue, Jan 8, 2013 at 7:27 AM, DE VITO Dominique &l= t;dom= inique.devito@thalesgroup.com> wrote:

Hi,

=A0

Edward Capriolo described in his Cassandra book a faster way [1] to start new node= s if the cluster size doubles, from N to 2 *N.

=A0

It's about splitting in 2 parts each token range taken in charge, after the spli= t, with 2 nodes: the existing one, and a new one. And for starting a new node,= one needs to:

- copy the data records from the corresponding node (without the "system" records)

- start the new node with "auto_bootstrap: false"

=A0

This raises 2 questions:

=A0

A) is this recipe still valid with v1.1 and v1.2 ?

=A0

B) do we still need to start the new node with "auto_bootstrap: false&quo= t; ?

My guess is "yes" as the happening of the bootstrap phase is not recorded into the data records.

=A0

Thanks.

=A0

Dominique

=A0

[1] see recipe "A node join using external data copy methods", page 1= 65

=A0


--f46d04446b0b5e0c7004d2c99e4c--