Subject: Re: Ec2Snitch to Ec2MultiRegionSnitch
From: aaron morton
Date: Wed, 24 Apr 2013 15:33:36 +1200
To: user@cassandra.apache.org

> You are advising me to test it, what would be a good way of testing it (I can use AWS EC2 instances if needed)?

If you are only using one Availability Zone per region, then you have only one rack per DC and the NetworkTopologyStrategy will do the right thing.

> Why? I mean we have maybe only 5% of our customers in the us-east zone; what in C* requires having the same number of nodes in each DC?

Because you are going to replicate your data 3 times in each DC so that each DC can operate with LOCAL_QUORUM.

> What is better about adding nodes with no data and then rebuilding them, compared to using auto_bootstrap?

nodetool rebuild is designed to pull data from another DC, so you can use it when the local DC does not contain data. That is, you do not want a node in the new DC bootstrapping from other nodes in the new DC, because they have no data.

> Any doc on this? I am not aware of all the possibilities. Why is this the best method according to you?

http://wiki.apache.org/cassandra/Operations?highlight=%28token%29#Token_selection
http://www.datastax.com/docs/1.2/initialize/token_generation

Because it's easier to understand than interleaving the nodes, and it works with 2+ DCs.

> What is the point of this?
http://wiki.apache.org/cassandra/FAQ#seed

> I didn't think this change would be that tricky, thank you guys for these warnings and your help ;)

Yup, this is a lot of work.

Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 23/04/2013, at 7:26 PM, Alain RODRIGUEZ wrote:

> Hi, this advice is very welcome.
>
> @Dane, about the rack awareness, we use only one rack per DC, so I guess using Ec2MultiRegionSnitch will do just fine and it doesn't need any configuration. Does that seem right to you? If we are someday interested in multiple racks I will make sure to use them properly. Thank you for this insight anyway. You are advising me to test it, what would be a good way of testing it (I can use AWS EC2 instances if needed)?
>
> @Aaron
>
> "I recommend using the same number of nodes in both DC's."
>
> Why? I mean we have maybe only 5% of our customers in the us-east zone; what in C* requires having the same number of nodes in each DC?
>
> "Add the nodes (I recommend 6) with auto_bootstrap: false added to the yaml.
> update the keyspace replication strategy to add rf:3 for the new DC.
> Use nodetool rebuild on the new nodes to rebuild them from the us-west DC. "
>
> What is better about adding nodes with no data and then rebuilding them, compared to using auto_bootstrap?
>
> "I prefer to use the offset method. Take the 6 tokens from your us-west DC and add 100 to them for the new dc. "
>
> Any doc on this? I am not aware of all the possibilities. Why is this the best method according to you?
>
> About seeds => "Yes. Have 3 from each."
>
> What is the point of this?
>
> I didn't think this change would be that tricky, thank you guys for these warnings and your help ;)
>
> Alain
>
>
> 2013/4/23 Dane Miller
> On Thu, Apr 18, 2013 at 7:41 AM, Alain RODRIGUEZ wrote:
> > I am wondering about the process to grow from one data center to a few of
> > them. First thing is we use Ec2Snitch for now. So I guess we have to switch
> > to Ec2MultiRegionSnitch.
> >
> > c/ I am using the SimpleStrategy. Is it worth it/mandatory to change this
> > strategy when using multiple DCs?
>
> I suggest you thoroughly read the datastax documentation on cassandra
> replication. The change you are planning is big - make sure to try it
> in a test environment first. Also, you might find you don't really
> need Cassandra's rack aware feature, and can operate using
> (Gossiping)PropertyFileSnitch. The rack feature is listed as an
> "anti-pattern" here:
> http://www.datastax.com/docs/1.2/cluster_architecture/anti_patterns
>
> Here are some recent discussions on this list:
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/migrating-from-SimpleStrategy-to-NetworkTopologyStrategy-tp7586272.html
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/migrating-from-SimpleStrategy-to-NetworkTopologyStrategy-tp7481090.html
>
> Dane
>
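
A minimal sketch of the offset token method discussed in this thread (take the 6 us-west tokens and add 100 to them for the new DC). It assumes RandomPartitioner, whose tokens range from 0 to 2**127 - 1, and evenly spaced tokens in the existing DC; neither is stated in the thread. The function names are hypothetical, and the printed values are what would go into initial_token in each node's yaml.

```python
# Sketch of the "offset" token method for a second data center.
# Assumptions (not stated in the thread): RandomPartitioner is in use
# and the existing us-west tokens are evenly spaced around the ring.

RANDOM_PARTITIONER_RANGE = 2 ** 127  # assumption: RandomPartitioner token space


def evenly_spaced_tokens(node_count):
    """Return one token per node, spread evenly over the ring."""
    return [i * RANDOM_PARTITIONER_RANGE // node_count for i in range(node_count)]


def offset_tokens(tokens, offset=100):
    """Shift each token by a fixed offset so the new DC does not reuse a token."""
    return [(t + offset) % RANDOM_PARTITIONER_RANGE for t in tokens]


us_west_tokens = evenly_spaced_tokens(6)
us_east_tokens = offset_tokens(us_west_tokens, offset=100)

# One line per node pair: existing us-west token, then the new us-east token.
for west, east in zip(us_west_tokens, us_east_tokens):
    print(west, east)
```

If the existing cluster's tokens are not evenly spaced, pass the real us-west tokens to offset_tokens instead of generating them.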