Subject: Re: Ec2Snitch & Other snitches...
From: Sasha Dolgy <sdolgy@gmail.com>
To: user@cassandra.apache.org
Date: Tue, 22 Mar 2011 17:28:36 +0100

Thanks for the good response.

My thought was that as AWS becomes more and more expensive (no option to swap out small cheap disks for larger cheap disks...), I'll need to switch to dedicated hardware and the topology will change. I didn't want to back myself into a corner early on, while the amount of data is still manageable.

-sd

On Mar 22, 2011 5:05 PM, "Robert Coli" <rcoli@digg.com> wrote:
> On Tue, Mar 22, 2011 at 7:19 AM, Sasha Dolgy <sdolgy@gmail.com> wrote:
>> More, I suppose the question I'm after is: can the snitch method be
>> adjusted ad hoc (with a node restart), or once it's changed from
>> SimpleSnitch to Ec2Snitch, that's it?
>
> You can change Snitches on a cluster with data on it, as long as you
> are very careful about what you are doing and you are in a particular
> case, which you are probably not in if you want to change your Snitch.
>
> The snitch meaningfully determines replica placement strategy, and in
> general when changing snitches you need the replica placement strategy
> to stay exactly the same. Unfortunately, the point of changing a snitch
> is usually... changing your replica placement strategy. The simplest
> case is when the replica placement strategy actually stays the same,
> for example when Digg replaced its custom version of the
> PropertyFileSnitch with SimpleSnitch in preparation for going
> single-DC, because we weren't actually using the functionality of PFS.
> In that case, I simply generated a set of input which hashed correctly
> such that I had one piece of input per node. I then verified the
> topology based on this input before and after changing my snitch, and
> got the same results both times, confirming that my change of the
> Snitch was a no-op.
>
> A less simple, but still tractable, case is when the topology changes
> such that one or more replicas is different but at least one is still
> the same. In that case, repair would be likely to repair... most... of
> your data. But honestly, if you have to change strategy that much (and
> are not running IP-partitioned counts, which make this operation much
> more difficult), you probably just want to dump and reload your data
> into a new cluster which has the topology and snitch you want.
>
> =Rob
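
For the mechanical half of the original question: the snitch is chosen by the endpoint_snitch setting in cassandra.yaml and is only read at startup, so the switch itself is just a matter of editing that line on every node, e.g. from org.apache.cassandra.locator.SimpleSnitch to org.apache.cassandra.locator.Ec2Snitch, and doing a rolling restart; there is no way to swap it on a live node without restarting it. Ec2Snitch then derives topology from EC2 metadata, roughly: the region becomes the data center and the availability zone the rack. As Rob explains above, that edit is the easy part; whether replica placement stays the same afterwards is the part to worry about.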

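Rob's check (generate a handful of probe keys, then compare each key's replica set before and after the snitch change) can be scripted against nodetool's getendpoints command. A rough sketch along those lines, assuming your nodetool has getendpoints and can reach a node; the keyspace and column family names ("ks1"/"cf1") and the probe keys are placeholders, not anything from this thread:

    #!/usr/bin/env python
    # Sketch of the "verify replica placement before and after a snitch change"
    # check described above. Keyspace/CF names are placeholders; adjust to taste.
    import json
    import subprocess
    import sys

    SAMPLE_KEYS = ["probe-key-%d" % i for i in range(32)]  # arbitrary probe keys

    def endpoints_for(key, keyspace="ks1", column_family="cf1"):
        """Ask the cluster which nodes hold the replicas for one row key."""
        out = subprocess.check_output(
            ["nodetool", "getendpoints", keyspace, column_family, key])
        return sorted(out.decode().split())

    def snapshot():
        """Map each probe key to its current (sorted) replica set."""
        return dict((key, endpoints_for(key)) for key in SAMPLE_KEYS)

    if __name__ == "__main__":
        # Run once with the old snitch and once with the new one, then diff:
        #   python check_placement.py > before.json
        #   python check_placement.py > after.json
        #   diff before.json after.json    (empty diff == placement unchanged)
        json.dump(snapshot(), sys.stdout, indent=2, sort_keys=True)

If the diff is empty for a reasonable spread of keys, the snitch change was a no-op as far as placement is concerned; any difference puts you in the "repair, or dump and reload" territory Rob describes.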