From: Gerard Maas
To: user@cassandra.apache.org
Date: Wed, 12 Aug 2015 11:11:19 +0200
Subject: Re: Duplicating a cluster with different # of disks

Many thanks for confirming the procedure. I was doing the copy from 3 -> 2
as explained before. My doubt came from noticing that the total count
differed strongly between source and destination: 3M vs 150k rows. But
small test tables with a few hundred records all went well.

I double-checked the copy and the procedure was correct. It was a table we
had issues with in the past (a few very long rows). Maybe related to that?

Kr, Gerard

On Aug 6, 2015 11:00 PM, "Alain RODRIGUEZ" <arodrime@gmail.com> wrote:

> I agree with Jeff, those two solutions should work well to give you a
> distinct cluster (the data will be fixed in time, not synchronised).
>
> It really depends on you, but basically having hybrid data storage
> structures is not an issue at all in both cases, as it is something you
> can set in cassandra.yaml at the node level.
>
> C*heers,
>
> Alain
>
> 2015-08-06 22:41 GMT+02:00 Jeff Jirsa <Jeff.Jirsa@crowdstrike.com>:
>
>> You can copy all of the sstables into any given data directory without
>> issue (keep them within the keyspace/table directories, but the
>> mnt/mnt2/mnt3 location is irrelevant).
>>
>> You can also stream them in via sstableloader if your ring topology has
>> changed (especially if tokens have moved).
>>
>> From: Gerard Maas
>> Reply-To: "user@cassandra.apache.org"
>> Date: Thursday, August 6, 2015 at 9:50 AM
>> To: "user@cassandra.apache.org"
>> Subject: Duplicating a cluster with different # of disks
>>
>> Hi,
>>
>> I'm currently trying to duplicate a given keyspace on a new cluster to
>> run some analytics on it.
>>
>> My source cluster has 3 disks and corresponding data directories (mnt,
>> mnt2, mnt3), but the machines in my target cluster only have 2 disks
>> (mnt, mnt2).
>>
>> What would be the correct procedure to copy the sstable data from
>> source to destination in this case?
>>
>> -kr, Gerard.
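The 3-disk-to-2-disk copy Jeff describes can be sketched locally. Everything below is illustrative, not taken from the thread: keyspace `ks1`, table `t1`, the `mnt*` directory names, and the 2.1-era sstable file names are stand-ins; on a real node the directories come from `data_file_directories` in cassandra.yaml. Note the three files here carry distinct generation numbers (1, 2, 3); sstables from different source disks with colliding generations would need renaming before being merged into one directory.

```shell
# Local sketch (assumed names): merge sstables from a 3-directory layout
# into one directory of a 2-directory layout. Only the keyspace/table
# subdirectory structure matters; which mnt* a file lands on does not.
set -e
work=$(mktemp -d)

# Three "source disks", each holding part of ks1.t1.
mkdir -p "$work/src/mnt/ks1/t1" "$work/src/mnt2/ks1/t1" "$work/src/mnt3/ks1/t1"
touch "$work/src/mnt/ks1/t1/ks1-t1-ka-1-Data.db"
touch "$work/src/mnt2/ks1/t1/ks1-t1-ka-2-Data.db"
touch "$work/src/mnt3/ks1/t1/ks1-t1-ka-3-Data.db"

# Two "target disks". All source sstables can go into a single one of
# them, as long as the ks1/t1 directory layout is preserved.
mkdir -p "$work/dst/mnt/ks1/t1" "$work/dst/mnt2/ks1/t1"
for d in mnt mnt2 mnt3; do
  cp "$work/src/$d/ks1/t1/"*-Data.db "$work/dst/mnt/ks1/t1/"
done

ls "$work/dst/mnt/ks1/t1"
# On the real target node, finish with:  nodetool refresh ks1 t1
```

If the target ring's topology differs from the source's (tokens moved, different node count), `sstableloader` is the safer route, since it streams each row to whichever nodes now own it instead of assuming the file placement is still valid.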