From: Aaron Morton <aaron@thelastpickle.com>
To: user@cassandra.apache.org
Subject: Re: how to take consistent snapshot?
Date: Fri, 7 Dec 2012 16:34:30 +1300

For background: http://wiki.apache.org/cassandra/Operations?highlight=%28snapshot%29#Consistent_backups

If you snapshot a single node, then yes, there is a chance of inconsistency across CFs.

If you have multiple nodes, the snapshots you take on the later nodes will help. If you use CL QUORUM for reads you *may* be ok (I cannot work it out quickly). If you use CL ALL for reads you will be ok. Or you can use nodetool repair to ensure the data is consistent.

I doubt that even using repair would give you a provable guarantee, though. Anyone?

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 6/12/2012, at 7:56 AM, Andrey Ilinykh <ailinykh@gmail.com> wrote:

> Hello, everybody!
> I have a production cluster with incremental backup on, and I want to clone it (create a test one). I don't understand one thing: each column family gets flushed (and copied to backup storage) independently, which means the total snapshot is inconsistent. If I restore from such a snapshot, I have a totally useless system. To be more specific, let's say I have two CFs, one serving as an index for the other. Every time I update one CF I update the index CF. There is a good chance that all replicas flush the index CF first. Then I move it into backup storage, restore, and get a CF which has pointers to non-existent data in the other CF. What is the way to avoid this situation?
>
> Thank you,
>   Andrey
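[Editor's note: the flush-then-snapshot approach the thread discusses can be sketched roughly as below. This is a dry run only: the host names are placeholders, the loop just prints the `ssh`/`nodetool` commands it would issue, and flushing first merely narrows (does not eliminate) the window in which CFs diverge.]

```shell
# Hypothetical cluster nodes -- replace with your own.
NODES="cass1 cass2 cass3"
# A common tag so each node's snapshot can be matched up later.
TAG="pre-clone-snap"

for node in $NODES; do
  # Flush memtables so recent writes for every CF reach SSTables on disk,
  # then snapshot all keyspaces on that node under the shared tag.
  echo "ssh $node nodetool flush"
  echo "ssh $node nodetool snapshot -t $TAG"
done
```

Remove the `echo`s to actually run the commands. Even then, as noted above, snapshots taken at slightly different times on different replicas are only eventually reconcilable via read CL or `nodetool repair`.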