Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of edlinuxguru@gmail.com
 designates 209.85.210.172 as permitted sender)
MIME-Version: 1.0
In-Reply-To: <BANLkTik4HynV=GnzMi3n8OAa20HFSzej5g@mail.gmail.com>
References: <BANLkTik4HynV=GnzMi3n8OAa20HFSzej5g@mail.gmail.com>
Date: Thu, 30 Jun 2011 16:47:16 -0400
Message-ID: <BANLkTindb2CrWaQeRp9NFGnu9FZk2-svug@mail.gmail.com>
Subject: Re: Meaning of 'nodetool repair has to run within GCGraceSeconds'
From: Edward Capriolo <edlinuxguru@gmail.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=485b397dd1039e911b04a6f4010b

--485b397dd1039e911b04a6f4010b
Content-Type: text/plain; charset=ISO-8859-1

On Thu, Jun 30, 2011 at 4:25 PM, A J <s5alye@gmail.com> wrote:

> I am a little confused about why nodetool repair has to run
> within GCGraceSeconds.
>
> The documentation at:
> http://wiki.apache.org/cassandra/Operations#Frequency_of_nodetool_repair
> is not very clear to me.
>
> How can a delete be 'unforgotten' if I don't run nodetool repair? (I
> understand that if a node is down for more than GCGraceSeconds, I
> should not bring it up without resynching it completely. Otherwise
> deletes may reappear. http://wiki.apache.org/cassandra/DistributedDeletes
> )
> But I am not sure how exactly nodetool repair ties into this mechanism of
> distributed deletes.
>
> Thanks for any clarifications.
>

Read repair does NOT repair tombstones. Failed writes/tombstones that hit a
TimedOutException do not get hinted even if hinted handoff (HH) is on:
https://issues.apache.org/jira/browse/CASSANDRA-2034. Thus tombstones can
get lost.

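Because of this, the only way to recover lost tombstones is anti-entropy
repair (nodetool repair). If you do not repair within GCGraceSeconds, the
replicas that did get the tombstone can purge it at compaction while the
replica that missed it still holds the old data. That row can then be read
repaired back onto the other replicas and resurrected.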
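
To make that sequence concrete, here is a toy Python model of three replicas.
It is only a sketch under stated assumptions: Replica, anti_entropy_repair,
read_repair, scenario and GC_GRACE are names made up for this example, not
Cassandra APIs, and the model follows the simplified description above (read
repair only pushes live data, anti-entropy repair also pushes tombstones).

```python
# Toy model only; none of these names exist in Cassandra.
GC_GRACE = 10  # stand-in for GCGraceSeconds, in arbitrary time units


class Replica:
    def __init__(self, name):
        self.name = name
        self.cells = {}  # key -> (timestamp, value); value None is a tombstone

    def write(self, key, ts, value):
        # Last write wins: only accept a cell newer than the one we hold.
        current = self.cells.get(key)
        if current is None or ts > current[0]:
            self.cells[key] = (ts, value)

    def delete(self, key, ts):
        self.write(key, ts, None)

    def compact(self, now):
        # Purge tombstones that are older than GC_GRACE.
        for key, (ts, value) in list(self.cells.items()):
            if value is None and now - ts > GC_GRACE:
                del self.cells[key]

    def read(self, key):
        cell = self.cells.get(key)
        return None if cell is None else cell[1]


def anti_entropy_repair(replicas, key):
    # Push the newest cell of any kind, tombstones included, to every replica.
    cells = [r.cells[key] for r in replicas if key in r.cells]
    if cells:
        ts, value = max(cells, key=lambda cell: cell[0])
        for r in replicas:
            r.write(key, ts, value)


def read_repair(replicas, key):
    # Push only the newest live cell; tombstones are not propagated here.
    live = [r.cells[key] for r in replicas
            if key in r.cells and r.cells[key][1] is not None]
    if live:
        ts, value = max(live, key=lambda cell: cell[0])
        for r in replicas:
            r.write(key, ts, value)


def scenario(repair_in_time):
    a, b, c = Replica("a"), Replica("b"), Replica("c")
    replicas = [a, b, c]
    for r in replicas:
        r.write("row", 1, "old data")
    # The delete at t=2 times out on c and no hint is stored for it.
    a.delete("row", 2)
    b.delete("row", 2)
    if repair_in_time:
        anti_entropy_repair(replicas, "row")  # c receives the tombstone
    # Past the grace period, compaction drops tombstones on every replica.
    for r in replicas:
        r.compact(now=2 + GC_GRACE + 1)
    read_repair(replicas, "row")
    return [r.read("row") for r in replicas]
```

In this model scenario(True) leaves the row deleted on all three replicas,
while scenario(False) brings "old data" back on all of them: c never saw the
tombstone, a and b purged theirs, and read repair copied the old row back.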

In our case we are lucky: we delete rows when they get old and stale. While
it is not great if a deleted row reappears, it is not harmful, so I can live
with less repairing than most.

--485b397dd1039e911b04a6f4010b--