Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy)
From: aaron morton <aaron@thelastpickle.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_8099798C-C8DA-47DF-8FEE-CEF6A924EB3C"
Message-Id: <6D6E54DA-C479-4182-91F9-73861769CD20@thelastpickle.com>
Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\))
Subject: Re: Repair does not fix inconsistency
Date: Thu, 4 Apr 2013 07:05:41 +0530
References: <515C18C1.4000103@opera.com>
To: user@cassandra.apache.org
In-Reply-To: <515C18C1.4000103@opera.com>


--Apple-Mail=_8099798C-C8DA-47DF-8FEE-CEF6A924EB3C
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=us-ascii

What version are you on ?=20

Can you run a repair on the CF and check:

Does the repair detect differences in the CF and stream changes ?=20
After the streaming does it run a secondary index rebuild on the new =
sstable ? (Should be in the logs)

Can you provide the full query trace ?=20

Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 3/04/2013, at 5:25 PM, Michal Michalski <michalm@opera.com> wrote:

> Hi,
>=20
> TL;DR: I have inconsistend data (1 live row on node A & 1 tombstoned =
row on node B) that do not get fixed by repair. What can be a problem?
>=20
> Long version:
>=20
> I have a CF containing Users' info, which I sometimes query by key, =
and sometimes by indexed columns like email. I'm using RF=3D2. I write =
with CL.ONE, but  this CF is very rarely updated, so C* has a looot of =
time to fix inconsistencies that may occur, so I'm fine with this (at =
least in theory ;-) ).
>=20
> To be clear:
> - I've run a successfull cluster-wide repair on this CF before =
testing, so I do not expect any inconsistency
> - All indexes are built, I've rebuilt them manually before testing, so =
I expect them to work properly (I mention it because it seems to be =
somehow related to indexes, but I'm not sure - see below)
>=20
> The problem is:
>=20
> When I query (cqlsh) some rows by key (CL is default =3D ONE) I =
_always_ get a correct result.  However, when I query it by indexed =
column, it returns nothing.
>=20
> When tracing a query with CL.ALL in cqlsh, I get info that C* has:
>=20
> Read 0 live cells and 1 tombstoned       // for first replica node
> Read 1 live cells and 0 tombstoned       // for second replica node
>=20
> When CL is ONE it's never asking second replica for data (possibly due =
to DynamicSnitch scores or so), so it returns nothing.
>=20
> Switching to CL >=3D TWO obviously fixes this problem for us, but it's =
not the solution I'd like to use as I'd rather rely on fast read/write =
requests with CL.ONE + frequent repairs, allowing some short-term =
inconsistency.
>=20
> Any ideas why it may happen that data are still inconsistent after =
repair? Is there something I could have missed?
>=20
> I'm mainly surprised that repair does not fix this inconsistency in =
ANY way - either by pulling missing data to first replica _OR_ =
tombstoning it on second replica. First one would be correct (delete was =
made a long time ago and then the row reappeared), but both could make =
sense, as both will make the data consistent. In this state it's =
definitely inconsistent and I don't understand it :-)
>=20
>=20
> M.


--Apple-Mail=_8099798C-C8DA-47DF-8FEE-CEF6A924EB3C
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=us-ascii

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Dus-ascii"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">What =
version are you on ?&nbsp;<div><br></div><div>Can you run a repair on =
the CF and check:</div><div><br></div><div>Does the repair detect =
differences in the CF and stream changes ?&nbsp;</div><div>After the =
streaming does it run a secondary index rebuild on the new sstable ? =
(Should be in the logs)</div><div><br></div><div>Can you provide the =
full query trace =
?&nbsp;</div><div><br></div><div>Cheers</div><div><br><div =
apple-content-edited=3D"true">
<div style=3D"color: rgb(0, 0, 0); font-family: Helvetica; font-size: =
medium; font-style: normal; font-variant: normal; font-weight: normal; =
letter-spacing: normal; line-height: normal; orphans: 2; text-align: =
-webkit-auto; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-text-size-adjust: auto; =
-webkit-text-stroke-width: 0px; word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><div =
style=3D"color: rgb(0, 0, 0); font-family: Helvetica; font-size: medium; =
font-style: normal; font-variant: normal; font-weight: normal; =
letter-spacing: normal; line-height: normal; orphans: 2; text-align: =
-webkit-auto; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-text-size-adjust: auto; =
-webkit-text-stroke-width: 0px; word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-align: -webkit-auto; text-indent: 0px; =
text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; =
border-spacing: 0px; -webkit-text-decorations-in-effect: none; =
-webkit-text-size-adjust: auto; -webkit-text-stroke-width: 0px; =
font-size: medium; "><div style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; border-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Cassandra Consultant</div><div>New =
Zealand</div><div><br></div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com">http://www.thelastpickle.com</a></di=
v></div></span></div></span></div></span></div></span></div></div>
</div>

<br><div><div>On 3/04/2013, at 5:25 PM, Michal Michalski &lt;<a =
href=3D"mailto:michalm@opera.com">michalm@opera.com</a>&gt; =
wrote:</div><br class=3D"Apple-interchange-newline"><blockquote =
type=3D"cite">Hi,<br><br>TL;DR: I have inconsistend data (1 live row on =
node A &amp; 1 tombstoned row on node B) that do not get fixed by =
repair. What can be a problem?<br><br>Long version:<br><br>I have a CF =
containing Users' info, which I sometimes query by key, and sometimes by =
indexed columns like email. I'm using RF=3D2. I write with CL.ONE, but =
&nbsp;this CF is very rarely updated, so C* has a looot of time to fix =
inconsistencies that may occur, so I'm fine with this (at least in =
theory ;-) ).<br><br>To be clear:<br>- I've run a successfull =
cluster-wide repair on this CF before testing, so I do not expect any =
inconsistency<br>- All indexes are built, I've rebuilt them manually =
before testing, so I expect them to work properly (I mention it because =
it seems to be somehow related to indexes, but I'm not sure - see =
below)<br><br>The problem is:<br><br>When I query (cqlsh) some rows by =
key (CL is default =3D ONE) I _always_ get a correct result. =
&nbsp;However, when I query it by indexed column, it returns =
nothing.<br><br>When tracing a query with CL.ALL in cqlsh, I get info =
that C* has:<br><br>Read 0 live cells and 1 tombstoned =
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;// for first replica node<br>Read 1 =
live cells and 0 tombstoned &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;// for =
second replica node<br><br>When CL is ONE it's never asking second =
replica for data (possibly due to DynamicSnitch scores or so), so it =
returns nothing.<br><br>Switching to CL &gt;=3D TWO obviously fixes this =
problem for us, but it's not the solution I'd like to use as I'd rather =
rely on fast read/write requests with CL.ONE + frequent repairs, =
allowing some short-term inconsistency.<br><br>Any ideas why it may =
happen that data are still inconsistent after repair? Is there something =
I could have missed?<br><br>I'm mainly surprised that repair does not =
fix this inconsistency in ANY way - either by pulling missing data to =
first replica _OR_ tombstoning it on second replica. First one would be =
correct (delete was made a long time ago and then the row reappeared), =
but both could make sense, as both will make the data consistent. In =
this state it's definitely inconsistent and I don't understand it =
:-)<br><br><br>M.<br></blockquote></div><br></div></body></html>=

--Apple-Mail=_8099798C-C8DA-47DF-8FEE-CEF6A924EB3C--