Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy)
From: aaron morton <aaron@thelastpickle.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_8B0B18DE-5CD4-4171-A5C1-09979689C077"
Message-Id: <0DFF4995-9DF6-44F1-9242-B663F67BE3E5@thelastpickle.com>
Mime-Version: 1.0 (Mac OS X Mail 6.1 \(1498\))
Subject: Re: Secondary index loss on node restart
Date: Tue, 25 Sep 2012 13:06:19 +1200
References: <5463B3F0-D46D-45BD-9A43-92E790E549C8@yahoo.com>
To: user@cassandra.apache.org
In-Reply-To: <5463B3F0-D46D-45BD-9A43-92E790E549C8@yahoo.com>


--Apple-Mail=_8B0B18DE-5CD4-4171-A5C1-09979689C077
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=us-ascii

Can you contribute your experience to this ticket =
https://issues.apache.org/jira/browse/CASSANDRA-4670 ?=20

Thanks


-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 24/09/2012, at 6:22 AM, Michael Theroux <mtheroux2@yahoo.com> wrote:

> Hello,
>=20
> We have been noticing an issue where, about 50% of the time in which a =
node fails or is restarted, secondary indexes appear to be partially =
lost or corrupted.  A drop and re-add of the index appears to correct =
the issue.  There are no errors in the cassandra logs that I see.  Part =
of the index seems to be simply missing.  Sometimes this corruption/loss =
doesn't happen immediately, but sometime after the node is restarted.  =
In addition, the index never appears to have an issue when the node =
comes down, it is only after the node comes back up and recovers in =
which we experience an issue.
>=20
> We developed some code that goes through all the rows in the table, by =
key, in which the index is present.  It then attempts to look up the =
information via secondary index, in an attempt to detect when the issue =
occurs.  Another odd observation is that the number of members present =
in the index when we have the issue varies up and down (the index and =
the tables don't change that often).
>=20
> We are running a 6 node Cassandra cluster with a replication factor of =
3, consistency level for all queries is LOCAL_QUORUM.  We are running =
Cassandra 1.1.2.
>=20
> Anyone have any insights?
>=20
> -Mike


--Apple-Mail=_8B0B18DE-5CD4-4171-A5C1-09979689C077
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=us-ascii

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Dus-ascii"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">Can =
you contribute your experience to this ticket&nbsp;<a =
href=3D"https://issues.apache.org/jira/browse/CASSANDRA-4670">https://issu=
es.apache.org/jira/browse/CASSANDRA-4670</a>&nbsp;?&nbsp;<div><br></div><d=
iv>Thanks</div><div><br></div><div><br><div><div =
apple-content-edited=3D"true">
<span class=3D"Apple-style-span" style=3D"border-collapse: separate; =
color: rgb(0, 0, 0); font-family: Helvetica; font-style: normal; =
font-variant: normal; font-weight: normal; letter-spacing: normal; =
line-height: normal; orphans: 2; text-align: -webkit-auto; text-indent: =
0px; text-transform: none; white-space: normal; widows: 2; word-spacing: =
0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant: =
normal; font-weight: normal; letter-spacing: normal; line-height: =
normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: =
0px; -webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; color: rgb(0, 0, 0); font-family: =
Helvetica; font-style: normal; font-variant: normal; font-weight: =
normal; letter-spacing: normal; line-height: normal; orphans: 2; =
text-indent: 0px; text-transform: none; white-space: normal; widows: 2; =
word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Developer</div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com">http://www.thelastpickle.com</a></di=
v></div></div></span></div></span></div></span></span>
</div>

<br><div><div>On 24/09/2012, at 6:22 AM, Michael Theroux &lt;<a =
href=3D"mailto:mtheroux2@yahoo.com">mtheroux2@yahoo.com</a>&gt; =
wrote:</div><br class=3D"Apple-interchange-newline"><blockquote =
type=3D"cite">Hello,<br><br>We have been noticing an issue where, about =
50% of the time in which a node fails or is restarted, secondary indexes =
appear to be partially lost or corrupted. &nbsp;A drop and re-add of the =
index appears to correct the issue. &nbsp;There are no errors in the =
cassandra logs that I see. &nbsp;Part of the index seems to be simply =
missing. &nbsp;Sometimes this corruption/loss doesn't happen =
immediately, but sometime after the node is restarted. &nbsp;In =
addition, the index never appears to have an issue when the node comes =
down, it is only after the node comes back up and recovers in which we =
experience an issue.<br><br>We developed some code that goes through all =
the rows in the table, by key, in which the index is present. &nbsp;It =
then attempts to look up the information via secondary index, in an =
attempt to detect when the issue occurs. &nbsp;Another odd observation =
is that the number of members present in the index when we have the =
issue varies up and down (the index and the tables don't change that =
often).<br><br>We are running a 6 node Cassandra cluster with a =
replication factor of 3, consistency level for all queries is =
LOCAL_QUORUM. &nbsp;We are running Cassandra 1.1.2.<br><br>Anyone have =
any =
insights?<br><br>-Mike</blockquote></div><br></div></div></body></html>=

--Apple-Mail=_8B0B18DE-5CD4-4171-A5C1-09979689C077--