Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of brian.jeltema@digitalenvoy.net
 designates 64.88.168.14 as permitted sender)
From: Brian Jeltema <brian.jeltema@digitalenvoy.net>
Mime-Version: 1.0 (Apple Message framework v1278)
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_CE76A4C0-581A-4F0D-B876-1049C35C5EF6"
Subject: Re: inconsistent hadoop/cassandra results
Date: Wed, 9 Jan 2013 07:24:16 -0500
In-Reply-To: <6A5E739F-29B0-48DD-B518-E12245752206@thelastpickle.com>
To: user@cassandra.apache.org
References: <EC3BF709-F0B6-46BA-AE40-FB8CD24CB8FB@digitalenvoy.net>
 <6A5E739F-29B0-48DD-B518-E12245752206@thelastpickle.com>
Message-Id: <51567E23-C212-481E-9A2B-0E5B7966C213@digitalenvoy.net>


--Apple-Mail=_CE76A4C0-581A-4F0D-B876-1049C35C5EF6
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=us-ascii

Sorry if this is a duplicate - I was having mailer problems last night:

> Assuming their were no further writes, running repair or using CL all =
should have fixed it.=20
>=20
> Can you describe the inconsistency between runs?=20

Sure. The job output is generated by a single reducer and consists of a =
list of
key/value pairs where the key is the row key of the original table, and =
the value is
the total count of all columns in the row. Each run produces a file with =
a different
size, and running a diff against various output file pairs displays rows =
that only
appear in one file, or rows with the same key but different counts.=20

What seems particularly hard to explain is the behavior after setting CL =
to ALL,
where the results eventually become reproducible (making it hard to =
place the
blame on my trivial mapper/reducer implementations) but only after about =
half a=20
dozen runs. And once reaching this state, setting CL to QUORUM results =
in=20
additional inconsistent results.

I can say with certainty that there were no other writes. I'm the sole =
developer working
with the CF in question. I haven't seen behavior like this before, =
though I don't have
a tremendous amount of experience. But this is the first time I've tried =
to use the
wide-row support, which makes me a little suspicious. The wide-row =
support is not
very well documented, so maybe I'm doing something wrong there in =
ignorance.

Brian

>=20
> Cheers
>=20
> -----------------
> Aaron Morton
> Freelance Cassandra Developer
> New Zealand
>=20
> @aaronmorton
> http://www.thelastpickle.com
>=20
> On 8/01/2013, at 2:16 AM, Brian Jeltema =
<brian.jeltema@digitalenvoy.net> wrote:
>=20
>> I need some help understanding unexpected behavior I saw in some =
recent experiments with Cassandra 1.1.5 and Hadoop 1.0.3:
>>=20
>> I've written a small map/reduce job that simply counts the number of =
columns in each row of a static CF (call it Foo)=20
>> and generates a list of every row and column count. A relatively =
small fraction of the rows have a large number
>> of columns; worst case is approximately 36 million. So when I set up =
the job, I used wide-row support:
>>=20
>>     ConfigHelper.setInputColumnFamily(job.getConfiguration(), =
"fooKS", "Foo", WIDE_ROWS); // where WIDE_ROWS =3D=3D true
>>=20
>> When I ran this job using the default CL (1) I noticed that the =
results varied from run to run, which I attributed to inconsistent
>> replicas, since Foo was generated with CL =3D=3D 1 and the RF =3D=3D =
3.=20
>>=20
>> So I ran repair for that CF on every node. The cassandra log on every =
node contains lines similar to:
>>=20
>>   INFO [AntiEntropyStage:1] 2013-01-05 20:38:48,605 =
AntiEntropyService.java (line 778) [repair =
#e4a1d7f0-579d-11e2-0000-d64e0a75e6df] Foo is fully synced
>>=20
>> However, repeated runs were still inconsistent. Then I set CL to ALL, =
which I presumed would always result in identical
>> output, but repeated runs initially continued to be inconsistent. =
However, I noticed that the results seemed to
>> be converging, and after several runs (somewhere between 4 and 6) I =
finally was producing identical results on every run.
>> Then I set CL to QUORUM, and again generated inconsistent results.
>>=20
>> Does this behavior make sense?
>>=20
>> Brian
>=20


--Apple-Mail=_CE76A4C0-581A-4F0D-B876-1049C35C5EF6
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=us-ascii

<html><head></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; =
"><div><div>Sorry if this is a duplicate - I was having mailer problems =
last night:</div><div><br></div><div></div><blockquote type=3D"cite"><meta=
 http-equiv=3D"Content-Type" content=3D"text/html charset=3Dus-ascii"><div=
 style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><div>Assuming their were no =
further writes, running repair or using CL all should have fixed =
it.&nbsp;</div><div><br></div>Can you describe the inconsistency between =
runs?&nbsp;</div></blockquote><div><br></div></div><div><div>Sure. The =
job output is generated by a single reducer and consists of a list =
of</div><div>key/value pairs where the key is the row key of the =
original table, and the value is</div><div>the total count of all =
columns in the row. Each run produces a file with a =
different</div><div>size, and running a diff against various output file =
pairs displays rows that only</div><div>appear in one file, or rows with =
the same key but different counts.&nbsp;</div><div><br></div><div>What =
seems particularly hard to explain is the behavior after setting CL to =
ALL,</div><div>where the results eventually become reproducible (making =
it hard to place the</div><div>blame on my trivial mapper/reducer =
implementations) but only after about half a&nbsp;</div><div>dozen runs. =
And once reaching this state, setting CL to QUORUM results =
in&nbsp;</div><div>additional inconsistent =
results.</div><div><br></div><div>I can say with certainty that there =
were no other writes. I'm the sole developer working</div><div>with the =
CF in question. I haven't seen behavior like this before, though I don't =
have</div><div>a tremendous amount of experience. But this is the first =
time I've tried to use the</div><div>wide-row support, which makes me a =
little suspicious. The wide-row support is not</div><div>very well =
documented, so maybe I'm doing something wrong there in =
ignorance.</div><div><br></div><div>Brian</div><div><br></div><div><blockq=
uote type=3D"cite"><div style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; =
"></div></blockquote></div><blockquote type=3D"cite"><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div><br></div><div>Cheers</div><div><br><div =
apple-content-edited=3D"true">
<div style=3D"color: rgb(0, 0, 0); font-family: Helvetica; font-size: =
medium; font-style: normal; font-variant: normal; font-weight: normal; =
letter-spacing: normal; line-height: normal; orphans: 2; text-align: =
-webkit-auto; text-indent: 0px; text-transform: none; white-space: =
normal; widows: 2; word-spacing: 0px; -webkit-text-size-adjust: auto; =
-webkit-text-stroke-width: 0px; word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; =
border-spacing: 0px; "><div style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; =
font-family: Helvetica; font-style: normal; font-variant: normal; =
font-weight: normal; letter-spacing: normal; line-height: normal; =
orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; =
widows: 2; word-spacing: 0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; font-family: Helvetica; font-style: =
normal; font-variant: normal; font-weight: normal; letter-spacing: =
normal; line-height: normal; orphans: 2; text-indent: 0px; =
text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; =
-webkit-border-horizontal-spacing: 0px; -webkit-border-vertical-spacing: =
0px; -webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><span class=3D"Apple-style-span" =
style=3D"border-collapse: separate; font-family: Helvetica; font-style: =
normal; font-variant: normal; font-weight: normal; letter-spacing: =
normal; line-height: normal; orphans: 2; text-indent: 0px; =
text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; =
-webkit-border-horizontal-spacing: 0px; -webkit-border-vertical-spacing: =
0px; -webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; font-size: medium; "><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; =
"><div>-----------------</div><div>Aaron Morton</div><div>Freelance =
Cassandra Developer</div><div>New =
Zealand</div><div><br></div><div>@aaronmorton</div><div><a =
href=3D"http://www.thelastpickle.com/">http://www.thelastpickle.com</a></d=
iv></div></span></div></span></div></span></div></span></div>
</div>

<br><div><div>On 8/01/2013, at 2:16 AM, Brian Jeltema &lt;<a =
href=3D"mailto:brian.jeltema@digitalenvoy.net">brian.jeltema@digitalenvoy.=
net</a>&gt; wrote:</div><br =
class=3D"Apple-interchange-newline"><blockquote type=3D"cite"><div =
style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><div>I need some help =
understanding unexpected behavior I saw in some recent experiments with =
Cassandra 1.1.5 and Hadoop 1.0.3:</div><div><br></div>I've written a =
small map/reduce job that simply counts the number of columns in each =
row of a static CF (call it Foo)&nbsp;<div>and generates a list of every =
row and column count. A relatively small fraction of the rows have a =
large number</div><div>of columns; worst case is approximately 36 =
million. So when I set up the job, I used wide-row =
support:</div><div><br></div><div>&nbsp; &nbsp;&nbsp;<span =
class=3D"Apple-style-span" style=3D"font-family: Monaco; font-size: =
11px; ">ConfigHelper.setInputColumnFamily(job.getConfiguration(), =
</span><span class=3D"Apple-style-span" style=3D"font-family: Monaco; =
font-size: 11px; "><span style=3D"color: =
#3c41fc">"fooKS"</span></span><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; ">, </span><span =
class=3D"Apple-style-span" style=3D"font-family: Monaco; font-size: =
11px; "><span style=3D"color: #0d31c9">"Foo"</span></span><span =
class=3D"Apple-style-span" style=3D"font-family: Monaco; font-size: =
11px; ">, WIDE_ROWS); // where WIDE_ROWS =3D=3D =
true</span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; =
"><br></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; ">When I ran this job using the default CL (1) I noticed that =
the results varied from run to run, which I attributed to =
inconsistent</span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; ">replicas, since Foo was generated with CL =3D=3D 1 and the RF =
=3D=3D 3.&nbsp;</span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; "><br></span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; ">So I ran repair for that CF on every node. The cassandra log =
on every node contains lines similar to:</span></span></div><div><span =
class=3D"Apple-style-span" style=3D"font-family: Monaco; font-size: =
11px; "><span class=3D"Apple-style-span" style=3D"font-family: =
Helvetica; font-size: medium; "><br></span></span></div><div><span =
class=3D"Apple-style-span" style=3D"font-family: Monaco; font-size: =
11px; "><span class=3D"Apple-style-span" style=3D"font-family: =
Helvetica; font-size: medium; ">&nbsp; INFO [AntiEntropyStage:1] =
2013-01-05 20:38:48,605 AntiEntropyService.java (line 778) [repair =
#e4a1d7f0-579d-11e2-0000-d64e0a75e6df] Foo is fully =
synced</span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; "><br></span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; ">However, repeated runs were still inconsistent. Then I set CL =
to ALL, which I presumed would always result in =
identical</span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; ">output, but repeated runs initially continued to be =
inconsistent. However, I noticed that the results seemed =
to</span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; ">be converging, and after several runs (somewhere between 4 and =
6) I finally was producing identical results on every =
run.</span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; ">Then I set CL to QUORUM, and again generated inconsistent =
results.</span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; "><br></span></span></div><div><span class=3D"Apple-style-span" =
style=3D"font-family: Monaco; font-size: 11px; "><span =
class=3D"Apple-style-span" style=3D"font-family: Helvetica; font-size: =
medium; ">Does this behavior make sense?</span></span></div><div><span =
class=3D"Apple-style-span" style=3D"font-family: Monaco; font-size: =
11px; "><span class=3D"Apple-style-span" style=3D"font-family: =
Helvetica; font-size: medium; "><br></span></span></div><div><span =
class=3D"Apple-style-span" style=3D"font-family: Monaco; font-size: =
11px; "><span class=3D"Apple-style-span" style=3D"font-family: =
Helvetica; font-size: medium; =
">Brian</span></span></div></div></blockquote></div><br></div></div></bloc=
kquote></div><br></body></html>=

--Apple-Mail=_CE76A4C0-581A-4F0D-B876-1049C35C5EF6--