Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: neutral (athena.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <AANLkTinxMvK1mwuASXxAsr4-_f33X7SBxIDxVqga85sL@mail.gmail.com>
References: <AANLkTikBIaaJOHK5uw8NlN02DP9ikm60kpq9U6wbXGsI@mail.gmail.com>
	<58893209-1479-448C-956D-3F98B435206F@wimba.com>
	<AANLkTinxMvK1mwuASXxAsr4-_f33X7SBxIDxVqga85sL@mail.gmail.com>
Date: Fri, 18 Jun 2010 15:57:12 -0700
Message-ID: <AANLkTim8DWHqQgNKpjl5p9DWqU8zZG10j4R3y4zLzcMm@mail.gmail.com>
Subject: Re: Possible bug in Cassandra MapReduce
From: Corey Hulen <cj@earnstone.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=000e0cd728362096ae048955e0d8

--000e0cd728362096ae048955e0d8
Content-Type: text/plain; charset=ISO-8859-1

OK...I just verified on a clean EC2 small single instance box using
apache-cassandra-0.6.2-src.
 I'm pertty sure the Cassandra MapReduce functionality is broken.

If your MapReduce jobs are idempotent then you are OK, but if you are doing
things like word count (as in the supplied example) or key count you will
get double counts.

-Corey


On Fri, Jun 18, 2010 at 3:15 PM, Corey Hulen <cj@earnstone.com> wrote:

>
> I thought the same thing, but using the supplied contrib example I just
> delete the /var/lib/data dirs and commit log.
>
> -Corey
>
>
>
>
> On Fri, Jun 18, 2010 at 3:11 PM, Phil Stanhope <pstanhope@wimba.com>wrote:
>
>> "blow all the data away" ... how do you do that? What is the timestamp
>> precision that you are using when creating key/col or key/supercol/col
>> items?
>>
>> I have seen a fail to write a key when the timestamp is identical to the
>> previous timestamp of a deleted key/col. While I didn't examine the source
>> code, I'm certain that this is do to delete tombstones.
>>
>> I view this as a application error because I was attempting to do this
>> within the GCGraceSeconds time period. If I, however, stopped cassandra,
>> blew away data & commitlogs and restarted the write always succeeds (no
>> surprise there).
>>
>> I turned this behavior into a feature (of sorts). When this happens I
>> increment a formally non-zero portion of the timestamp (the last digit of
>> precision which was always zero) and use this as a counter to track how many
>> times a key/col was updated (max 9 for my purposes).
>>
>> -phil
>>
>> On Jun 18, 2010, at 5:49 PM, Corey Hulen wrote:
>>
>> >
>> > We are using MapReduce to periodical verify and rebuild our secondary
>> indexes along with counting total records.  We started to noticed double
>> counting of unique keys on single machine standalone tests. We were finally
>> able to reproduce the problem using the
>> apache-cassandra-0.6.2-src/contrib/word_count example and just re-running it
>> multiple times.  We are hoping someone can verify the bug.
>> >
>> > re-run the tests and the word count for /tmp/word_count3/part-r-00000
>> will be 1000 +~200  and will change if you blow the data away and re-run.
>>  Notice the setup script loops and only inserts 1000 records so we expect
>> count to be 1000.  Once the data is generated then re-running the setup
>> script and/or mapreduce doesn't change the number (still off).  The key is
>> to blow all the data away and start over which will cause it to change.
>> >
>> > Can someone please verify this behavior?
>> >
>> > -Corey
>>
>>
>

--000e0cd728362096ae048955e0d8
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div><br></div>OK...I just verified on a clean EC2 small single instance bo=
x using=A0<meta http-equiv=3D"content-type" content=3D"text/html; charset=
=3Dutf-8"><span class=3D"Apple-style-span" style=3D"font-family: arial, san=
s-serif; font-size: 13px; border-collapse: collapse; color: rgb(68, 68, 68)=
; ">apache-cassandra-0.6.2-src. =A0I&#39;m pertty sure the Cassandra MapRed=
uce functionality is=A0broken.</span><div>
<font class=3D"Apple-style-span" color=3D"#444444" face=3D"arial, sans-seri=
f"><span class=3D"Apple-style-span" style=3D"border-collapse: collapse;"><b=
r></span></font></div><div><font class=3D"Apple-style-span" color=3D"#44444=
4" face=3D"arial, sans-serif"><span class=3D"Apple-style-span" style=3D"bor=
der-collapse: collapse;">If your MapReduce jobs are idempotent then you are=
 OK, but if you are doing things like word count (as in the supplied exampl=
e) or key count you will get double counts.</span></font></div>
<div><font class=3D"Apple-style-span" color=3D"#444444" face=3D"arial, sans=
-serif"><span class=3D"Apple-style-span" style=3D"border-collapse: collapse=
;"><br></span></font></div><div><font class=3D"Apple-style-span" color=3D"#=
444444" face=3D"arial, sans-serif"><span class=3D"Apple-style-span" style=
=3D"border-collapse: collapse;">-Corey</span></font></div>
<meta http-equiv=3D"content-type" content=3D"text/html; charset=3Dutf-8"><d=
iv><font class=3D"Apple-style-span" color=3D"#444444" face=3D"arial, sans-s=
erif"><span class=3D"Apple-style-span" style=3D"border-collapse: collapse;"=
><br></span></font><br>
<div class=3D"gmail_quote">On Fri, Jun 18, 2010 at 3:15 PM, Corey Hulen <sp=
an dir=3D"ltr">&lt;<a href=3D"mailto:cj@earnstone.com">cj@earnstone.com</a>=
&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0=
 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
<div><br></div>I thought the same thing, but using the supplied contrib exa=
mple I just delete the /var/lib/data dirs and commit log.<div><br></div><di=
v><font color=3D"#888888">-Corey</font><div><div></div><div class=3D"h5"><b=
r>
<div><br></div><div><br><br><div class=3D"gmail_quote">On Fri, Jun 18, 2010=
 at 3:11 PM, Phil Stanhope <span dir=3D"ltr">&lt;<a href=3D"mailto:pstanhop=
e@wimba.com" target=3D"_blank">pstanhope@wimba.com</a>&gt;</span> wrote:<br=
>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">&quot;blow all the data away&quot; ... how d=
o you do that? What is the timestamp precision that you are using when crea=
ting key/col or key/supercol/col items?<br>


<br>
I have seen a fail to write a key when the timestamp is identical to the pr=
evious timestamp of a deleted key/col. While I didn&#39;t examine the sourc=
e code, I&#39;m certain that this is do to delete tombstones.<br>
<br>
I view this as a application error because I was attempting to do this with=
in the GCGraceSeconds time period. If I, however, stopped cassandra, blew a=
way data &amp; commitlogs and restarted the write always succeeds (no surpr=
ise there).<br>


<br>
I turned this behavior into a feature (of sorts). When this happens I incre=
ment a formally non-zero portion of the timestamp (the last digit of precis=
ion which was always zero) and use this as a counter to track how many time=
s a key/col was updated (max 9 for my purposes).<br>


<font color=3D"#888888"><br>
-phil<br>
</font><div><div></div><div><br>
On Jun 18, 2010, at 5:49 PM, Corey Hulen wrote:<br>
<br>
&gt;<br>
&gt; We are using MapReduce to periodical verify and rebuild our secondary =
indexes along with counting total records. =A0We started to noticed double =
counting of unique keys on single machine standalone tests. We were finally=
 able to reproduce the problem using the apache-cassandra-0.6.2-src/contrib=
/word_count example and just re-running it multiple times. =A0We are hoping=
 someone can verify the bug.<br>


&gt;<br>
&gt; re-run the tests and the word count for /tmp/word_count3/part-r-00000 =
will be 1000 +~200 =A0and will change if you blow the data away and re-run.=
 =A0Notice the setup script loops and only inserts 1000 records so we expec=
t count to be 1000. =A0Once the data is generated then re-running the setup=
 script and/or mapreduce doesn&#39;t change the number (still off). =A0The =
key is to blow all the data away and start over which will cause it to chan=
ge.<br>


&gt;<br>
&gt; Can someone please verify this behavior?<br>
&gt;<br>
&gt; -Corey<br>
<br>
</div></div></blockquote></div><br></div></div></div></div>
</blockquote></div><br></div>

--000e0cd728362096ae048955e0d8--