Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 15815 invoked from network); 18 Jun 2010 22:12:18 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 18 Jun 2010 22:12:18 -0000 Received: (qmail 50410 invoked by uid 500); 18 Jun 2010 22:12:17 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 50371 invoked by uid 500); 18 Jun 2010 22:12:17 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 50362 invoked by uid 99); 18 Jun 2010 22:12:16 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Jun 2010 22:12:16 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=10.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [216.82.249.19] (HELO mail137.messagelabs.com) (216.82.249.19) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 18 Jun 2010 22:12:07 +0000 X-VirusChecked: Checked X-Env-Sender: pstanhope@wimba.com X-Msg-Ref: server-4.tower-137.messagelabs.com!1276899104!84340998!1 X-StarScan-Version: 6.2.4; banners=-,-,- X-Originating-IP: [64.1.25.213] Received: (qmail 32645 invoked from network); 18 Jun 2010 22:11:45 -0000 Received: from 64.1.25.213.ptr.us.xo.net (HELO mail.wimba.com) (64.1.25.213) by server-4.tower-137.messagelabs.com with SMTP; 18 Jun 2010 22:11:45 -0000 Received: from localhost (localhost.localdomain [127.0.0.1]) by mail.wimba.com (Postfix) with ESMTP id C41BDB384BE for ; Fri, 18 Jun 2010 18:11:44 -0400 (EDT) Received: from mail.wimba.com ([127.0.0.1]) by localhost (mail.wimba.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id n7JdXtJ6mwOh for ; Fri, 18 Jun 2010 18:11:40 -0400 (EDT) Received: from 149.sub-75-223-124.myvzw.com (149.sub-75-223-124.myvzw.com [75.223.124.149]) by mail.wimba.com (Postfix) with ESMTP id 5486BB3828E for ; Fri, 18 Jun 2010 18:11:39 -0400 (EDT) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1078) Subject: Re: Possible bug in Cassandra MapReduce From: Phil Stanhope In-Reply-To: Date: Fri, 18 Jun 2010 18:11:36 -0400 Content-Transfer-Encoding: quoted-printable Message-Id: <58893209-1479-448C-956D-3F98B435206F@wimba.com> References: To: user@cassandra.apache.org X-Mailer: Apple Mail (2.1078) X-Virus-Checked: Checked by ClamAV on apache.org "blow all the data away" ... how do you do that? What is the timestamp = precision that you are using when creating key/col or key/supercol/col = items? I have seen a fail to write a key when the timestamp is identical to the = previous timestamp of a deleted key/col. While I didn't examine the = source code, I'm certain that this is do to delete tombstones. I view this as a application error because I was attempting to do this = within the GCGraceSeconds time period. If I, however, stopped cassandra, = blew away data & commitlogs and restarted the write always succeeds (no = surprise there). I turned this behavior into a feature (of sorts). When this happens I = increment a formally non-zero portion of the timestamp (the last digit = of precision which was always zero) and use this as a counter to track = how many times a key/col was updated (max 9 for my purposes). -phil On Jun 18, 2010, at 5:49 PM, Corey Hulen wrote: >=20 > We are using MapReduce to periodical verify and rebuild our secondary = indexes along with counting total records. We started to noticed double = counting of unique keys on single machine standalone tests. We were = finally able to reproduce the problem using the = apache-cassandra-0.6.2-src/contrib/word_count example and just = re-running it multiple times. We are hoping someone can verify the bug. >=20 > re-run the tests and the word count for /tmp/word_count3/part-r-00000 = will be 1000 +~200 and will change if you blow the data away and = re-run. Notice the setup script loops and only inserts 1000 records so = we expect count to be 1000. Once the data is generated then re-running = the setup script and/or mapreduce doesn't change the number (still off). = The key is to blow all the data away and start over which will cause it = to change. >=20 > Can someone please verify this behavior? >=20 > -Corey