Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 73441 invoked from network); 29 Apr 2010 04:52:37 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 29 Apr 2010 04:52:37 -0000 Received: (qmail 4326 invoked by uid 500); 29 Apr 2010 04:52:36 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 4271 invoked by uid 500); 29 Apr 2010 04:52:35 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 4263 invoked by uid 99); 29 Apr 2010 04:52:35 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Apr 2010 04:52:35 +0000 X-ASF-Spam-Status: No, hits=-1.1 required=10.0 tests=AWL,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jbellis@gmail.com designates 209.85.218.222 as permitted sender) Received: from [209.85.218.222] (HELO mail-bw0-f222.google.com) (209.85.218.222) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Apr 2010 04:52:31 +0000 Received: by bwz22 with SMTP id 22so14112809bwz.25 for ; Wed, 28 Apr 2010 21:52:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=pbqr7WqZklPscstzRJCLoDxLd3r4SvObqyFGrkTm/cc=; b=uohs4GMNDOyIt9G2jmCOi5s9J+zWJM7rleeoO+HgQMLJj7Q7JX6y3oHmy2hbW7DGNd pr+0qZEfGodtz8Y3p6EG7QzWmHN0XtlFmT7xtS+q0GSZlV2b4mJdRLQf6u3ywTFJ2Ekp UuHwWUOP8+CptAryVIKEDuzrE+up0+yGc+R8g= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; b=nFa4mVjSfem+3xD9iwvaEK5tFYShCtWFoChYdQdO+ViKDGA/td9ph5IdhS0kTLXA/F 03ZtvjbomYqnUXpHa8WTYluRW7vx9QhzuBKPaLW5VVs/YJYRnqkrxept24iS9hSIUqer M/NCtQtF3esW2zPmulR19sqB7dSNUP0S9wObc= Received: by 10.204.7.194 with SMTP id e2mr5395076bke.103.1272516729385; Wed, 28 Apr 2010 21:52:09 -0700 (PDT) MIME-Version: 1.0 Received: by 10.204.116.78 with HTTP; Wed, 28 Apr 2010 21:51:49 -0700 (PDT) In-Reply-To: References: From: Jonathan Ellis Date: Wed, 28 Apr 2010 23:51:49 -0500 Message-ID: Subject: Re: Cassandra reverting deletes? To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Good! :) Can you reproduce w/o map/reduce, with raw get_range_slices? On Wed, Apr 28, 2010 at 3:56 PM, Joost Ouwerkerk wro= te: > Yes! Reproduced on single-node cluster: > > 10/04/28 16:30:24 INFO mapred.JobClient: =A0 =A0 ROWS=3D274884 > 10/04/28 16:30:24 INFO mapred.JobClient: =A0 =A0 TOMBSTONES=3D951083 > > 10/04/28 16:42:49 INFO mapred.JobClient: =A0 =A0 ROWS=3D166580 > 10/04/28 16:42:49 INFO mapred.JobClient: =A0 =A0 TOMBSTONES=3D1059387 > > On Wed, Apr 28, 2010 at 10:43 AM, Jonathan Ellis wrot= e: >> It sounds like either there is a fairly obvious bug, or you're doing >> something wrong. :) >> >> Can you reproduce against a single node? >> >> On Tue, Apr 27, 2010 at 5:14 PM, Joost Ouwerkerk = wrote: >>> Update: I ran a test whereby I deleted ALL the rows in a column >>> family, using a consistency level of ALL. =A0To do this, I mapped the >>> ColumnFamily and called remove on each row id. =A0There were 1.5 millio= n >>> rows, so 1.5 million rows were deleted. >>> >>> I ran a counter job immediately after. =A0This job maps the same column >>> family and tests if any data is returned. =A0If not, it considers the >>> row a "tombstone". =A0If yes, it considers the row not deleted. =A0Belo= w >>> are the hadoop counters for those jobs. =A0Note the fluctuation in the >>> number of rows with data over time, and the increase in time to map >>> the column family after the destroy job. =A0No other clients were >>> accessing cassandra during this time. >>> >>> I'm thoroughly confused. >>> >>> Count: started 13:02:30 EDT, finished 13:11:33 EDT (9 minutes 2 seconds= ): >>> =A0 ROWS: =A0 =A0 =A0 =A01,542,479 >>> =A0 TOMBSTONES: =A069 >>> >>> Destroy: started 16:48:45 EDT, finished 17:07:36 EDT (18 minutes 50 sec= onds) >>> =A0 DESTROYED: =A01,542,548 >>> >>> Count: started 17:15:42 EDT, finished 17:31:03 EDT (15 minutes 21 secon= ds) >>> =A0 ROWS 876,464 >>> =A0 TOMBSTONES =A0 666,084 >>> >>> Count: started 17:31:32, finished 17:47:16 (15mins, 44 seconds) >>> =A0 ROWS 1,451,665 >>> =A0 TOMBSTONES =A0 90,883 >>> >>> Count: started 17:52:34, finished 18:10:28 (17mins, 53 seconds) >>> =A0 ROWS 1,425,644 >>> =A0 TOMBSTONES =A0 116,904 >>> >>> On Tue, Apr 27, 2010 at 5:37 PM, Joost Ouwerkerk = wrote: >>>> Clocks are in sync: >>>> >>>> cluster04:~/cassandra$ dsh -g development "date" >>>> Tue Apr 27 17:36:33 EDT 2010 >>>> Tue Apr 27 17:36:33 EDT 2010 >>>> Tue Apr 27 17:36:33 EDT 2010 >>>> Tue Apr 27 17:36:33 EDT 2010 >>>> Tue Apr 27 17:36:34 EDT 2010 >>>> Tue Apr 27 17:36:34 EDT 2010 >>>> Tue Apr 27 17:36:34 EDT 2010 >>>> Tue Apr 27 17:36:34 EDT 2010 >>>> Tue Apr 27 17:36:34 EDT 2010 >>>> Tue Apr 27 17:36:35 EDT 2010 >>>> Tue Apr 27 17:36:35 EDT 2010 >>>> Tue Apr 27 17:36:35 EDT 2010 >>>> >>>> On Tue, Apr 27, 2010 at 5:35 PM, Nathan McCall wrote: >>>>> Have you confirmed that your clocks are all synced in the cluster? >>>>> This may be the result of an unintentional read-repair occurring if >>>>> that were the case. >>>>> >>>>> -Nate >>>>> >>>>> On Tue, Apr 27, 2010 at 2:20 PM, Joost Ouwerkerk wrote: >>>>>> Hmm... Even after deleting with cl.ALL, I'm getting data back for so= me >>>>>> rows after having deleted them. =A0Which rows return data is >>>>>> inconsistent from one run of the job to the next. >>>>>> >>>>>> On Tue, Apr 27, 2010 at 1:44 PM, Joost Ouwerkerk wrote: >>>>>>> To check that rows are gone, I check that KeySlice.columns is empty= . =A0And as >>>>>>> I mentioned, immediately after the delete job, this returns the exp= ected >>>>>>> number. >>>>>>> Unfortunately I reproduced with QUORUM this morning. =A0No node out= ages. =A0I am >>>>>>> going to try ALL to see if that changes anything, but I am starting= to >>>>>>> wonder if I'm doing something else wrong. >>>>>>> On Mon, Apr 26, 2010 at 9:45 PM, Jonathan Ellis = wrote: >>>>>>>> >>>>>>>> How are you checking that the rows are gone? >>>>>>>> >>>>>>>> Are you experiencing node outages during this? >>>>>>>> >>>>>>>> DC_QUORUM is unfinished code right now, you should avoid using it. >>>>>>>> Can you reproduce with normal QUORUM? >>>>>>>> >>>>>>>> On Sat, Apr 24, 2010 at 12:23 PM, Joost Ouwerkerk >>>>>>>> wrote: >>>>>>>> > I'm having trouble deleting rows in Cassandra.=A0 After running = a job that >>>>>>>> > deletes hundreds of rows, I run another job that verifies that t= he rows >>>>>>>> > are >>>>>>>> > gone.=A0 Both jobs run correctly.=A0 However, when I run the ver= ification >>>>>>>> > job an >>>>>>>> > hour later, the rows have re-appeared.=A0 This is not a case of = "ghosting" >>>>>>>> > because the verification job actually checks that there is data = in the >>>>>>>> > columns. >>>>>>>> > >>>>>>>> > I am running a cluster with 12 nodes and a replication factor of= 3.=A0 I >>>>>>>> > am >>>>>>>> > using DC_QUORUM consistency when deleting. >>>>>>>> > >>>>>>>> > Any ideas? >>>>>>>> > Joost. >>>>>>>> > >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Jonathan Ellis >>>>>>>> Project Chair, Apache Cassandra >>>>>>>> co-founder of Riptano, the source for professional Cassandra suppo= rt >>>>>>>> http://riptano.com >>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> >> >> >> -- >> Jonathan Ellis >> Project Chair, Apache Cassandra >> co-founder of Riptano, the source for professional Cassandra support >> http://riptano.com >> > --=20 Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com