Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3EB0D90DA for ; Mon, 2 Apr 2012 07:49:00 +0000 (UTC) Received: (qmail 47659 invoked by uid 500); 2 Apr 2012 07:48:58 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 47504 invoked by uid 500); 2 Apr 2012 07:48:56 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 47493 invoked by uid 99); 2 Apr 2012 07:48:56 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Apr 2012 07:48:56 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,TVD_SUBJ_NUM_OBFU_MINFP X-Spam-Check-By: apache.org Received-SPF: unknown (nike.apache.org: error in processing during lookup of jonas@borgstrom.se) Received: from [209.85.214.44] (HELO mail-bk0-f44.google.com) (209.85.214.44) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Apr 2012 07:48:48 +0000 Received: by bkuw5 with SMTP id w5so2121281bku.31 for ; Mon, 02 Apr 2012 00:48:28 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:content-type:content-transfer-encoding :x-gm-message-state; bh=I8Yj+oX2Z97NwswNnr3aqo5dyYsLSF86wNMBJpfm5S0=; b=oveirFF8yEYGmSPXVwL61HVDAurv5rOuHsatbrtOoCYqCsiH9On40+u4q0B70EITAg 2rk032SowBk+HCd1HKYVOn7js3RCIibkFpnIKnHlU7b3si1GPyqzd9WKGO6bdK/Rdopq //f0mkNl2zcC9w1QEE5tffH1CR8Mgfd69lj8vKuLU/m0cn+Flf8fetcLtkkmEtBWn5nB KX0QyxMpMWX6JibpydGnoBBWeKoHGolnJIm3v1s0iJwe58d1fOnyAaulMftpL8UIjWAV uTXV+JwHb4Ges4WiA44FjcOzME63ga/BC22ooGg1FWZgxz9SQ7YHWfMD/kcglrOHuCxq SdXQ== Received: by 10.204.145.70 with SMTP id c6mr3158347bkv.41.1333352907999; Mon, 02 Apr 2012 00:48:27 -0700 (PDT) Received: from jonasbor-mac.trioptima.net (tc-pa-ext.trioptima.com. [88.80.182.81]) by mx.google.com with ESMTPS id b3sm27172027bki.16.2012.04.02.00.48.26 (version=SSLv3 cipher=OTHER); Mon, 02 Apr 2012 00:48:26 -0700 (PDT) Message-ID: <4F7959C9.7030807@borgstrom.se> Date: Mon, 02 Apr 2012 09:48:25 +0200 From: =?UTF-8?B?Sm9uYXMgQm9yZ3N0csO2bQ==?= User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.7; rv:11.0) Gecko/20120327 Thunderbird/11.0.1 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Re: sstable2json and resurrected rows References: <4F72CA92.2000504@borgstrom.se> <4F75A97D.3070307@borgstrom.se> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Gm-Message-State: ALoCoQnVpTBUATKQbty1CXZiAQmdDpqyr7gK/2Rwx7d3/e9WQiEVbT6oE+odSPkKOQP4BxMAVfu5 On 2012-03-31 08:45 , Zhu Han wrote: > Did you hit the bug here? > > https://issues.apache.org/jira/browse/CASSANDRA-4054 Yes looks like it. But what confuses me most is not the sstable2json bug but why the major compaction does not replace the deleted row data with a tombstone. Is that a bug or a feature? To me it just looks like a wast of disk space... / Jonas > > best regards, > > 坚果云 , 最简捷易用的云存储 > 无限空间, 文件同步, 备份和分享! > > > 2012/3/30 Jonas Borgström > > > Let me rephrase my question: > > Is it true that deleted rows will still be present in the sstable > after a major compaction with 1.0.8 (not just tombstones)? > > Or did I mess up my test below? > > / Jonas > > > > On 2012-03-28 10:23 , Jonas Borgström wrote: > > Hi all, > > I've noticed a change in behavior between 0.8.10 and 1.0.8 when > it comes > to sstable2json output and major compactions. Is this a bug or > intended > behavior? > > With 1.0.8: > > create keyspace ks; > use ks; > create column family foo; > set foo[1][1] = 1; > nodetool -h localhost flush > sstable2json foo-hc-1-Data.db => > { > "01": [["01","01",1332920802272000]] > } > del foo[1]; > set foo[2][2] = 2; > nodetool -h localhost flush > sstable2json foo-hc-2-Data.db => > { > "01": [], > "02": [["02","02",1332920843090000]] > } > nodetool -h localhost compact ks foo > > So far so good. But now I expect the resulting sstable to look like > foo-hc-2 (the way 0.8.10 behaves) but instead it looks like the > deleted > foo[1] has been resurrected (foo[1] is still deleted when using the > thrift api): > > sstable2json foo-hc-3-Data.db => > { > "01": [["01","01",1332920802272000]]__, > "02": [["02","02",1332920843090000]] > } > > So why is the full foo[1] row included in the sstable2json > output and > not just a tombstone? > > This is both a wast of disk space and makes it impossible to > trust the > sstable2json output. > > / Jonas > > >