Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5CAB38F3C for ; Thu, 11 Aug 2011 01:18:57 +0000 (UTC) Received: (qmail 46638 invoked by uid 500); 11 Aug 2011 01:18:54 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 46493 invoked by uid 500); 11 Aug 2011 01:18:53 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 46485 invoked by uid 99); 11 Aug 2011 01:18:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Aug 2011 01:18:53 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW X-Spam-Check-By: apache.org Received-SPF: unknown (nike.apache.org: error in processing during lookup of jason@apture.com) Received: from [209.85.160.172] (HELO mail-gy0-f172.google.com) (209.85.160.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 Aug 2011 01:18:45 +0000 Received: by gyf3 with SMTP id 3so1212997gyf.31 for ; Wed, 10 Aug 2011 18:18:24 -0700 (PDT) MIME-Version: 1.0 Received: by 10.236.184.104 with SMTP id r68mr10091278yhm.91.1313025504329; Wed, 10 Aug 2011 18:18:24 -0700 (PDT) Received: by 10.236.203.196 with HTTP; Wed, 10 Aug 2011 18:18:24 -0700 (PDT) Date: Wed, 10 Aug 2011 18:18:24 -0700 Message-ID: Subject: Tuning a column family for archival From: Jason Baker To: user Content-Type: multipart/alternative; boundary=20cf30207c6cc43e9e04aa3092d3 X-Virus-Checked: Checked by ClamAV on apache.org --20cf30207c6cc43e9e04aa3092d3 Content-Type: text/plain; charset=ISO-8859-1 I have a column family that I'm using to archive records. They're mostly kept around for historical purposes. Aside from that, they're mostly considered deleted. It's probably going to be very rare that anyone reads from this table *ever*. I don't really even write to it that much. Does anyone have advice for me as far as how (or if) I should tune this table with that in mind? My concern is less speeding up access to this table than it is making sure that it doesn't impact the performance of any other column families in any way. Here's the data from nodetool cfstat (although this table was just created a few days ago): Column Family: ArchivedLinks SSTable count: 1 Space used (live): 29580801 Space used (total): 97838786 Number of Keys (estimate): 93184 Memtable Columns Count: 7497 Memtable Data Size: 3223587 Memtable Switch Count: 11 Read Count: 0 Read Latency: NaN ms. Write Count: 139091 Write Latency: 0.007 ms. Pending Tasks: 0 Key cache: disabled Row cache: disabled Compacted row minimum size: 259 Compacted row maximum size: 372 Compacted row mean size: 311 --20cf30207c6cc43e9e04aa3092d3 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable I have a column family that I'm using to archive records. =A0They'r= e mostly kept around for historical purposes. =A0Aside from that, they'= re mostly considered deleted. =A0It's probably going to be very rare th= at anyone reads from this table *ever*. =A0I don't really even write to= it that much. =A0

Does anyone have advice for me as far as how (or if) I shoul= d tune this table with that in mind? =A0My concern is less speeding up acce= ss to this table than it is making sure that it doesn't impact the perf= ormance of any other column families in any way.

Here's the data from nodetool cfstat (although this= table was just created a few days ago):

Column Family= : ArchivedLinks
SSTa= ble count: 1
Space used (live): 29580801
Space used (total): 97838786
Numb= er of Keys (estimate): 93184
Memtable Columns Count: 7497
Memtable Data S= ize: 3223587
Memt= able Switch Count: 11
Read Count: 0
Read Latency: NaN ms.
Writ= e Count: 139091
Write Latency: 0.007 ms.
Pending Tasks: 0
Key = cache: disabled
Row cache: disabled
Compacted row minimum size: 259
Comp= acted row maximum size: 372
Compacted row mean size: 311
--20cf30207c6cc43e9e04aa3092d3--