Return-Path: Delivered-To: apmail-cassandra-dev-archive@www.apache.org Received: (qmail 87661 invoked from network); 20 Sep 2010 16:47:44 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 Sep 2010 16:47:44 -0000 Received: (qmail 60592 invoked by uid 500); 20 Sep 2010 16:47:43 -0000 Delivered-To: apmail-cassandra-dev-archive@cassandra.apache.org Received: (qmail 60492 invoked by uid 500); 20 Sep 2010 16:47:42 -0000 Mailing-List: contact dev-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list dev@cassandra.apache.org Received: (qmail 60484 invoked by uid 99); 20 Sep 2010 16:47:42 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 20 Sep 2010 16:47:42 +0000 X-ASF-Spam-Status: No, hits=0.7 required=10.0 tests=RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.160.172] (HELO mail-gy0-f172.google.com) (209.85.160.172) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 20 Sep 2010 16:47:33 +0000 Received: by gyd12 with SMTP id 12so1907881gyd.31 for ; Mon, 20 Sep 2010 09:47:12 -0700 (PDT) MIME-Version: 1.0 Received: by 10.150.69.30 with SMTP id r30mr9213902yba.43.1285001232539; Mon, 20 Sep 2010 09:47:12 -0700 (PDT) Sender: scode@scode.org Received: by 10.151.101.3 with HTTP; Mon, 20 Sep 2010 09:47:12 -0700 (PDT) X-Originating-IP: [95.193.61.84] In-Reply-To: <4C977450.5030908@corp.aol.com> References: <4C977450.5030908@corp.aol.com> Date: Mon, 20 Sep 2010 18:47:12 +0200 X-Google-Sender-Auth: HgUxPb_DfU73GQpJIq99JBbHEsY Message-ID: Subject: Re: improving read performance From: Peter Schuller To: dev@cassandra.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org > This drawback is unfortunate for systems that use time-based row keys. = =C2=A0 =C2=A0In > such systems, row data will generally not be fragmented very much, if at > all, but reads suffer because the assumption is that all data is fragment= ed. > =C2=A0 =C2=A0Even further, in a real-time system where reads occur quickl= y after > writes, if the data is in memory, the sstables are still checked. Perhaps I am misunderstanding you, but why is this a problem (in the particular case of time based row keys) given that existence of the bloom filters which should eliminate the need to go down to the sstables to any extent more than that they actually contain data for the row (in almost all cases, subject to bloom filter false positives)? Also, for the case of the edges where memtables are flushed, a write-through row cache should help alleviate that. I forget off hand whether the row cache is in fact write-through or not though. --=20 / Peter Schuller