Message-ID: <4E91AB18.8000706@sendmail.cz>
Date: Sun, 09 Oct 2011 16:09:28 +0200
From: Radim Kolar
To: user@cassandra.apache.org
Subject: Re: how to reduce disk read? (and bloom filter performance)
References: <4E8EA5B3.2020601@sendmail.cz> <4E8EBBD4.7030206@sendmail.cz> <4E8F6908.7020107@sendmail.cz>

On 7.10.2011 23:16, Mohit Anchlia wrote:
> You'll see output like:
>
> Offset SSTables
> 1 8021
> 2 783
>
> Which means 783 read operations accessed 2 SSTables

Thank you for explaining it to me. I see this:

Offset  SSTables
1       59323
2       857
3       56

This means the bloom filter failure ratio is over 1%. Cassandra's unit tests expect a bloom filter false positive rate below 1.05%. HBase has configurable bloom filters: you can choose 1% or 0.5%, which can make a difference for a large cache. But the upshot is that my poor read performance should not be caused by bloom filters.
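
As a rough check of the "over 1%" figure, the histogram can be reduced to a single ratio. This is a minimal Python sketch, assuming that every read which touched more than one SSTable counts as a bloom filter miss. That makes the result an upper bound, since a row can legitimately be spread over several SSTables. The function name is illustrative and not part of any Cassandra tool.

```python
# Histogram copied from the message:
# offset (SSTables touched per read) -> number of reads.
histogram = {1: 59323, 2: 857, 3: 56}


def multi_sstable_fraction(hist):
    """Return the fraction of reads that touched more than one SSTable."""
    total = sum(hist.values())
    extra = sum(count for offset, count in hist.items() if offset > 1)
    return extra / total


fraction = multi_sstable_fraction(histogram)
print(f"{fraction:.2%} of reads touched more than one SSTable")
# prints "1.52% of reads touched more than one SSTable"
```

Here 913 of 60236 reads went to a second or third SSTable, so roughly 98.5% of reads were served from a single SSTable.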