Return-Path: Delivered-To: apmail-cassandra-dev-archive@www.apache.org Received: (qmail 38088 invoked from network); 7 Apr 2010 20:27:54 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 7 Apr 2010 20:27:54 -0000 Received: (qmail 39326 invoked by uid 500); 7 Apr 2010 20:27:54 -0000 Delivered-To: apmail-cassandra-dev-archive@cassandra.apache.org Received: (qmail 39310 invoked by uid 500); 7 Apr 2010 20:27:54 -0000 Mailing-List: contact dev-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list dev@cassandra.apache.org Received: (qmail 39302 invoked by uid 99); 7 Apr 2010 20:27:54 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Apr 2010 20:27:54 +0000 X-ASF-Spam-Status: No, hits=4.4 required=10.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of sahmed1020@gmail.com designates 209.85.220.216 as permitted sender) Received: from [209.85.220.216] (HELO mail-fx0-f216.google.com) (209.85.220.216) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Apr 2010 20:27:46 +0000 Received: by fxm8 with SMTP id 8so1566836fxm.25 for ; Wed, 07 Apr 2010 13:27:26 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:received:message-id :subject:from:to:content-type; bh=zrWuW5wdJlJFcegxr66p3H+NKgh6/u67fEZLJYdXkSw=; b=YPRVIR9YFyWLgtbkHfAtqKB3jk73tb9SZQuEKlv9fQPcKm/VLkWa6eYnj92ZrYWj54 k8WmwnzdrxgO6INmOjQxyZjt1dVQFWpe10n25qlMqFpK0Qp56wR76weuE22h16uMH6mx qDiETYktR2J2W/fdYQfxnnUMHd0HilDN5fiqQ= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=sRStkaEvBj+pwGH8LvZe5hgNsmoCaklzZedRZzJvP7mTdwQb0MRwBLMGWIs9m7LFEu H0c2m+YxlFdWZfD05izTIvsJcRO3XujNoU+Tu3WkPeKKUoca63T6KuwvnrDdabZgH5Ia adqlmjEXdWW0F5IJ9MS3zd33tYS+Naxksc798= MIME-Version: 1.0 Received: by 10.223.109.69 with HTTP; Wed, 7 Apr 2010 13:27:25 -0700 (PDT) Date: Wed, 7 Apr 2010 16:27:25 -0400 Received: by 10.223.4.217 with SMTP id 25mr9500194fas.60.1270672045940; Wed, 07 Apr 2010 13:27:25 -0700 (PDT) Message-ID: Subject: boonfilters From: S Ahmed To: dev@cassandra.apache.org Content-Type: multipart/alternative; boundary=000e0cdfd990eca0310483ab6306 X-Virus-Checked: Checked by ClamAV on apache.org --000e0cdfd990eca0310483ab6306 Content-Type: text/plain; charset=ISO-8859-1 Just reading up on boonfilters, few questions. Basically boonfilters let give you a true/false if a particular key exists, and they *may* give you a false positive i.e. they key exists but never a false negative i.e. the key doesn't exist. The core of boonfilters is its hashing mechanism that marks the in-memory matrix/map if the key exists. 1. Is the only place boonfilters are used in Cassandra is when you want to see if a particular key exists in a particular node? 2. Are boonfilters a fixed size, or they adjust as to the # of keys? any example size? 3. Boonfilters don't give false negatives: So you hit a node, and perform a lookup in the boonfilter for a key. It says "yes", but when you do a lookup the object returned is null, so then you flag that this node needs this particular key during replication. Have I grasp this concept? Really loving this project, learning allot from the code. It would be great if someone could do a walkthrough of common functionality in a detailed way :) --000e0cdfd990eca0310483ab6306--