Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 31841 invoked from network); 22 Aug 2007 16:47:25 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 22 Aug 2007 16:47:25 -0000 Received: (qmail 40918 invoked by uid 500); 22 Aug 2007 16:47:20 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 40891 invoked by uid 500); 22 Aug 2007 16:47:20 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 40876 invoked by uid 99); 22 Aug 2007 16:47:20 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Aug 2007 09:47:20 -0700 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of icsong@gmail.com designates 64.233.162.239 as permitted sender) Received: from [64.233.162.239] (HELO nz-out-0506.google.com) (64.233.162.239) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Aug 2007 16:47:20 +0000 Received: by nz-out-0506.google.com with SMTP id i28so131928nzi for ; Wed, 22 Aug 2007 09:46:59 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:mime-version:content-type:content-transfer-encoding:content-disposition; b=A1J3w7hFOeTNiCn4S4mlKI15t/goiWv39Fo/X5f7JAY8plvtmrZ84sqF+wtR0Bczau1+FNpyHmLsr7zZ057hc0RZmA/PACaONrLDcRihG58rnMekRQ48hxk9bYatiLjJWhsQXyJTk4f+bun9hf17RJLEwA5DYJgcs8FjRANLkUI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:mime-version:content-type:content-transfer-encoding:content-disposition; b=NaKfHAU/UIi+JaTc/bLXtasaAD3dBWuxdGTXceOT33MRsJn3DSDz/iwpwOVWLEtI6bii95fGyugR58wNaKvMRGw0r8Xpw3hhl4Y2b8UtL1uSPJT2yDtaJOoBtfH7ZevUnHWVRXOn4I4ln1wkTYxdJ8tB+R1I+ZiGXnMWHZnTWsw= Received: by 10.114.254.1 with SMTP id b1mr935979wai.1187801218517; Wed, 22 Aug 2007 09:46:58 -0700 (PDT) Received: by 10.114.26.18 with HTTP; Wed, 22 Aug 2007 09:46:58 -0700 (PDT) Message-ID: <15af24600708220946x767b67e0g654d05c0825897a9@mail.gmail.com> Date: Thu, 23 Aug 2007 01:46:58 +0900 From: "Inchul Song" To: hadoop-dev@lucene.apache.org Subject: [Hbase] About the BloomFilterDescriptor class MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline X-Virus-Checked: Checked by ClamAV on apache.org Hi all, When I create a column for an Hbase table, I have to create an instance of the HColumnDescriptor class, and pass over an instance of the BloomFilterDescriptor class describing which bloom filter to use to the constructor of the HColumnDescriptor class. But there is some inconvenience in using the BloomFilterDescriptor class: 1. Non-existence of a single argument constructor When creating an instance of the BloomFilterDescriptor class, I need to specify some options for the newly created bloom filter. There are three options: type, vectorSize, nbHash. I know that these options are important for the internal working of a bloom filter, but I cannot help but confess that I don't really understand what vectorSize and hbHash mean and how these two options affect the way in which a bloom filter works. As the user of a bloom filter, the only thing I am concerned with is the first option, the name of the bloom filter that I'd like to use for the column, and it would be nice if the other options are automatically decided and filled in. So it would be nice if there is a constructor with a single 'type' argument in the BloomFilterDescriptor class. 2. Bloom filter types are defined as integers Bloom filter types are not in an enumeration class. Thus, when filling in the type option of the constructor from a String value, I always have to write some translation code from the string value to one of the integer values representing bloom filter types. If there is an enumeration class containing bloom filter types, I can utilize the valueOf method of the enumeration class to do this tedious job. Thanks, Song -- Inchul Song, Ph.D. Candidate Database Lab Division of Computer Science, KAIST Email: icsong@dbserver.kaist.ac.kr, icsong@gmail.com Phone: +82-42-869-3570 Fax: +82-42-867-2255