Message-ID: <4E203372.8010604@tis.bz.it>
Date: Fri, 15 Jul 2011 14:32:50 +0200
From: Claudio Martella
To: user@hbase.apache.org
Subject: Hash indexing of HFiles

Hello list,

At SIGMOD this year I saw a variety of storage file designs for HBase, built on different techniques.
My scenario and usage don't really require range queries, so I thought I could take advantage of even faster random I/O by hash indexing the data in each sequence file. Does anybody know whether indexing techniques other than B-trees have been developed for sequence files?

Thanks!

--
Claudio Martella
Free Software & Open Technologies Analyst
TIS innovation park
Via Siemens 19 | Siemensstr. 19
39100 Bolzano | 39100 Bozen
Tel. +39 0471 068 123
Fax +39 0471 068 129
claudio.martella@tis.bz.it
http://www.tis.bz.it
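
To make the idea in the question concrete, here is a minimal sketch of a hash index over a record file. It is only an illustration under assumptions: the record layout (two 4-byte lengths followed by key and value bytes) and the helper names `write_records` and `get` are hypothetical, and none of this is HBase's HFile format or API. A Python dict stands in for the hash index, mapping each key to the byte offset of its record.

```python
import io
import struct

# Hypothetical record layout: 4-byte key length, 4-byte value length
# (both big-endian), then the key bytes, then the value bytes.
HEADER = struct.Struct(">II")


def write_records(buf, records):
    """Append (key, value) byte pairs to buf and return a hash index.

    The index is a dict (a hash table) mapping each key to the byte
    offset where its record starts.
    """
    index = {}
    for key, value in records:
        index[key] = buf.tell()
        buf.write(HEADER.pack(len(key), len(value)))
        buf.write(key)
        buf.write(value)
    return index


def get(buf, index, key):
    """Point lookup: one hash probe, one seek, one record read."""
    offset = index.get(key)
    if offset is None:
        return None
    buf.seek(offset)
    key_len, value_len = HEADER.unpack(buf.read(HEADER.size))
    stored_key = buf.read(key_len)
    if stored_key != key:
        raise ValueError("index points at a record with a different key")
    return buf.read(value_len)


# An in-memory buffer stands in for the on-disk file.
data = io.BytesIO()
index = write_records(
    data, [(b"row1", b"a"), (b"row2", b"bb"), (b"row3", b"ccc")]
)
value = get(data, index, b"row2")
```

The trade-off is the one the question already accepts: the dict keeps no key order, so a point lookup needs a single probe and seek, but a range query would have to scan every record.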