Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E380544DE for ; Mon, 6 Jun 2011 16:08:08 +0000 (UTC) Received: (qmail 80453 invoked by uid 500); 6 Jun 2011 16:08:08 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 80428 invoked by uid 500); 6 Jun 2011 16:08:08 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 80420 invoked by uid 99); 6 Jun 2011 16:08:08 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Jun 2011 16:08:08 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jason.rutherglen@gmail.com designates 209.85.220.169 as permitted sender) Received: from [209.85.220.169] (HELO mail-vx0-f169.google.com) (209.85.220.169) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Jun 2011 16:08:02 +0000 Received: by vxk20 with SMTP id 20so4056675vxk.14 for ; Mon, 06 Jun 2011 09:07:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type:content-transfer-encoding; bh=/grqTncpqeXNXloxbAqfX4ggE2Q/PItPFK8BkDrc1d8=; b=KihKApfNaZkABYmuWJ8OevmVj23W+LxX/D9Fni9A2fkQsgjwwAhPAHgG33NO1CEobn xqNJIGWyCcLgvzJK9nE21GIGkGCPcfBgCM8sq6QB+RDfZCW3WWNP7CBmwVhWUVE5/O8r vJTMrIfMY4z3GPdNKpVGHp10qWiMCFX/L4xoM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=rrMagqdy0RnKSZ5+byxdWHgpupqqCIb7MKfSRqSkVMQoKCBug5TGhhjsevM/d9TSr6 nHN9LFlbdq3hXMTDHuyHGm7EsZ4rWeVHp/H6JTwcw1P446tkgqBCguJv5X7dghTt1d7y bp90yRE1AS6oWsvLVvkGuGEHlbykqo3Pslbr0= MIME-Version: 1.0 Received: by 10.52.113.194 with SMTP id ja2mr2307895vdb.300.1307376461112; Mon, 06 Jun 2011 09:07:41 -0700 (PDT) Received: by 10.52.107.136 with HTTP; Mon, 6 Jun 2011 09:07:41 -0700 (PDT) In-Reply-To: References: <636E56F7-5C3B-4D60-9053-9D6BE67EBAF9@gmail.com> Date: Mon, 6 Jun 2011 09:07:41 -0700 Message-ID: Subject: Re: Pluggable block index From: Jason Rutherglen To: dev@hbase.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable > Are you sure that is a good idea? Surely the disk seeks would destroy > you on index load? I'm not sure what you mean there. We can create a total HFile block key index that has pointers to within blocks, eg, they'll store a block index int, and a position (within the block) int. Where would the index load occur as this method should allow faster key lookup? On Sun, Jun 5, 2011 at 11:37 PM, Ryan Rawson wrote: > On Sun, Jun 5, 2011 at 11:33 PM, Jason Rutherglen > wrote: >> Ok, the block index is only storing the first key of each block? >> Hmm... I think we can store a pointer to an exact position in the >> block, or at least allow that (for the FST implementation). > > Are you sure that is a good idea? =A0Surely the disk seeks would destroy > you on index load? >