Date: Mon, 28 Mar 2011 11:52:14 -0500
Subject: Re: disk seek in range search
From: Weishung Chung
To: dev@hbase.apache.org
Cc: Jean-Daniel Cryans

Thank you, Jean, for the reading materials. I've been reading the source
code and searching the internet, and so far I have only a very vague idea
of how everything works. Give me a few more days (I don't want to embarrass
myself) and I will check back with you guys to see whether my understanding
is correct :D

Have a good day, thanks again :)

On Mon, Mar 28, 2011 at 11:23 AM, Jean-Daniel Cryans wrote:

> I think you are asking for a bit too much :)
>
> Let's do it the other way: show us what you think are the answers to
> your questions based on currently available documentation and by
> looking at the source code, then I'm pretty sure someone will be happy
> to verify it.
>
> Start by looking at the Bigtable paper, then use Lars George's blog
> posts like:
>
> http://www.larsgeorge.com/2009/10/hbase-architecture-101-storage.html
> http://www.larsgeorge.com/2010/01/hbase-architecture-101-write-ahead-log.html
> http://www.larsgeorge.com/2010/05/hbase-file-locality-in-hdfs.html
>
> And then try diving into the code.
>
> Good luck!
>
> J-D
>
> On Sat, Mar 26, 2011 at 8:57 AM, Weishung Chung wrote:
> > Dear fellow HBase developers,
> >
> > Could someone educate me and let me know how to figure out the number
> > of disk seeks involved in a range search (startRow to endRow specified
> > in Scan)? Also, could anyone give me the details of all the steps
> > involved once the Scan for range retrieval is called? I know it somehow
> > needs to figure out the regionservers hosting the rows, but I still
> > don't have a clear understanding of all the steps involved :( :(
> > Also, there is a data index block in HFile; I was wondering how the
> > index block is utilized in figuring out the location of all the rows.
> >
> > Thank you so much for satisfying my curiosity :)
> >
> > Have a good weekend and enjoy :)
> >
> > Wei Shung