Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C7A1AD462 for ; Sun, 19 Aug 2012 08:46:55 +0000 (UTC) Received: (qmail 21266 invoked by uid 500); 19 Aug 2012 08:46:54 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 20735 invoked by uid 500); 19 Aug 2012 08:46:50 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 20705 invoked by uid 99); 19 Aug 2012 08:46:49 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 19 Aug 2012 08:46:49 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of lars.george@gmail.com designates 209.85.212.179 as permitted sender) Received: from [209.85.212.179] (HELO mail-wi0-f179.google.com) (209.85.212.179) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 19 Aug 2012 08:46:41 +0000 Received: by wibhq4 with SMTP id hq4so2349028wib.2 for ; Sun, 19 Aug 2012 01:46:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to:x-mailer; bh=agp6fi1bwwO6YcOS4Y8rKFNwPhaattjxlTNyzQO92as=; b=Q9PrJJUBNDw9avAqH/g4h41/ALkDKOuARQGhmsSei7GMq5Q7FXcuM4bPFa9oVcAnZM Qnk9SmK3XKw3O/OzFzw+up5id1PHhYvPOYYlqPqeQMSCjs9wbhxrqJ2chBmYRF66s7Fu Go5ckkHFBK5g+Qowsy3+bgI/P4flp1x87OAfzZghh3u0u4hAJGQSV5yT+x7btQXuv5ET Eluc9VsHaWQl5zOZa5q4XuogBxsg2GTUNq3VKQrkoZc4+JDbKc5x5zcaDAtOQFQWPZQO VmEyY/bE3s10Z7syVN8zTDW97ozbrdZpcJhuGdvlNiwECLu1KdDHhJYYvSEdJBj8Vc4c TxZQ== Received: by 10.180.100.37 with SMTP id ev5mr19368028wib.5.1345365981392; Sun, 19 Aug 2012 01:46:21 -0700 (PDT) Received: from [10.0.0.31] (p4FEBDFAB.dip.t-dialin.net. [79.235.223.171]) by mx.google.com with ESMTPS id bc2sm30955526wib.0.2012.08.19.01.46.19 (version=TLSv1/SSLv3 cipher=OTHER); Sun, 19 Aug 2012 01:46:20 -0700 (PDT) Content-Type: text/plain; charset=windows-1252 Mime-Version: 1.0 (Mac OS X Mail 6.0 \(1485\)) Subject: Re: how client location a region/tablet? From: Lars George In-Reply-To: Date: Sun, 19 Aug 2012 10:46:17 +0200 Content-Transfer-Encoding: quoted-printable Message-Id: <531086F4-4BF3-4BB2-9B28-CCAA5B5049D6@gmail.com> References: To: user@hbase.apache.org X-Mailer: Apple Mail (2.1485) That is spot on Stack, it is the worst case scenario as you describe, = i.e. all cached information is stale. Lars On Aug 19, 2012, at 6:40 AM, Stack wrote: > On Sat, Aug 18, 2012 at 2:13 AM, Lin Ma wrote: >> Hello guys, >>=20 >> I am referencing the Big Table paper about how a client locates a = tablet. >> In section 5.1 Tablet location, it is mentioned that client will = cache all >> tablet locations, I think it means client will cache root tablet in >> METADATA table, and all other tablets in METADATA table (which means = client >> cache the whole METADATA table?). My question is, whether HBase = implements >> in the same or similar way? My concern or confusion is, supposing = each >> tablet or region file is 128M bytes, it will be very huge space (i.e. >> memory footprint) for each client to cache all tablets or region = files of >> METADATA table. Is it doable or feasible in real HBase clusters? = Thanks. >>=20 >=20 > Yeah, we client cache's locations, not the data. >=20 >=20 >> BTW: another confusion from me is in the paper of Big Table section = 5.1 >> Tablet location, it is mentioned that "If the client=92s cache is = stale, the >> location algorithm could take up to six round-trips, because stale = cache >> entries are only discovered upon misses (assuming that METADATA = tablets do >> not move very frequently).", I do not know how the 6 times round trip = time >> is calculated, if anyone could answer this puzzle, it will be great. = :-) >>=20 >=20 > I'm not sure what the 6 is about either. Here is a guesstimate: >=20 > 1. Go to cached location for a server for a particular user region, > but server says that it does not have a region, the client location is > stale > 2. Go back to client cached meta region that holds user region w/ row > we want, but its location is stale. > 3. Go to root location, to find new location of meta, but the root > location has moved.... what the client has is stale > 4. Find new root location and do lookup of meta region location > 5. Go to meta region location to find new user region > 6. Go to server w/ user region >=20 > St.Ack