Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 08A727908 for ; Fri, 2 Sep 2011 22:48:38 +0000 (UTC) Received: (qmail 94983 invoked by uid 500); 2 Sep 2011 22:48:37 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 94946 invoked by uid 500); 2 Sep 2011 22:48:36 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 94938 invoked by uid 99); 2 Sep 2011 22:48:36 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Sep 2011 22:48:36 +0000 X-ASF-Spam-Status: No, hits=-2.8 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_HI X-Spam-Check-By: apache.org Received-SPF: unknown (athena.apache.org: error in processing during lookup of mingma@ebay.com) Received: from [216.113.175.153] (HELO den-mipot-002.corp.ebay.com) (216.113.175.153) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Sep 2011 22:48:30 +0000 DomainKey-Signature: s=corp; d=ebay.com; c=nofws; q=dns; h=X-EBay-Corp:X-IronPort-AV:Received:Received:From:To:Date: Subject:Thread-Topic:Thread-Index:Message-ID: Accept-Language:Content-Language:X-MS-Has-Attach: X-MS-TNEF-Correlator:acceptlanguage:x-ems-proccessed: x-ems-stamp:Content-Type:MIME-Version:X-CFilter; b=v+h7qvMHqRmWWonIJ3CEF+hlCLmOzdWoRB544n0bAjiOt6ftzvwthia3 O/fz8lCR2l8Fih0tyiv/yxl6MoeI5pIdaGQwuZ5qdiEg7xMytoQabyeZa bQOYRi7Q1cZ1aoW; DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=ebay.com; i=mingma@ebay.com; q=dns/txt; s=corp; t=1315003710; x=1346539710; h=from:to:date:subject:message-id:mime-version; bh=griWhVjsdX+3/BDYVyJZOes9rYtpv6ZrvrYNKaiEOXY=; b=V5YR3m3BA57Gqf+Z3YxxD44eIhn9eOmHiO2+I0BxQ0laKgu1KCcgah8+ j3vVjlXocWtSf5RvxYTEbBBYOQn1uFBqOrOLqfHGg8sPxV/LB5+ha1GFA 17LPqA3HIrwDw83; X-EBay-Corp: Yes X-IronPort-AV: E=Sophos;i="4.68,321,1312182000"; d="scan'208,217";a="3771779" Received: from den-vtenf-001.corp.ebay.com (HELO DEN-MEXHT-001.corp.ebay.com) ([10.101.112.212]) by den-mipot-002.corp.ebay.com with ESMTP; 02 Sep 2011 15:48:08 -0700 Received: from DEN-MEXMS-001.corp.ebay.com ([10.241.16.225]) by DEN-MEXHT-001.corp.ebay.com ([10.241.17.52]) with mapi; Fri, 2 Sep 2011 16:48:07 -0600 From: "Ma, Ming" To: "dev@hbase.apache.org" Date: Fri, 2 Sep 2011 16:48:03 -0600 Subject: region transition trace Thread-Topic: region transition trace Thread-Index: Acxpwl+0zJbHE6BqTROR9UZ0JPcgSQ== Message-ID: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US x-ems-proccessed: 10SqDH0iR7ekR7SRpKqm5A== x-ems-stamp: KG271kj5/Pn/hTIZNDPuFQ== Content-Type: multipart/alternative; boundary="_000_D10C58C9BA49DC40AA68056049009BF36464A0D57BDENMEXMS001co_" MIME-Version: 1.0 X-CFilter: Scanned --_000_D10C58C9BA49DC40AA68056049009BF36464A0D57BDENMEXMS001co_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Perhaps this feature was discussed before. we currently don't have the full= transition history of all regions { fromState, ToState, the machine that i= nitiates it }; the states will be removed from zookeeper /hbase/unassigned = once the region is assigned; also ".META." table has max version of 10 thus= only tracks the last 10 RS assignments of a given region. One idea is to put such transition history data to zookeeper. One issue is = it could blow up zookeeper memory if we have large number of regions and th= e cluster runs for a long time. I would like to get your feedback on differ= ent approaches to address the issue. One assumption is region assignment do= esn't happen with high frequency and thus the overhead introduced won't hav= e much impact on the system performance. Approach 1: Zookeeper knows the history of how /hbase/unassigned is modified, if we can= get zookeeper's logs (Bookkeeper ? ) somehow, we know the history of regio= n transition. Approach 2: 1. HBase logs extra region transition data to zookeeper. It could be o= ne zookeeper node per transaction. 2. Have a separate thread on the Master to move data from zookeeper an= d append to HDFS. That will keep the zookeeper size in check. 3. Have some tool or web UI to show the history of a given region by l= ooking at zookeeper and HDFS. Ming --_000_D10C58C9BA49DC40AA68056049009BF36464A0D57BDENMEXMS001co_--