Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7C27618BAC for ; Thu, 21 Apr 2016 00:18:57 +0000 (UTC) Received: (qmail 77820 invoked by uid 500); 21 Apr 2016 00:18:55 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 77741 invoked by uid 500); 21 Apr 2016 00:18:55 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 77729 invoked by uid 99); 21 Apr 2016 00:18:55 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 21 Apr 2016 00:18:55 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 20BCFC0940 for ; Thu, 21 Apr 2016 00:18:54 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.179 X-Spam-Level: * X-Spam-Status: No, score=1.179 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id uegSMCM-1Qi9 for ; Thu, 21 Apr 2016 00:18:51 +0000 (UTC) Received: from mail-lb0-f179.google.com (mail-lb0-f179.google.com [209.85.217.179]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTPS id ABB935F1F0 for ; Thu, 21 Apr 2016 00:18:51 +0000 (UTC) Received: by mail-lb0-f179.google.com with SMTP id b1so16092176lbi.1 for ; Wed, 20 Apr 2016 17:18:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to; bh=2g8VdyP96wElAtd+7nc5zvhqCJeZ0WNxVGrT6wNTD/0=; b=UUYb8aCCDVDBOkEI3IfFrRXGDJ9E43ycwF9WTlYh7PAlC05UTkPwNGGUDIA3VpFViw 8t5AHc0xuLn0QCzAx7PLw7+YSbhf0qtnFplvJrq0eUdyCsr2OL49FO7TBRHcVmA1APvH stH3LKuDUDHTpnW+Lb9sSOy6d9AZa9w+3h1TCj+swjcCump8+Ii7LRRHcoDrYaDkAuYP M+PeUg0YtIjT45BnVzz7pNJSrn1/JJ4Jo3IN3kTI2X6Belj9InWfCkAlFRoC8+NfeaEB vgaj2k8kFjtRTmNWM+a6ABxampPxlGOAfW3V7/ZeM0uIkLAjCHcosOrwbrIWvuNNT1DK lvCQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to; bh=2g8VdyP96wElAtd+7nc5zvhqCJeZ0WNxVGrT6wNTD/0=; b=mgHw3pWVSbKgYN2cJ0IEA+i5k1Aidskbzc2GpNgTft7pYXy1YEI2URyOUU2DRU7y9F 1UlXWI861eEti1HYr5y4NB81W5n4+TqqnN1an/hC2Kt24AaNHS39KXVvWnACAO8Z3auS odd7S+L7fS5xbg38pFRfF0+Qk8JS2Q1sSWAvuwqy4/JzbVSr5aOXg2jvmcxK+UJkBFIt UtBNxCmOBkbQ9LKbdZ23ejtv0IyAZ74eMvlvgDw0+G8N2WHJWju3UcT4YjLH1m53sEJL /IEFzC9ERrP2MwHPo+re8l7znUtgJAR2jYlVwji68eVgFS3G2AXU2y1kIeelfi2pZI7x swtg== X-Gm-Message-State: AOPr4FU8t6aVZLvbEf+zsundjYPh1OIyr9sEcZs1Vyx0/otLHpZjTY/t3r+F+62BDm4G7g+iIswCZPJgl96/Eg== MIME-Version: 1.0 X-Received: by 10.112.134.229 with SMTP id pn5mr4740488lbb.36.1461197925386; Wed, 20 Apr 2016 17:18:45 -0700 (PDT) Received: by 10.112.110.167 with HTTP; Wed, 20 Apr 2016 17:18:45 -0700 (PDT) In-Reply-To: References: Date: Wed, 20 Apr 2016 17:18:45 -0700 Message-ID: Subject: Re: Retiring empty regions From: Vladimir Rodionov To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=089e0115f8c64a12770530f3a7f8 --089e0115f8c64a12770530f3a7f8 Content-Type: text/plain; charset=UTF-8 >I'd love to hear your thoughts on this design, Vlad. Maybe you'd like to >write up a post for the blog? Meanwhile, I'm sure of a couple of us on here >on the list would appreciate your Cliff's Notes version. I can take this >into account for my v2 schema design. Nick, there will be a presentation on time-series HBase (hbasecon.com) Come join us :) On Mon, Apr 4, 2016 at 8:34 AM, Nick Dimiduk wrote: > > Crazy idea, but you might be able to take stripped down version of region > > normalizer code and make a Tool to run? Requesting split or merge is done > > through the client API, and the only weighing information you need is > > whether region empty or not, that you could find out too? > > Yeah, that's the direction I'm headed. > > > A bit off topic, but I think unfortunately region normalizer now ignores > > empty regions to avoid undoing pre-split on the table. > > Unfortunate indeed. Maybe we should be keeping around the initial splits > list as a metadata attribute on the table? > > > With a right row-key design you will never have empty regions due to TTL. > > I'd love to hear your thoughts on this design, Vlad. Maybe you'd like to > write up a post for the blog? Meanwhile, I'm sure of a couple of us on here > on the list would appreciate your Cliff's Notes version. I can take this > into account for my v2 schema design. > > > So Nick, merge on 1.1 is not recommended??? Was working very well on > > previous versions. Is ProcV2 really impact it that bad?? > > How to answer here carefully... I have no reason to believe merge is not > working on 1.1. I've been on the wrong end of enough "regions stuck in > transition" support tickets that I'm not keen to put undue stress on my > master. ProcV2 insures against many scenarios that cause master trauma, > hence my interest in the implementation details and my preference for > cluster administration tasks that use it as their source of authority. > > Thanks for the thoughts folks. > -n > > On Fri, Apr 1, 2016 at 10:52 AM, Jean-Marc Spaggiari < > jean-marc@spaggiari.org> wrote: > > > ;) That was not the question ;) > > > > So Nick, merge on 1.1 is not recommended??? Was working very well on > > previous versions. Is ProcV2 really impact it that bad?? > > > > JMS > > > > 2016-04-01 13:49 GMT-04:00 Vladimir Rodionov : > > > > > >> This is something > > > >> which makes it far less useful for time-series databases with short > > TTL > > > on > > > >> the tables. > > > > > > With a right row-key design you will never have empty regions due to > TTL. > > > > > > -Vlad > > > > > > On Thu, Mar 31, 2016 at 10:31 PM, Mikhail Antonov < > olorinbant@gmail.com> > > > wrote: > > > > > > > Crazy idea, but you might be able to take stripped down version of > > region > > > > normalizer code and make a Tool to run? Requesting split or merge is > > done > > > > through the client API, and the only weighing information you need is > > > > whether region empty or not, that you could find out too? > > > > > > > > > > > > "Short of upgrading to 1.2 for the region normalizer," > > > > > > > > A bit off topic, but I think unfortunately region normalizer now > > ignores > > > > empty regions to avoid undoing pre-split on the table. This is > > something > > > > which makes it far less useful for time-series databases with short > TTL > > > on > > > > the tables. We'll need to address that. > > > > > > > > -Mikhail > > > > > > > > On Thu, Mar 31, 2016 at 9:56 PM, Nick Dimiduk > > > wrote: > > > > > > > > > Hi folks, > > > > > > > > > > I have a table with TTL enabled. It's been receiving data for a > while > > > > > beyond the TTL and I now have a number of empty regions. I'd like > to > > > drop > > > > > those empty regions to free up heap space on the region servers and > > > > reduce > > > > > master load. I'm running a 1.1 derivative. > > > > > > > > > > The only threads I found on this topic are from circa 0.92 > timeframe. > > > > > > > > > > Short of upgrading to 1.2 for the region normalizer, what's the > > > > recommended > > > > > method of cleaning up this cruft? Should I be merging empty regions > > > into > > > > > their neighbor's? Looks like region merge hasn't been migrated to > > > ProcV2 > > > > > yet so would be wise to reduce online table activity, or at least > aim > > > > for a > > > > > "quiet period"? Is there a documented process for off-lining and > > > > deleting a > > > > > region by name? I don't see anything in the book about it. > > > > > > > > > > I experimented with online merge on pseudodist, looks like it's > > working > > > > > fine for the most basic case. I'll probably pursue this unless > > someone > > > > has > > > > > some other ideas. > > > > > > > > > > Thanks, > > > > > Nick > > > > > > > > > > > > > > > > > > > > > -- > > > > Thanks, > > > > Michael Antonov > > > > > > > > > > --089e0115f8c64a12770530f3a7f8--