Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7F88618668 for ; Fri, 19 Jun 2015 16:19:59 +0000 (UTC) Received: (qmail 19263 invoked by uid 500); 19 Jun 2015 16:19:57 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 19185 invoked by uid 500); 19 Jun 2015 16:19:57 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 19172 invoked by uid 99); 19 Jun 2015 16:19:57 -0000 Received: from Unknown (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 19 Jun 2015 16:19:57 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 9373218019D for ; Fri, 19 Jun 2015 16:19:56 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.88 X-Spam-Level: ** X-Spam-Status: No, score=2.88 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-west.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id KmYch6ut3Zjf for ; Fri, 19 Jun 2015 16:19:49 +0000 (UTC) Received: from mail-wi0-f169.google.com (mail-wi0-f169.google.com [209.85.212.169]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with ESMTPS id 594FF249EA for ; Fri, 19 Jun 2015 16:19:48 +0000 (UTC) Received: by wiga1 with SMTP id a1so23753351wig.0 for ; Fri, 19 Jun 2015 09:19:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=AWwftcm2LF0i5ANelOzhGHjRsvFt2hY8W28S+a0yFYI=; b=IxYmPDTmBZ8+KpuYfR9Cm0CQrMMqgIgDfsDnQTzyc0vO/vp2PsKXw2WdHetpMctI6f HycS9mKWRnKK7UytwUSMBfMYq/QNZfk5jTqP8ERyhYJ013Qay+uroz0JWwHlwJBoDC14 7d0M+h5e2p+m/ZgZNDxo2nVJCjTVsiUmEA5eMTg9ba8Hy1tupSnz18TsiSXOfElb1wsC 6oWx0OVQ+dkRizXLpJUuyMttFjt7Ro6/G0ZDtndIaVGSfDL7Ypk6GdI+3cFpPXdkq6IV RRSLX7oQsRlutxHCMc4uqp7xCKCQH/jLgNg8q+kB1lgw8RfjaUG+/be+oo47hlOxVzpx LQWA== X-Received: by 10.180.99.2 with SMTP id em2mr8129441wib.59.1434730781092; Fri, 19 Jun 2015 09:19:41 -0700 (PDT) MIME-Version: 1.0 Received: by 10.28.146.193 with HTTP; Fri, 19 Jun 2015 09:19:20 -0700 (PDT) In-Reply-To: References: From: Nick Dimiduk Date: Fri, 19 Jun 2015 09:19:20 -0700 Message-ID: Subject: Re: Stochastic Balancer by tables To: hbase-user Content-Type: multipart/alternative; boundary=f46d04426c688e30ff0518e14a24 --f46d04426c688e30ff0518e14a24 Content-Type: text/plain; charset=UTF-8 On Fri, Jun 19, 2015 at 7:45 AM, Nasron Cheong wrote: > I couldn't find a tool to show regions and their sizes, for a specific > table, so ended up writing one. > Nasron, Would you mind having a look at the patch/RB on HBASE-13103? Does the API pair RegionNormalizer/Normalization plan look like a reasonable harness for you to hang your custom tool onto? Just like the balancer, it's designed to be extensible with different normalization strategies. On Fri, Jun 19, 2015 at 3:47 AM, Dejan Menges > wrote: > > > Just have to say that hbase.master.loadbalance.bytable saved us after we > > discovered it. In our case we had to set it manually to true, and then it > > was easy to catch hot spotting on unusually large regions and handle it. > > > > Btw +1 for HBASE-13013, had to say it, something that makes me starting > > upgrading our HDP stack on Monday morning. > > > > On Thu, Jun 18, 2015 at 11:04 PM, Bryan Beaudreault < > > bbeaudreault@hubspot.com> wrote: > > > > > Just had to say, https://issues.apache.org/jira/browse/HBASE-13103 > looks > > > *AWESOME* > > > > > > On Thu, Jun 18, 2015 at 5:00 PM Mikhail Antonov > > > wrote: > > > > > > > Yeah, I could see 2 reasons for remaining few regions to take > > > > unproportionally long time - 1) those regions are unproportionally > > > > large (you should be able to quickly confirm it) and 2) they happened > > > > to be hosted on really slow/overloaded machine(s). #1 seems far more > > > > likely to me. > > > > > > > > And as Nick said, there's ongoing effort to provide exactly what > > > > you've described - centralized periodic analysis of region sizes and > > > > equalization as needed (somewhat complementary to balancing), and any > > > > feedback (especially from folks experiencing real issues with unequal > > > > region sizes) is much appreciated. > > > > > > > > -Mikhail > > > > > > > > On Thu, Jun 18, 2015 at 10:07 AM, Nick Dimiduk > > > wrote: > > > > > If you're interested in region size balancing, please have a look > at > > > > > https://issues.apache.org/jira/browse/HBASE-13103 . Please provide > > > > feedback > > > > > as we're hoping to have an early version available in 1.2. > > > > > > > > > > Which reminds me, I owe Mikhail another review... > > > > > > > > > > On Thu, Jun 18, 2015 at 9:39 AM, Elliott Clark > > > > wrote: > > > > > > > > > >> The balancer is not responsible fore region size decisions. The > > > > balancer is > > > > >> only responsible for deciding which regionservers should host > which > > > > >> regions. > > > > >> Splits are determined by data size of a region. See max store file > > > size. > > > > >> > > > > >> On Thu, Jun 18, 2015 at 7:50 AM, Nasron Cheong > > > > wrote: > > > > >> > > > > >> > Hi, > > > > >> > > > > > >> > I've noticed there are two settings available when using the > HBase > > > > >> balancer > > > > >> > (specifically the default stochastic balancer) > > > > >> > > > > > >> > hbase.master.balancer.stochastic.tableSkewCost > > > > >> > > > > > >> > hbase.master.loadbalance.bytable > > > > >> > > > > > >> > How do these two settings relate? The documentation indicates > when > > > > using > > > > >> > the stochastic balancer that 'bytable' should be set to false? > > > > >> > > > > > >> > Our deployment relies on very few, very large tables, and I've > > > noticed > > > > >> bad > > > > >> > distribution when accessing some of the tables. E.g. there are > 443 > > > > >> regions > > > > >> > for a single table, but when doing a MR job over a full scan of > > the > > > > >> table, > > > > >> > the first 426 regions scan quickly (minutes), but the remaining > 17 > > > > >> regions > > > > >> > take significantly longer (hours) > > > > >> > > > > > >> > My expectation is to have the balancer equalize the size of the > > > > regions > > > > >> for > > > > >> > each table. > > > > >> > > > > > >> > Thanks! > > > > >> > > > > > >> > - Nasron > > > > >> > > > > > >> > > > > > > > > > > > > > > > > -- > > > > Thanks, > > > > Michael Antonov > > > > > > > > > > --f46d04426c688e30ff0518e14a24--