From: Abe Weinograd
Date: Mon, 16 Feb 2015 11:24:21 -0500
Subject: Re: Region balancing query
To: user@hbase.apache.org

The balancer command returned "true", and the balancer is not disabled.

Thanks again for your help.

Abe

On Mon, Feb 16, 2015 at 11:23 AM, Ted Yu wrote:

> What was the output from the 'balancer' command?
>
> Is it possible that the balancer was disabled?
>
> Cheers
>
> On Mon, Feb 16, 2015 at 8:04 AM, Abe Weinograd wrote:
>
> > Ok. I forced a balancer run and am not seeing anything after a few
> > minutes. The Master log isn't showing anything. Should I look at the
> > RS logs instead?
> >
> > On Mon, Feb 16, 2015 at 11:03 AM, Ted Yu wrote:
> >
> > > You should see the effect in the next balancer run.
> > >
> > > Cheers
> > >
> > > On Mon, Feb 16, 2015 at 7:52 AM, Abe Weinograd wrote:
> > >
> > > > Excellent. If I trigger the balancer, should this start to help,
> > > > or only for future region creation?
> > > >
> > > > Thanks,
> > > > Abe
> > > >
> > > > On Mon, Feb 16, 2015 at 9:35 AM, Ted Yu wrote:
> > > >
> > > > > Yes. This setting should be modified on the Master.
> > > > >
> > > > > Cheers
> > > > >
> > > > > On Mon, Feb 16, 2015 at 6:27 AM, Abe Weinograd wrote:
> > > > >
> > > > > > Thanks Ted. We are putting this in the hbase-site.xml for
> > > > > > the Master?
> > > > > >
> > > > > > Abe
> > > > > >
> > > > > > On Fri, Feb 13, 2015 at 5:03 PM, Shahab Yunus
> > > > > > <shahab.yunus@gmail.com> wrote:
> > > > > >
> > > > > > > Thanks, we will try that and report back.
> > > > > > >
> > > > > > > Regards,
> > > > > > > Shahab
> > > > > > >
> > > > > > > On Fri, Feb 13, 2015 at 4:56 PM, Ted Yu wrote:
> > > > > > >
> > > > > > > > You can make TableSkewCostFunction more prominent by
> > > > > > > > increasing the value of the config parameter:
> > > > > > > >
> > > > > > > > hbase.master.balancer.stochastic.tableSkewCost
> > > > > > > >
> > > > > > > > Its default is 35.
> > > > > > > >
> > > > > > > > See if raising it to 100 or 200 helps.
> > > > > > > >
> > > > > > > > On Fri, Feb 13, 2015 at 1:09 PM, Shahab Yunus
> > > > > > > > <shahab.yunus@gmail.com> wrote:
> > > > > > > >
> > > > > > > > > Yes, this server hosts regions from other tables as
> > > > > > > > > well.
> > > > > > > > >
> > > > > > > > > Regards
> > > > > > > > > Shahab
> > > > > > > > >
> > > > > > > > > On Fri, Feb 13, 2015 at 1:45 PM, Ted Yu
> > > > > > > > > <yuzhihong@gmail.com> wrote:
> > > > > > > > >
> > > > > > > > > > Interesting,
> > > > > > > > > > server7.ec3.internal,60020,1423845018628 was
> > > > > > > > > > consistently chosen as the destination for the
> > > > > > > > > > table.
> > > > > > > > > > Did server7.ec3.internal,60020,1423845018628 host
> > > > > > > > > > regions from other tables?
> > > > > > > > > >
> > > > > > > > > > Cheers
> > > > > > > > > >
> > > > > > > > > > On Fri, Feb 13, 2015 at 10:27 AM, Shahab Yunus
> > > > > > > > > > <shahab.yunus@gmail.com> wrote:
> > > > > > > > > >
> > > > > > > > > > > Table name is:
> > > > > > > > > > > MYTABLE_RECENT_4W_V2
> > > > > > > > > > >
> > > > > > > > > > > Pastebin snippet 1: http://pastebin.com/dQzMhGyP
> > > > > > > > > > > Pastebin snippet 2: http://pastebin.com/Y7ZsNAgF
> > > > > > > > > > >
> > > > > > > > > > > This is the master log after invoking the
> > > > > > > > > > > balancer command from the hbase shell.
> > > > > > > > > > >
> > > > > > > > > > > Regards,
> > > > > > > > > > > Shahab
> > > > > > > > > > >
> > > > > > > > > > > On Fri, Feb 13, 2015 at 12:00 PM, Ted Yu
> > > > > > > > > > > <yuzhihong@gmail.com> wrote:
> > > > > > > > > > >
> > > > > > > > > > > > bq. all the regions of this table were back on
> > > > > > > > > > > > this same RS!
> > > > > > > > > > > >
> > > > > > > > > > > > Interesting. Please check the master log around
> > > > > > > > > > > > the time this RS was brought online. You can
> > > > > > > > > > > > pastebin the relevant snippet.
> > > > > > > > > > > >
> > > > > > > > > > > > Thanks
> > > > > > > > > > > >
> > > > > > > > > > > > On Fri, Feb 13, 2015 at 8:55 AM, Shahab Yunus
> > > > > > > > > > > > <shahab.yunus@gmail.com> wrote:
> > > > > > > > > > > >
> > > > > > > > > > > > > Hi Ted.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Yes, the cluster itself is balanced. On
> > > > > > > > > > > > > average 300 regions per node on 10 nodes.
> > > > > > > > > > > > >
> > > > > > > > > > > > > # of tables is 53 of varying sizes.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Balancer was invoked and it didn't do
> > > > > > > > > > > > > anything (i.e. no movement of regions) but
> > > > > > > > > > > > > we didn't check the master's logs. We can
> > > > > > > > > > > > > do that.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Interestingly, we restarted the RS which was
> > > > > > > > > > > > > holding all the regions of this one table.
> > > > > > > > > > > > > The regions were nicely spread out to the
> > > > > > > > > > > > > remaining RSs. But when we brought back this
> > > > > > > > > > > > > RS, all the regions of this table were back
> > > > > > > > > > > > > on this same RS!
> > > > > > > > > > > > >
> > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > >
> > > > > > > > > > > > > Regards,
> > > > > > > > > > > > > Shahab
> > > > > > > > > > > > >
> > > > > > > > > > > > > On Fri, Feb 13, 2015 at 11:46 AM, Ted Yu
> > > > > > > > > > > > > <yuzhihong@gmail.com> wrote:
> > > > > > > > > > > > >
> > > > > > > > > > > > > > How many tables are there in your
> > > > > > > > > > > > > > cluster?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Is the cluster balanced overall (in terms
> > > > > > > > > > > > > > of number of regions per server) but this
> > > > > > > > > > > > > > table is not?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > What happens (check master log) when you
> > > > > > > > > > > > > > issue the 'balancer' command through the
> > > > > > > > > > > > > > shell?
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Cheers
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > On Fri, Feb 13, 2015 at 8:19 AM, Shahab
> > > > > > > > > > > > > > Yunus <shahab.yunus@gmail.com> wrote:
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > > CDH 5.3
> > > > > > > > > > > > > > > HBase 0.98.6
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > We are writing data to an HBase table
> > > > > > > > > > > > > > > through a M/R job. We pre-split the
> > > > > > > > > > > > > > > table before each job run. The problem
> > > > > > > > > > > > > > > is that most of the regions end up on
> > > > > > > > > > > > > > > the same RS. This results in that one
> > > > > > > > > > > > > > > RS being severely overloaded and
> > > > > > > > > > > > > > > subsequent M/R jobs failing when they
> > > > > > > > > > > > > > > try to write to the regions on that RS.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > The balancer is on and the split policy
> > > > > > > > > > > > > > > is default. No changes there. It is a
> > > > > > > > > > > > > > > 10 node cluster.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > All other related properties are
> > > > > > > > > > > > > > > defaults too.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Any idea how we can force balancing of
> > > > > > > > > > > > > > > the new regions? Do we have to consider
> > > > > > > > > > > > > > > compaction in the equation as well?
> > > > > > > > > > > > > > > Thanks.
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > > > Regards,
> > > > > > > > > > > > > > > Shahab
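
For anyone reading this thread later, here is a minimal sketch of how the tableSkewCost change discussed above might look in the Master's hbase-site.xml. The property name and the default of 35 come from the thread; the value 100 is simply the lower of the two values suggested there, not a tuned recommendation.

```xml
<!-- Goes inside the <configuration> element of hbase-site.xml on the
     HBase Master. Sketch only: 100 is one of the two values suggested
     in the thread (the stated default is 35). -->
<property>
  <name>hbase.master.balancer.stochastic.tableSkewCost</name>
  <value>100</value>
</property>
```

As noted in the thread, the effect should show up in the next balancer run, which can be triggered with the 'balancer' command from the hbase shell.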