Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id BCC5A17DE0 for ; Wed, 18 Feb 2015 17:02:29 +0000 (UTC) Received: (qmail 68837 invoked by uid 500); 18 Feb 2015 17:02:27 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 68761 invoked by uid 500); 18 Feb 2015 17:02:27 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 68746 invoked by uid 99); 18 Feb 2015 17:02:26 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Feb 2015 17:02:26 +0000 X-ASF-Spam-Status: No, hits=2.5 required=5.0 tests=FREEMAIL_REPLY,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of yuzhihong@gmail.com designates 209.85.213.50 as permitted sender) Received: from [209.85.213.50] (HELO mail-yh0-f50.google.com) (209.85.213.50) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Feb 2015 17:02:01 +0000 Received: by yhaf73 with SMTP id f73so1582218yha.11 for ; Wed, 18 Feb 2015 09:01:14 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=pzPRS9uM9BZgPgZfUkOzRjCA7DZ2rScD2WFU9C5BM7M=; b=XmTivle34kCX2KwGjpAy4u/obrz3EdGcnN0N91ioJjrGja+oollsYwrir1F456vvmx CxgUeandIEuhEmJisKHs8BmtYC1zI9A7XuHjETuk5SsfMzsHKyUSwdoNtV8lbQDIeZPz kIOgPy9XZy+etbHt3AvFHDtHdZfC5DqAVtxYfmZ7BXjnra+bq9JeE+E23h68CWP3vyY8 vLAu1Jkqx2edHCo+P+u0ttDsN1njTJcvxddOb9WVOFbGeF5VhUMsxvlGgcgX82P8ws/H IkV8DXz5d6a6xo550uDIIJ8qSOPVq8a8Fc7eIuNjZBplS2i9uyv6JX5aMXEzlBRz2nNL UnYw== MIME-Version: 1.0 X-Received: by 10.236.2.200 with SMTP id 48mr98112yhf.23.1424278874105; Wed, 18 Feb 2015 09:01:14 -0800 (PST) Received: by 10.170.111.72 with HTTP; Wed, 18 Feb 2015 09:01:14 -0800 (PST) In-Reply-To: References: Date: Wed, 18 Feb 2015 09:01:14 -0800 Message-ID: Subject: Re: Region balancing query From: Ted Yu To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=089e011845aa5a2e22050f5fc4f7 X-Virus-Checked: Checked by ClamAV on apache.org --089e011845aa5a2e22050f5fc4f7 Content-Type: text/plain; charset=UTF-8 When a region splits, the two daughter regions stay on the same host as the parent region did. It is through balancing that the daughter regions (may) be assigned to other server(s). Cheers On Wed, Feb 18, 2015 at 7:35 AM, Abe Weinograd wrote: > Another question, would this help when we split or only when we balance? > > Thanks, > Abe > > On Mon, Feb 16, 2015 at 11:24 AM, Abe Weinograd wrote: > > > balancer said "true" and it is not disabled. Thanks again for your help. > > > > Abe > > > > On Mon, Feb 16, 2015 at 11:23 AM, Ted Yu wrote: > > > >> What was the output from 'balancer' command ? > >> > >> Was is possible that balancer was disabled ? > >> > >> Cheers > >> > >> On Mon, Feb 16, 2015 at 8:04 AM, Abe Weinograd wrote: > >> > >> > Ok. I forced the balancer run and am not seeing anything after a few > >> > minutes. Master logs isn't showing anything. Should I look at the RS > >> ones > >> > instead? > >> > > >> > On Mon, Feb 16, 2015 at 11:03 AM, Ted Yu wrote: > >> > > >> > > You should see effect in the next balancer run. > >> > > > >> > > Cheers > >> > > > >> > > On Mon, Feb 16, 2015 at 7:52 AM, Abe Weinograd > >> wrote: > >> > > > >> > > > Excellent. If i trigger the balancer, should this start to help or > >> only > >> > > for > >> > > > future region creation? > >> > > > > >> > > > Thanks, > >> > > > Abe > >> > > > > >> > > > On Mon, Feb 16, 2015 at 9:35 AM, Ted Yu > >> wrote: > >> > > > > >> > > > > Yes. This setting should be modified on Master. > >> > > > > > >> > > > > Cheers > >> > > > > > >> > > > > On Mon, Feb 16, 2015 at 6:27 AM, Abe Weinograd > >> > wrote: > >> > > > > > >> > > > > > Thanks Ted. We are putting this in the hbase-site.xml for the > >> > > Master? > >> > > > > > > >> > > > > > Abe > >> > > > > > > >> > > > > > On Fri, Feb 13, 2015 at 5:03 PM, Shahab Yunus < > >> > > shahab.yunus@gmail.com> > >> > > > > > wrote: > >> > > > > > > >> > > > > > > Thanks, we will try that and report back. > >> > > > > > > > >> > > > > > > Regards, > >> > > > > > > Shahab > >> > > > > > > > >> > > > > > > On Fri, Feb 13, 2015 at 4:56 PM, Ted Yu < > yuzhihong@gmail.com> > >> > > wrote: > >> > > > > > > > >> > > > > > > > You can make TableSkewCostFunction more prominent by > >> increasing > >> > > the > >> > > > > > value > >> > > > > > > > for config parameter: > >> > > > > > > > > >> > > > > > > > hbase.master.balancer.stochastic.tableSkewCost > >> > > > > > > > > >> > > > > > > > Its default is 35. > >> > > > > > > > > >> > > > > > > > See if raising to 100 or 200 helps. > >> > > > > > > > > >> > > > > > > > On Fri, Feb 13, 2015 at 1:09 PM, Shahab Yunus < > >> > > > > shahab.yunus@gmail.com> > >> > > > > > > > wrote: > >> > > > > > > > > >> > > > > > > > > Yes, this sever hosts other regions from other tables as > >> > well. > >> > > > > > > > > > >> > > > > > > > > Regards > >> > > > > > > > > Shahab > >> > > > > > > > > > >> > > > > > > > > On Fri, Feb 13, 2015 at 1:45 PM, Ted Yu < > >> yuzhihong@gmail.com > >> > > > >> > > > > wrote: > >> > > > > > > > > > >> > > > > > > > > > Interesting, server7.ec3.internal,60020,1423845018628 > >> was > >> > > > > > > consistently > >> > > > > > > > > > chosen as destination for the table. > >> > > > > > > > > > Did server7.ec3.internal,60020,1423845018628 host > >> regions > >> > > from > >> > > > > > other > >> > > > > > > > > table > >> > > > > > > > > > ? > >> > > > > > > > > > > >> > > > > > > > > > Cheers > >> > > > > > > > > > > >> > > > > > > > > > On Fri, Feb 13, 2015 at 10:27 AM, Shahab Yunus < > >> > > > > > > shahab.yunus@gmail.com > >> > > > > > > > > > >> > > > > > > > > > wrote: > >> > > > > > > > > > > >> > > > > > > > > > > Table name is: > >> > > > > > > > > > > MYTABLE_RECENT_4W_V2 > >> > > > > > > > > > > > >> > > > > > > > > > > Pastebin snippet 1: http://pastebin.com/dQzMhGyP > >> > > > > > > > > > > Pastebin snippet 2: http://pastebin.com/Y7ZsNAgF > >> > > > > > > > > > > > >> > > > > > > > > > > This is the master log after invoking balancer > command > >> > from > >> > > > > hbase > >> > > > > > > > > shell. > >> > > > > > > > > > > > >> > > > > > > > > > > Regards, > >> > > > > > > > > > > Shahab > >> > > > > > > > > > > > >> > > > > > > > > > > On Fri, Feb 13, 2015 at 12:00 PM, Ted Yu < > >> > > > yuzhihong@gmail.com> > >> > > > > > > > wrote: > >> > > > > > > > > > > > >> > > > > > > > > > > > bq. all the regions of this table were back on > this > >> > same > >> > > > RS! > >> > > > > > > > > > > > > >> > > > > > > > > > > > Interesting. Please check master log around the > time > >> > this > >> > > > RS > >> > > > > > was > >> > > > > > > > > > brought > >> > > > > > > > > > > > online. You can pastebin the relevant snippet. > >> > > > > > > > > > > > > >> > > > > > > > > > > > Thanks > >> > > > > > > > > > > > > >> > > > > > > > > > > > On Fri, Feb 13, 2015 at 8:55 AM, Shahab Yunus < > >> > > > > > > > > shahab.yunus@gmail.com> > >> > > > > > > > > > > > wrote: > >> > > > > > > > > > > > > >> > > > > > > > > > > > > Hi Ted. > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > Yes, the cluster itself is balanced. On average > >> 300 > >> > > > regions > >> > > > > > per > >> > > > > > > > > node > >> > > > > > > > > > on > >> > > > > > > > > > > > 10 > >> > > > > > > > > > > > > nodes. > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > # of tables is 53 of varying sizes. > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > Balancer was invoked and it didn't do anything > >> (i.e. > >> > no > >> > > > > > > movement > >> > > > > > > > of > >> > > > > > > > > > > > > regions) but we didn't check the master's logs. > We > >> > can > >> > > do > >> > > > > > that. > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > Interestingly, we restarted the RS which was > >> holding > >> > > all > >> > > > > the > >> > > > > > > > > regions > >> > > > > > > > > > of > >> > > > > > > > > > > > > this one table. The regions were nicely spread > >> out to > >> > > the > >> > > > > > > > remaining > >> > > > > > > > > > RS. > >> > > > > > > > > > > > But > >> > > > > > > > > > > > > when we brought back this RS, all the regions of > >> this > >> > > > table > >> > > > > > > were > >> > > > > > > > > back > >> > > > > > > > > > > on > >> > > > > > > > > > > > > this same RS! > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > Thanks. > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > Regards, > >> > > > > > > > > > > > > Shahab > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > On Fri, Feb 13, 2015 at 11:46 AM, Ted Yu < > >> > > > > > yuzhihong@gmail.com> > >> > > > > > > > > > wrote: > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > > How many tables are there in your cluster ? > >> > > > > > > > > > > > > > > >> > > > > > > > > > > > > > Is the cluster balanced overall (in terms of > >> number > >> > > of > >> > > > > > > regions > >> > > > > > > > > per > >> > > > > > > > > > > > > server) > >> > > > > > > > > > > > > > but this table is not ? > >> > > > > > > > > > > > > > > >> > > > > > > > > > > > > > What happens (check master log) when you issue > >> > > > 'balancer' > >> > > > > > > > command > >> > > > > > > > > > > > through > >> > > > > > > > > > > > > > shell ? > >> > > > > > > > > > > > > > > >> > > > > > > > > > > > > > Cheers > >> > > > > > > > > > > > > > > >> > > > > > > > > > > > > > On Fri, Feb 13, 2015 at 8:19 AM, Shahab Yunus > < > >> > > > > > > > > > > shahab.yunus@gmail.com> > >> > > > > > > > > > > > > > wrote: > >> > > > > > > > > > > > > > > >> > > > > > > > > > > > > > > CDH 5.3 > >> > > > > > > > > > > > > > > HBase 98.6 > >> > > > > > > > > > > > > > > > >> > > > > > > > > > > > > > > We are writing data to an HBase table > through > >> a > >> > M/R > >> > > > > job. > >> > > > > > We > >> > > > > > > > pre > >> > > > > > > > > > > split > >> > > > > > > > > > > > > the > >> > > > > > > > > > > > > > > table before each job run. The problem is > that > >> > most > >> > > > of > >> > > > > > the > >> > > > > > > > > > regions > >> > > > > > > > > > > > end > >> > > > > > > > > > > > > up > >> > > > > > > > > > > > > > > on the same RS. This results in that one RS > >> being > >> > > > > > severely > >> > > > > > > > > > > overloaded > >> > > > > > > > > > > > > and > >> > > > > > > > > > > > > > > subsequent M/R jobs failing trying to write > to > >> > the > >> > > > > > regions > >> > > > > > > on > >> > > > > > > > > > that > >> > > > > > > > > > > > RS. > >> > > > > > > > > > > > > > > > >> > > > > > > > > > > > > > > The balancer is on and the split policy is > >> > default. > >> > > > No > >> > > > > > > > changes > >> > > > > > > > > > > there. > >> > > > > > > > > > > > > It > >> > > > > > > > > > > > > > is > >> > > > > > > > > > > > > > > a 10 node cluster. > >> > > > > > > > > > > > > > > > >> > > > > > > > > > > > > > > All other related properties are defaults > too. > >> > > > > > > > > > > > > > > > >> > > > > > > > > > > > > > > Any idea, how can we force balancing of the > >> new > >> > > > > regions? > >> > > > > > Do > >> > > > > > > > we > >> > > > > > > > > > have > >> > > > > > > > > > > > to > >> > > > > > > > > > > > > > > consider compaction into the equation as > well? > >> > > > Thanks. > >> > > > > > > > > > > > > > > > >> > > > > > > > > > > > > > > Regards, > >> > > > > > > > > > > > > > > Shahab > >> > > > > > > > > > > > > > > > >> > > > > > > > > > > > > > > >> > > > > > > > > > > > > > >> > > > > > > > > > > > > >> > > > > > > > > > > > >> > > > > > > > > > > >> > > > > > > > > > >> > > > > > > > > >> > > > > > > > >> > > > > > > >> > > > > > >> > > > > >> > > > >> > > >> > > > > > --089e011845aa5a2e22050f5fc4f7--