Mailing-List: contact user-help@hbase.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hbase.apache.org
MIME-Version: 1.0
In-Reply-To: 
 <CALte62xgt99VDiYy5j3YZVBZ4wDj0pdE8s0VBNYmdTwhZV2-xA@mail.gmail.com>
References: 
 <CALCVZZuMUfLDxgz87F3jjBF90Z8+vH-4xdXZnv7NsAjEDhOh7g@mail.gmail.com>
	<CALte62z5uAWF4DbO6Sa32WuWh4iF1AdJ1m85GP-nPtvstDmo6A@mail.gmail.com>
	<CALCVZZsGgEy1BR2Bih_d5znAWL=yKq0pq8iN3SynoVUxneTVAA@mail.gmail.com>
	<CALte62xgt99VDiYy5j3YZVBZ4wDj0pdE8s0VBNYmdTwhZV2-xA@mail.gmail.com>
Date: Fri, 14 Feb 2014 19:00:13 -0600
Message-ID: 
 <CALCVZZu7aO30J3c-L-KAD51Cu4z5dMzrUWpmPgkVDcPTe6afFQ@mail.gmail.com>
Subject: Re: uneven region distribution
From: Rohit Kelkar <rohitkelkar@gmail.com>
To: "user@hbase.apache.org" <user@hbase.apache.org>
Content-Type: multipart/alternative; boundary=047d7b2e3f3ce6878c04f2677199

--047d7b2e3f3ce6878c04f2677199
Content-Type: text/plain; charset=ISO-8859-1

Thanks for your input.
I am sharing the master log: http://pastebin.com/Xi9P6Ykr
and the log of the failed region server: http://pastebin.com/1munghDv

- R


On Fri, Feb 14, 2014 at 6:24 PM, Ted Yu <yuzhihong@gmail.com> wrote:

> Looking at the bug fixes since 0.94.2, I wonder if you are experiencing the
> following, which went into 0.94.10:
> HBASE-8432 a table with unbalanced regions will balance indefinitely
>
> The master log would tell us more.
>
>
> On Fri, Feb 14, 2014 at 4:18 PM, Rohit Kelkar <rohitkelkar@gmail.com>
> wrote:
>
> > Sorry, I misstated the version; it's 0.94.2.
> >
> > - R
> >
> >
> > On Fri, Feb 14, 2014 at 5:59 PM, Ted Yu <yuzhihong@gmail.com> wrote:
> >
> > > bq.  it does not change the status of the assignments.
> > >
> > > Can you check / pastebin the master log to see what caused the balancing
> > > to stop?
> > >
> > > bq. attributing the region server crash to the disproportionately high
> > > number of regions on that server?
> > >
> > > Checking the region server log on server5 should give us more clues.
> > >
> > > bq. 0.92.4
> > >
> > > Please consider upgrading :-)
> > >
> > >
> > > On Fri, Feb 14, 2014 at 3:52 PM, Rohit Kelkar <rohitkelkar@gmail.com>
> > > wrote:
> > >
> > > > > I am using hbase version 0.92.4 on a 5 node cluster. I am seeing that
> > > > > a particular region server often crashes. A status 'simple' on hbase
> > > > > shell gives the following stats
> > > >
> > > >
> > > > > HBase Shell; enter 'help<RETURN>' for list of supported commands.
> > > > > Type "exit<RETURN>" to leave the HBase Shell
> > > > > Version 0.94.2, r1395367, Sun Oct 7 19:11:01 UTC 2012
> > > > >
> > > > > status 'simple'
> > > > > 4 live servers
> > > > >     server7:60020 1392017875910
> > > > >         requestsPerSecond=0, numberOfOnlineRegions=419,
> > > > >         usedHeapMB=3315, maxHeapMB=6127
> > > > >     server4:60020 1392300859332
> > > > >         requestsPerSecond=843, numberOfOnlineRegions=379,
> > > > >         usedHeapMB=2070, maxHeapMB=6127
> > > > >     server3:60020 1391583646998
> > > > >         requestsPerSecond=429, numberOfOnlineRegions=653,
> > > > >         usedHeapMB=3198, maxHeapMB=6127
> > > > >     server6:60020 1391583647588
> > > > >         requestsPerSecond=0, numberOfOnlineRegions=966,
> > > > >         usedHeapMB=2975, maxHeapMB=6127
> > > > > 1 dead servers
> > > > >     server5,60020,1392108515637
> > > > > Aggregate load: 1272, regions: 2417
> > > >
> > > > > The dead region server has 2417 regions as opposed to 419, 379, 653,
> > > > > 966 regions on other servers. Am I right in attributing the region
> > > > > server crash to the disproportionately high number of regions on that
> > > > > server?
> > > >
> > > > > If I invoke the balancer on hbase shell using the "balancer" command
> > > > > it returns true. But it does not change the status of the assignments.
> > > >
> > > > - R
> > > >
> > >
> >
>
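A quick sanity check on the numbers in the quoted status output: the four
live servers' region counts add up to exactly 2417 and their
requestsPerSecond values add up to 1272. That suggests the
"Aggregate load: 1272, regions: 2417" figures are cluster-wide totals rather
than the region count of the dead server5. A minimal Python sketch of that
arithmetic (the variable names are illustrative; the numbers are copied from
the status output):

```python
# Per-server figures copied from the quoted `status 'simple'` output.
# Variable names are illustrative and are not part of any HBase API.
live_regions = {"server7": 419, "server4": 379, "server3": 653, "server6": 966}
live_requests = {"server7": 0, "server4": 843, "server3": 429, "server6": 0}

total_regions = sum(live_regions.values())
total_requests = sum(live_requests.values())
mean_regions = total_regions / len(live_regions)

print("total regions on live servers:", total_regions)    # 2417
print("total requests/sec on live servers:", total_requests)  # 1272

# Show how far each live server is from an even split of the regions.
for server, count in sorted(live_regions.items(), key=lambda kv: kv[1]):
    print(f"{server}: {count} regions ({count / mean_regions:.2f}x the mean)")
```

If that reading is right, the imbalance to look at is the one among the live
servers (379 to 966 regions each), not a single server holding 2417 regions.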

--047d7b2e3f3ce6878c04f2677199--