Subject: Re: Hbase Unusable after auto split to 1024 regions
From: Pere Kyle
Date: Wed, 5 Nov 2014 22:14:23 -0800
Message-Id: <89A10F47-55E9-474C-BB20-206D73B4DF53@whisper.sh>
To: user@hbase.apache.org

Ted,

Thanks so much for that information. I now see why it split so often,
but what I am not sure of is how to fix this without blowing away the
cluster. Add more heap?

Another symptom I have noticed is that load on the Master instance's
HBase daemon has been pretty high (load average 4.0, whereas it used to
be 1.0).

Thanks,
Pere

On Nov 5, 2014, at 9:56 PM, Ted Yu wrote:

> IncreasingToUpperBoundRegionSplitPolicy is the default split policy.
>
> You can read the javadoc of this class to see how it works.
>
> Cheers
>
> On Wed, Nov 5, 2014 at 9:39 PM, Ted Yu wrote:
>
>> Can you provide a bit more information (such as HBase release)?
>>
>> If you pastebin one of the region servers' logs, that would help us
>> determine the cause.
>>
>> Cheers
>>
>>
>> On Wed, Nov 5, 2014 at 9:29 PM, Pere Kyle wrote:
>>
>>> Hello,
>>>
>>> Recently our cluster, which had been running fine for 2 weeks, split
>>> to 1024 regions at 1GB per region; after this split the cluster is
>>> unusable.
>>> Using the performance benchmark I was getting a little better than
>>> 100 w/s, whereas before it was 5000 w/s. There are 15 nodes of
>>> m2.2xlarge with 8GB heap reserved for HBase.
>>>
>>> Any ideas? I am stumped.
>>>
>>> Thanks,
>>> Pere
>>>
>>> Here is the current hbase-site.xml:
>>>
>>> <configuration>
>>>   <property><name>hbase.snapshot.enabled</name><value>true</value></property>
>>>   <property><name>fs.hdfs.impl</name><value>emr.hbase.fs.BlockableFileSystem</value></property>
>>>   <property><name>hbase.regionserver.handler.count</name><value>50</value></property>
>>>   <property><name>hbase.cluster.distributed</name><value>true</value></property>
>>>   <property><name>hbase.tmp.dir</name><value>/mnt/var/lib/hbase/tmp-data</value></property>
>>>   <property><name>hbase.master.wait.for.log.splitting</name><value>true</value></property>
>>>   <property><name>hbase.hregion.memstore.flush.size</name><value>134217728</value></property>
>>>   <property><name>hbase.hregion.max.filesize</name><value>5073741824</value></property>
>>>   <property><name>zookeeper.session.timeout</name><value>60000</value></property>
>>>   <property><name>hbase.thrift.maxQueuedRequests</name><value>0</value></property>
>>>   <property><name>hbase.client.scanner.caching</name><value>1000</value></property>
>>>   <property><name>hbase.hregion.memstore.block.multiplier</name><value>4</value></property>
>>> </configuration>
>>>
>>> hbase-env.sh
>>> # The maximum amount of heap to use, in MB. Default is 1000.
>>> export HBASE_HEAPSIZE=8000
>>>
>>> # Extra Java runtime options.
>>> # Below are what we set by default. May only work with SUN JVM.
>>> # For more on why as well as other possible settings,
>>> # see http://wiki.apache.org/hadoop/PerformanceTuning
>>> export HBASE_OPTS="-XX:+UseConcMarkSweepGC"
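
Ted's pointer to the IncreasingToUpperBoundRegionSplitPolicy javadoc can be made concrete with a little arithmetic. The sketch below is not HBase code. It assumes the rule as described in the 0.94-era javadoc: the split threshold is the memstore flush size times the square of the number of regions of the table on that region server, capped at hbase.hregion.max.filesize. The HBase release is not stated in this thread, and later releases grow the threshold differently, so the exponent is an assumption. The helper name split_size_bytes is hypothetical; the two sizes are the values from the hbase-site.xml quoted above.

```python
# Rough sketch of the split-threshold arithmetic, NOT actual HBase code.
# Sizes are copied from the hbase-site.xml quoted in this thread.
FLUSH_SIZE = 134217728      # hbase.hregion.memstore.flush.size (128 MiB)
MAX_FILESIZE = 5073741824   # hbase.hregion.max.filesize


def split_size_bytes(regions_on_server, flush_size=FLUSH_SIZE,
                     max_filesize=MAX_FILESIZE, exponent=2):
    """Hypothetical helper: size at which a region would be split.

    Assumes the 0.94-era rule: flush size times the squared count of the
    table's regions on this region server, capped at the max file size.
    The exponent is a parameter because other releases differ.
    """
    return min(max_filesize, flush_size * regions_on_server ** exponent)


if __name__ == "__main__":
    # Print the threshold for the first few region counts per server.
    for n in range(1, 9):
        print(n, "regions ->", split_size_bytes(n) // (1024 * 1024), "MiB")
```

Under these assumptions the threshold starts at the flush size (128 MiB) when a server holds a single region of the table and only reaches the configured maximum once a server holds seven, which is one way a young table can split many times before the cap applies.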