Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 51AED10614 for ; Sat, 25 Jan 2014 02:56:32 +0000 (UTC) Received: (qmail 32837 invoked by uid 500); 25 Jan 2014 02:56:28 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 32619 invoked by uid 500); 25 Jan 2014 02:56:27 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 32611 invoked by uid 99); 25 Jan 2014 02:56:26 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 25 Jan 2014 02:56:26 +0000 X-ASF-Spam-Status: No, hits=-0.5 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of rohitdevel14@gmail.com designates 209.85.161.194 as permitted sender) Received: from [209.85.161.194] (HELO mail-gg0-f194.google.com) (209.85.161.194) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 25 Jan 2014 02:56:20 +0000 Received: by mail-gg0-f194.google.com with SMTP id x14so433188ggx.5 for ; Fri, 24 Jan 2014 18:55:59 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=gzT5s7rfHCeg4RCtQ6xm87cNsN4cSuXrE5o9b6imVbQ=; b=pNWzIj5vv705C6GJ00E0o5M8rkfPdvlN6P4VHOCfoL6LJRETgHsp2fIBxI0/yVWeJa 2ZE0LMU/YGbYarNuVe9FvqLZh/nAGMGdhSKCuTdl26JD/AFOZDDIpZVaWjX5DNvdfzDO otJb2Cbd6YpUsBCPeKM6iIlzacgssPCYSYdVPhrESio2/iVTmhT6F/JYXJlgsnsyLD4O h0Xwn/BzcR6ViP7NZUuTF0gVdWSFZ1F7sPA6OMlUNrXN9IJ1xMpTsqYcT/D7z0To0s5+ YQxG11uBwf0n374pRR3giqtWs7NPM7o2HF8lTROPTyVhS/gw0tjhw9iSoATJ1Qt3uwnK BugA== MIME-Version: 1.0 X-Received: by 10.236.46.18 with SMTP id q18mr15817773yhb.21.1390618559910; Fri, 24 Jan 2014 18:55:59 -0800 (PST) Received: by 10.170.130.132 with HTTP; Fri, 24 Jan 2014 18:55:59 -0800 (PST) In-Reply-To: References: Date: Fri, 24 Jan 2014 18:55:59 -0800 Message-ID: Subject: Re: Hbase tuning for heavy write cluster From: Rohit Dev To: user@hbase.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org Bryan, This is extremely useful information. I wanted to increase these settings but didn't know how high I could go. btw, are you running tsdb v1 on Java7 ? Thanks On Fri, Jan 24, 2014 at 6:51 PM, Bryan Beaudreault wrote: > Also, I think you can up the hbase.hstore.blockingStoreFiles quite a bit > higher. You could try something like 50. It will reduce read performance > a bit, but shouldn't be too bad especially for something like opentsdb I > think. If you are going to up the blockingStoreFiles you're probably also > going to want to up hbase.hstore.compaction.max. > > For my tsdb cluster, which is 8 i2.4xlarges in EC2, we have 90 regions for > tsdb. We were also having issues with blocking, and I upped > blockingStoreFiles to 35, compaction.max to 15, and > memstore.block.multiplier to 3. We haven't had problems since. Memstore > flushsize for the tsdb table is 512MB. > > Finally, 64GB heap may prove problematic, but it's worth a shot. I'd > definitely recommend java7 with the G1 garbage collector though. In > general, Java would have a hard time with heap sizes greater than 20-25GB > without some careful tuning. > > > On Fri, Jan 24, 2014 at 9:44 PM, Bryan Beaudreault > wrote: > >> It seems from your ingestion rate you are still blowing through HFiles too >> fast. You're going to want to up the MEMSTORE_FLUSHSIZE for the table from >> the default of 128MB. If opentsdb is the only thing on this cluster, you >> can do the math pretty easily to find the maximum allowable, based on your >> heap size and accounting for 40% (default) used for the block cache. >> >> >> On Fri, Jan 24, 2014 at 9:38 PM, Rohit Dev wrote: >> >>> Hi Kevin, >>> >>> We have about 160 regions per server with 16Gig region size and 10 >>> drives for Hbase. I've looked at disk IO and that doesn't seem to be >>> any problem ( % utilization is < 2 across all disk) >>> >>> Any suggestion what heap size I should allocation, normally I allocate >>> 16GB. >>> >>> Also, I read increasing hbase.hstore.blockingStoreFiles and >>> hbase.hregion.memstore.block.multiplier is good idea for write-heavy >>> cluster, but in my case it seem to be heading to wrong direction. >>> >>> Thanks >>> >>> On Fri, Jan 24, 2014 at 6:31 PM, Kevin O'dell >>> wrote: >>> > Rohit, >>> > >>> > 64GB heap is not ideal, you will run into some weird issues. How many >>> > regions are you running per server, how many drives in each node, any >>> other >>> > settings you changed from default? >>> > On Jan 24, 2014 6:22 PM, "Rohit Dev" wrote: >>> > >>> >> Hi, >>> >> >>> >> We are running Opentsdb on CDH 4.3 hbase cluster, with most of the >>> >> default settings. The cluster is heavy on write and I'm trying to see >>> >> what parameters I can tune to optimize the write performance. >>> >> >>> >> >>> >> # I get messages related to Memstore[1] and Slow Response[2] very >>> >> often, is this an indication of any issue ? >>> >> >>> >> I tried increasing some parameters on one node: >>> >> - hbase.hstore.blockingStoreFiles - from default 7 to 15 >>> >> - hbase.hregion.memstore.block.multiplier - from default 2 to 8 >>> >> - and heap size from 16GB to 64GB >>> >> >>> >> * 'Compaction queue' went up to ~200 within 60 mins after restarting >>> >> region server with new parameters and the log started to get even more >>> >> noisy. >>> >> >>> >> Can anyone please suggest if I'm going to right direction with these >>> >> new settings ? or if there are other thing that I could monitor or >>> >> change to make it better. >>> >> >>> >> Thank you! >>> >> >>> >> >>> >> [1] >>> >> INFO org.apache.hadoop.hbase.regionserver.HRegion: Blocking updates >>> >> for 'IPC Server handler 19 on 60020' on region >>> >> >>> >> >>> tsdb,\x008XR\xE0i\x90\x00\x00\x02Q\x7F\x1D\x00\x00(\x00\x0B]\x00\x008M(r\x00\x00Bl\xA7\x8C,1390556781703.0771bf90cab25c503d3400206417f6bf.: >>> >> memstore size 256.3 M is >= than blocking 256 M size >>> >> >>> >> [2] >>> >> WARN org.apache.hadoop.ipc.HBaseServer: (responseTooSlow): >>> >> >>> >> >>> {"processingtimems":17887,"call":"multi(org.apache.hadoop.hbase.client.MultiAction@586940ea >>> >> ), >>> >> rpc version=1, client version=29, >>> >> methodsFingerPrint=0","client":"192.168.10.10:54132 >>> >> >>> ","starttimems":1390587959182,"queuetimems":1498,"class":"HRegionServer","responsesize":0,"method":"multi"} >>> >> >>> >> >>