Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 33C411055A for ; Wed, 9 Oct 2013 01:01:12 +0000 (UTC) Received: (qmail 25431 invoked by uid 500); 9 Oct 2013 01:01:09 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 25347 invoked by uid 500); 9 Oct 2013 01:01:09 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 25339 invoked by uid 99); 9 Oct 2013 01:01:09 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Oct 2013 01:01:09 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ramu.malur@gmail.com designates 209.85.160.50 as permitted sender) Received: from [209.85.160.50] (HELO mail-pb0-f50.google.com) (209.85.160.50) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Oct 2013 01:01:04 +0000 Received: by mail-pb0-f50.google.com with SMTP id uo5so108690pbc.9 for ; Tue, 08 Oct 2013 18:00:44 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=/rCUuDM6+s/q5Qm2v8hMVeT+vNPWUN10Ns/D5ZujPA0=; b=QbWbpV+1fbNmGZ0ozX/RLhT2IxLsFlxAIiVGjCKVmCjoTgObxpLWJP8FESQ3kJCL32 L2wDXmgVQSvYHdNIvHQuXqkMYOX3xMjyiKuQlfmWDCDm8PcDUKU6Uif4bdqKcCgygqm2 ZbCSbM6dFHTIuIV32LoSfsQBf+LR6oGSP/faXofNDQalRgJdhNjCBaJvae2L8rE86yJ2 yccRKb9RKKUil0KKuJY1TBEWQ/8F1AHjE77Lpu6nOHo4L5Zq7tTFYXZwq31fuhzcSYT8 Hw8QTOTynmcJYon056DvkveTXqsj+N+M6iBt9+0DQj3vsZROfeZSeUA9FQj/+y3I2lO4 cZ8g== MIME-Version: 1.0 X-Received: by 10.66.65.134 with SMTP id x6mr6489727pas.142.1381280444097; Tue, 08 Oct 2013 18:00:44 -0700 (PDT) Received: by 10.70.103.41 with HTTP; Tue, 8 Oct 2013 18:00:44 -0700 (PDT) In-Reply-To: References: <1381123300.88874.YahooMailNeo@web140602.mail.bf1.yahoo.com> <1381126892.93964.YahooMailNeo@web140603.mail.bf1.yahoo.com> Date: Wed, 9 Oct 2013 10:00:44 +0900 Message-ID: Subject: Re: HBase Random Read latency > 100ms From: Ramu M S To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=001a1136335434ae4404e8446ae9 X-Virus-Checked: Checked by ClamAV on apache.org --001a1136335434ae4404e8446ae9 Content-Type: text/plain; charset=ISO-8859-1 Hi All, After few suggestions from the mails earlier I changed the following, 1. Heap Size to 16 GB 2. Block Size to 16KB 3. HFile size to 8 GB (Table now has 256 regions, 32 per server) 4. Data Locality Index is 100 in all RS I have clients running in 10 machines, each with 4 threads. So total 40. This is same in all tests. Result: 1. Average latency is still >100ms. 2. Heap occupancy is around 2-2.5 GB in all RS Few more tests carried out yesterday, TEST 1: Small data set (100 Million records, each with 724 bytes). =========================================== Configurations: 1. Heap Size to 1 GB 2. Block Size to 16KB 3. HFile size to 1 GB (Table now has 128 regions, 16 per server) 4. Data Locality Index is 100 in all RS I disabled Block Cache on the table, to make sure I read everything from disk, most of the time. Result: 1. Average Latency is 8ms and throughput went up to 6K/Sec per RS. 2. With Block Cache enabled again, I got average latency around 2ms and throughput of 10K/Sec per RS. Heap occupancy around 650 MB 3. Increased the Heap to 16GB, with Block Cache still enabled, I got average latency around 1 ms and throughput 20K/Sec per RS Heap Occupancy around 2-2.5 GB in all RS TEST 2: Large Data set (1.8 Billion records, each with 724 bytes) ================================================== Configurations: 1. Heap Size to 1 GB 2. Block Size to 16KB 3. HFile size to 1 GB (Table now has 2048 regions, 256 per server) 4. Data Locality Index is 100 in all RS Result: 1. Average Latency is > 500ms to start with and gradually decreases, but even after around 100 Million reads it is still >100 ms 2. Block Cache = TRUE/FALSE does not make any difference here. Even Heap Size (1GB / 16GB) does not make any difference. 3. Heap occupancy is around 2-2.5 GB under 16GB Heap and around 650 MB under 1GB Heap. GC Time in all of the scenarios is around 2ms/Second, as shown in the Cloudera Manager. Reading most of the items from Disk in less data scenario gives better results and very low latencies. Number of regions per RS and HFile size does make a huge difference in my Cluster. Keeping 100 Regions per RS as max(Most of the discussions suggest this), should I restrict the HFile size to 1GB? and thus reducing the storage capacity (From 700 GB to 100GB per RS)? Please advice. Thanks, Ramu On Wed, Oct 9, 2013 at 4:58 AM, Vladimir Rodionov wrote: > What are your current heap and block cache sizes? > > Best regards, > Vladimir Rodionov > Principal Platform Engineer > Carrier IQ, www.carrieriq.com > e-mail: vrodionov@carrieriq.com > > ________________________________________ > From: Ramu M S [ramu.malur@gmail.com] > Sent: Monday, October 07, 2013 10:55 PM > To: user@hbase.apache.org > Subject: Re: HBase Random Read latency > 100ms > > Hi All, > > Average Latency is still around 80ms. > I have done the following, > > 1. Enabled Snappy Compression > 2. Reduce the HFile size to 8 GB > > Should I attribute these results to bad Disk Configuration OR anything else > to investigate? > > - Ramu > > > On Tue, Oct 8, 2013 at 10:56 AM, Ramu M S wrote: > > > Vladimir, > > > > Thanks for the Insights into Future Caching features. Looks very > > interesting. > > > > - Ramu > > > > > > On Tue, Oct 8, 2013 at 10:45 AM, Vladimir Rodionov < > > vrodionov@carrieriq.com> wrote: > > > >> Ramu, > >> > >> If your working set of data fits into 192GB you may get additional boost > >> by utilizing OS page cache, or wait until > >> 0.98 release which introduces new bucket cache implementation (port of > >> Facebook L2 cache). You can try vanilla bucket cache in 0.96 (not > released > >> yet > >> but is due soon). Both caches stores data off-heap, but Facebook version > >> can store encoded and compressed data and vanilla bucket cache does not. > >> There are some options how to utilize efficiently available RAM (at > least > >> in upcoming HBase releases) > >> . If your data set does not fit RAM then your only hope is your 24 SAS > >> drives. Depending on your RAID settings, disk IO perf, HDFS > configuration > >> (I think the latest Hadoop is preferable here). > >> > >> OS page cache is most vulnerable and volatile, it can not be controlled > >> and can be easily polluted by either some other processes or by HBase > >> itself (long scan). > >> With Block cache you have more control but the first truly usable > >> *official* implementation is going to be a part of 0.98 release. > >> > >> As far as I understand, your use case would definitely covered by > >> something similar to BigTable ScanCache (RowCache) , but there is no > such > >> cache in HBase yet. > >> One major advantage of RowCache vs BlockCache (apart from being much > more > >> efficient in RAM usage) is resilience to Region compactions. Each minor > >> Region compaction invalidates partially > >> Region's data in BlockCache and major compaction invalidates this > >> Region's data completely. This is not the case with RowCache (would it > be > >> implemented). > >> > >> Best regards, > >> Vladimir Rodionov > >> Principal Platform Engineer > >> Carrier IQ, www.carrieriq.com > >> e-mail: vrodionov@carrieriq.com > >> > >> ________________________________________ > >> From: Ramu M S [ramu.malur@gmail.com] > >> Sent: Monday, October 07, 2013 5:25 PM > >> To: user@hbase.apache.org > >> Subject: Re: HBase Random Read latency > 100ms > >> > >> Vladimir, > >> > >> Yes. I am fully aware of the HDD limitation and wrong configurations wrt > >> RAID. > >> Unfortunately, the hardware is leased from others for this work and I > >> wasn't consulted to decide the h/w specification for the tests that I am > >> doing now. Even the RAID cannot be turned off or set to RAID-0 > >> > >> Production system is according to the Hadoop needs (100 Nodes with 16 > Core > >> CPU, 192 GB RAM, 24 X 600GB SAS Drives, RAID cannot be completely turned > >> off, so we are creating 1 Virtual Disk containing only 1 Physical Disk > and > >> the VD RAID level set to* *RAID-0). These systems are still not > >> available. If > >> you have any suggestion on the production setup, I will be glad to hear. > >> > >> Also, as pointed out earlier, we are planning to use HBase also as an in > >> memory KV store to access the latest data. > >> That's why RAM was considered huge in this configuration. But looks like > >> we > >> would run into more problems than any gains from this. > >> > >> Keeping that aside, I was trying to get the maximum out of the current > >> cluster or as you said Is 500-1000 OPS the max I could get out of this > >> setup? > >> > >> Regards, > >> Ramu > >> > >> > >> > >> Confidentiality Notice: The information contained in this message, > >> including any attachments hereto, may be confidential and is intended > to be > >> read only by the individual or entity to whom this message is > addressed. If > >> the reader of this message is not the intended recipient or an agent or > >> designee of the intended recipient, please note that any review, use, > >> disclosure or distribution of this message or its attachments, in any > form, > >> is strictly prohibited. If you have received this message in error, > please > >> immediately notify the sender and/or Notifications@carrieriq.com and > >> delete or destroy any copy of this message and its attachments. > >> > > > > > > Confidentiality Notice: The information contained in this message, > including any attachments hereto, may be confidential and is intended to be > read only by the individual or entity to whom this message is addressed. If > the reader of this message is not the intended recipient or an agent or > designee of the intended recipient, please note that any review, use, > disclosure or distribution of this message or its attachments, in any form, > is strictly prohibited. If you have received this message in error, please > immediately notify the sender and/or Notifications@carrieriq.com and > delete or destroy any copy of this message and its attachments. > --001a1136335434ae4404e8446ae9--