Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 39B0510583 for ; Mon, 7 Oct 2013 05:43:04 +0000 (UTC) Received: (qmail 28758 invoked by uid 500); 7 Oct 2013 05:42:47 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 28714 invoked by uid 500); 7 Oct 2013 05:42:42 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 28698 invoked by uid 99); 7 Oct 2013 05:42:41 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Oct 2013 05:42:41 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ramu.malur@gmail.com designates 209.85.192.180 as permitted sender) Received: from [209.85.192.180] (HELO mail-pd0-f180.google.com) (209.85.192.180) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Oct 2013 05:42:36 +0000 Received: by mail-pd0-f180.google.com with SMTP id y10so6624525pdj.39 for ; Sun, 06 Oct 2013 22:42:15 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=KN/H2pWmJQ6iZcpRws7so/Tu3wNNrr9EwHfvb/KCCA0=; b=YBenKL8P1+XLlsyjnJoXe2z/K/jxi28J2IeEbC0mwgvHjwAfF/SJzZEAx4ehLzWoHN TmQo7axKteFhVzJEgUfI152Yinq7nt4hqD1sXgLtlpHhv/EfD3rhvKO8Nzjo2FAL7nlM 9O1CybvRTuwvyEH0SVLxfTo9kZ/pjVz+6Ac8PIgISvBZfBt/eRVy3fmejIEVetXM2lp/ WzWwqroMaglEwWmTP7LZm7oCmql3XaupzhxCU91aeLA9seNQP6feV0LUzfzxrxUkq0uS FbRbvtia1v0DpUlJKcwfHZhqcLqwisQdsRWKaWc30kzv7EMvoMG64Zkinl9ilPtP8Eaw W7CA== MIME-Version: 1.0 X-Received: by 10.66.231.42 with SMTP id td10mr1178822pac.144.1381124535789; Sun, 06 Oct 2013 22:42:15 -0700 (PDT) Received: by 10.70.103.41 with HTTP; Sun, 6 Oct 2013 22:42:15 -0700 (PDT) In-Reply-To: <1381123300.88874.YahooMailNeo@web140602.mail.bf1.yahoo.com> References: <1381123300.88874.YahooMailNeo@web140602.mail.bf1.yahoo.com> Date: Mon, 7 Oct 2013 14:42:15 +0900 Message-ID: Subject: Re: HBase Random Read latency > 100ms From: Ramu M S To: user@hbase.apache.org, lars hofhansl Content-Type: multipart/alternative; boundary=047d7b111d0758aec304e8201de1 X-Virus-Checked: Checked by ClamAV on apache.org --047d7b111d0758aec304e8201de1 Content-Type: text/plain; charset=ISO-8859-1 Lars, - Yes Short Circuit reading is enabled on both HDFS and HBase. - I had issued Major compaction after table is loaded. - Region Servers have max heap set as 128 GB. Block Cache Size is 0.25 of heap (So 32 GB for each Region Server) Do we need even more? - Decreasing HFile Size (Default is 1GB )? Should I leave it to default? - Keys are Zipfian distributed (By YCSB) Bharath, Bloom Filters are enabled. Here is my table details, {NAME => 'usertable', FAMILIES => [{NAME => 'cf', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROWCOL', REPLICATION_SCOPE => '0', VERSIONS => '1', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '16384', IN_MEMORY => 'false', ENCODE_ON_DISK => 'true', BLOCKCACHE => 'true'}]} When the data size is around 100GB (100 Million records), then the latency is very good. I am getting a throughput of around 300K OPS. In both cases (100 GB and 1.8 TB) Ganglia stats show that Disk reads are around 50-60 MB/s throughout the read cycle. Thanks, Ramu On Mon, Oct 7, 2013 at 2:21 PM, lars hofhansl wrote: > Have you enabled short circuit reading? See here: > http://hbase.apache.org/book/perf.hdfs.html > > How's your data locality (shown on the RegionServer UI page). > > > How much memory are you giving your RegionServers? > If you reads are truly random and the data set does not fit into the > aggregate cache, you'll be dominated by the disk and network. > Each read would need to bring in a 64k (default) HFile block. If short > circuit reading is not enabled you'll get two or three context switches. > > So I would try: > 1. Enable short circuit reading > 2. Increase the block cache size per RegionServer > 3. Decrease the HFile block size > 4. Make sure your data is local (if it is not, issue a major compaction). > > > -- Lars > > > > ________________________________ > From: Ramu M S > To: user@hbase.apache.org > Sent: Sunday, October 6, 2013 10:01 PM > Subject: HBase Random Read latency > 100ms > > > Hi All, > > My HBase cluster has 8 Region Servers (CDH 4.4.0, HBase 0.94.6). > > Each Region Server is with the following configuration, > 16 Core CPU, 192 GB RAM, 800 GB SATA (7200 RPM) Disk > (Unfortunately configured with RAID 1, can't change this as the Machines > are leased temporarily for a month). > > I am running YCSB benchmark tests on HBase and currently inserting around > 1.8 Billion records. > (1 Key + 7 Fields of 100 Bytes = 724 Bytes per record) > > Currently I am getting a write throughput of around 100K OPS, but random > reads are very very slow, all gets have more than 100ms or more latency. > > I have changed the following default configuration, > 1. HFile Size: 16GB > 2. HDFS Block Size: 512 MB > > Total Data size is around 1.8 TB (Excluding the replicas). > My Table is split into 128 Regions (No pre-splitting used, started with 1 > and grew to 128 over the insertion time) > > Taking some inputs from earlier discussions I have done the following > changes to disable Nagle (In both Client and Server hbase-site.xml, > hdfs-site.xml) > > > hbase.ipc.client.tcpnodelay > true > > > > ipc.server.tcpnodelay > true > > > Ganglia stats shows large CPU IO wait (>30% during reads). > > I agree that disk configuration is not ideal for Hadoop cluster, but as > told earlier it can't change for now. > I feel the latency is way beyond any reported results so far. > > Any pointers on what can be wrong? > > Thanks, > Ramu > --047d7b111d0758aec304e8201de1--