Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6806EFFAB for ; Mon, 1 Apr 2013 11:34:07 +0000 (UTC) Received: (qmail 5946 invoked by uid 500); 1 Apr 2013 11:34:05 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 5835 invoked by uid 500); 1 Apr 2013 11:34:04 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 5796 invoked by uid 99); 1 Apr 2013 11:34:03 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Apr 2013 11:34:03 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of azuryyyu@gmail.com designates 209.85.223.174 as permitted sender) Received: from [209.85.223.174] (HELO mail-ie0-f174.google.com) (209.85.223.174) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Apr 2013 11:33:58 +0000 Received: by mail-ie0-f174.google.com with SMTP id aq17so2157201iec.19 for ; Mon, 01 Apr 2013 04:33:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:content-type; bh=DxM9kexDbLNp9jl3vW6ygjVDWLrUPLCdslHw6SfJOfM=; b=QGBx0i9gWaqDN1afMmTcbV/TCF9qZ0vH0f4miNlrG47Mzh3WQuSlr9RuQ4S2V08MBk G8cAtHxW0F2jTJH97KvGIBXsuJT06EWdPrBiniC640c32PPc4UlpsppmmONHrA+B4yb2 uYmI9cZ3jIRXGDt3G75U4pfV+VCK2PkuSaGYj9Y/cO1T4TXlXxevAUnuYn83/ee20Hoi paTsMtWE/LgjtThyHXkD2dBiEozmIl55y1N6ug4UiWRPRgNZdXOOFR04INR9aucK5h3d dklU+ZDjfp/8KDPqIRsjaLtFJk/In6qkn6sA+L0WZRI8gmzXp21Qw3Bkfg6zphxSDOyO z5xw== MIME-Version: 1.0 X-Received: by 10.50.20.69 with SMTP id l5mr3042809ige.106.1364816018185; Mon, 01 Apr 2013 04:33:38 -0700 (PDT) Received: by 10.64.26.70 with HTTP; Mon, 1 Apr 2013 04:33:37 -0700 (PDT) Received: by 10.64.26.70 with HTTP; Mon, 1 Apr 2013 04:33:37 -0700 (PDT) In-Reply-To: References: Date: Mon, 1 Apr 2013 19:33:37 +0800 Message-ID: Subject: Re: Read thruput From: Azuryy Yu To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=047d7bd76ae6f2a1e704d94afd48 X-Virus-Checked: Checked by ClamAV on apache.org --047d7bd76ae6f2a1e704d94afd48 Content-Type: text/plain; charset=ISO-8859-1 can you output GC log? CMS GC should be optimized futher. please find it on official site. another, use vmstat monitor page rate during query. On Apr 1, 2013 6:09 PM, "Vibhav Mundra" wrote: > Hi All, > > I am trying to use Hbase for real-time data retrieval with a timeout of 50 > ms. > > I am using 2 machines as datanode and regionservers, > and one machine as a master for hadoop and Hbase. > > But I am able to fire only 3000 queries per sec and 10% of them are timing > out. > The database has 60 million rows. > > Are these figure okie, or I am missing something. > I have used the scanner caching to be equal to one, because for each time > we are fetching a single row only. > > Here are the various configurations: > > *Our schema > *{NAME => 'mytable', FAMILIES => [{NAME => 'cf', DATA_BLOCK_ENCODING => > 'NONE', BLOOMFILTER => 'ROWCOL', REPLICATION_SCOPE => '0', COMPRESSION => > 'GZ', VERSIONS => '1', TTL => '2147483647', MIN_VERSIONS => '0', KEE > P_DELETED_CELLS => 'false', BLOCKSIZE => '8192', ENCODE_ON_DISK => 'true', > IN_MEMORY => 'false', BLOCKCACHE => 'true'}]} > > *Configuration* > 1 Machine having both hbase and hadoop master > 2 machines having both region server node and datanode > total 285 region servers > > *Machine Level Optimizations:* > a)No of file descriptors is 1000000(ulimit -n gives 1000000) > b)Increase the read-ahead value to 4096 > c)Added noatime,nodiratime to the disks > > *Hadoop Optimizations:* > dfs.datanode.max.xcievers = 4096 > dfs.block.size = 33554432 > dfs.datanode.handler.count = 256 > io.file.buffer.size = 65536 > hadoop data is split on 4 directories, so that different disks are being > accessed > > *Hbase Optimizations*: > > hbase.client.scanner.caching=1 #We have specifcally added this, as we > return always one row. > hbase.regionserver.handler.count=3200 > hfile.block.cache.size=0.35 > hbase.hregion.memstore.mslab.enabled=true > hfile.min.blocksize.size=16384 > hfile.min.blocksize.size=4 > hbase.hstore.blockingStoreFiles=200 > hbase.regionserver.optionallogflushinterval=60000 > hbase.hregion.majorcompaction=0 > hbase.hstore.compaction.max=100 > hbase.hstore.compactionThreshold=100 > > *Hbase-GC > *-XX:+UseConcMarkSweepGC -XX:+UseParNewGC -XX:+CMSParallelRemarkEnabled > -XX:SurvivorRatio=20 -XX:ParallelGCThreads=16 > *Hadoop-GC* > -XX:+UseConcMarkSweepGC -XX:+UseParNewGC > > -Vibhav > --047d7bd76ae6f2a1e704d94afd48--