Return-Path: Delivered-To: apmail-hadoop-hbase-user-archive@locus.apache.org Received: (qmail 25997 invoked from network); 8 Feb 2008 05:08:29 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 8 Feb 2008 05:08:29 -0000 Received: (qmail 45966 invoked by uid 500); 8 Feb 2008 05:08:22 -0000 Delivered-To: apmail-hadoop-hbase-user-archive@hadoop.apache.org Received: (qmail 45955 invoked by uid 500); 8 Feb 2008 05:08:22 -0000 Mailing-List: contact hbase-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-user@hadoop.apache.org Delivered-To: mailing list hbase-user@hadoop.apache.org Received: (qmail 45946 invoked by uid 99); 8 Feb 2008 05:08:21 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Feb 2008 21:08:21 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [63.203.238.117] (HELO dns.duboce.net) (63.203.238.117) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 08 Feb 2008 05:08:07 +0000 Received: by dns.duboce.net (Postfix, from userid 1008) id 6E6E3C563; Thu, 7 Feb 2008 19:34:08 -0800 (PST) X-Spam-Checker-Version: SpamAssassin 3.1.4 (2006-07-26) on dns.duboce.net X-Spam-Level: Received: from durruti.local (unknown [192.168.1.135]) by dns.duboce.net (Postfix) with ESMTP id 458F8C1CE for ; Thu, 7 Feb 2008 19:34:06 -0800 (PST) Message-ID: <47ABE320.9050304@duboce.net> Date: Thu, 07 Feb 2008 21:05:36 -0800 From: stack User-Agent: Thunderbird 2.0.0.9 (Macintosh/20071031) MIME-Version: 1.0 To: hbase-user@hadoop.apache.org Subject: Re: HBase Random Read Performance References: <47ABCC44.20208@openplaces.com> In-Reply-To: <47ABCC44.20208@openplaces.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org X-Old-Spam-Status: No, score=-4.4 required=5.0 tests=ALL_TRUSTED,BAYES_00 autolearn=ham version=3.1.4 Tthe test described can only favor mysql (single column, just a million rows). Do you need Hbase? You might also tell us more about your hbase setup. Is it using localfs or hdfs? Is it a distributed hdfs or all on single server? Thanks, St.Ack Joost Ouwerkerk wrote: > I'm working on a web application with primarily read-oriented > performance requirements. I've been running some benchmarking tests > that include our application layer, to get a sense of what is possible > with Hbase. A variation on the Bigtable test that is reproduced by > org.apache.hadoop.hbase.PerformanceEvaluation, I'm randomly reading 1 > column from a table with 1 million rows. In our case, the contents of > that column need to be deserialized by our application (which adds > some overhead that I'm also trying to measure), the deserialized > contents represent a little over 1K of data. > Although a single thread can only achieve 125 reads per second, with > 12 client threads (from 3 different machines) I'm able to read as many > as 500 objects per second. Now, I've replicated my test on a basic > MySQL table and am able to get a throughput of 2,300 reads/sec; > roughly 5 times what I'm seeing with Hbase. Besides the obvious code > maturity thing, is the discrepancy related to random reads not > actually being served from memcache, but rather from the disk, by > Hbase? The HBase performance page > (http://wiki.apache.org/hadoop/Hbase/PerformanceEvaluation) shows > random reads(mem) as "Not implemented." > > Can anyone shed some light on the state of HBase's memcaching? > Cheers, > Joost.