Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5B2B71010F for ; Sat, 29 Jun 2013 22:10:29 +0000 (UTC) Received: (qmail 65906 invoked by uid 500); 29 Jun 2013 22:10:28 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 65856 invoked by uid 500); 29 Jun 2013 22:10:28 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 65848 invoked by uid 99); 29 Jun 2013 22:10:28 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 29 Jun 2013 22:10:28 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [98.139.213.141] (HELO nm23-vm1.bullet.mail.bf1.yahoo.com) (98.139.213.141) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 29 Jun 2013 22:10:20 +0000 Received: from [98.139.215.142] by nm23.bullet.mail.bf1.yahoo.com with NNFMP; 29 Jun 2013 22:09:59 -0000 Received: from [98.139.212.243] by tm13.bullet.mail.bf1.yahoo.com with NNFMP; 29 Jun 2013 22:09:59 -0000 Received: from [127.0.0.1] by omp1052.mail.bf1.yahoo.com with NNFMP; 29 Jun 2013 22:09:59 -0000 X-Yahoo-Newman-Property: ymail-3 X-Yahoo-Newman-Id: 642280.72206.bm@omp1052.mail.bf1.yahoo.com Received: (qmail 43420 invoked by uid 60001); 29 Jun 2013 22:09:59 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1372543799; bh=s1qYxMmm4sbJdjZWzxWDnh+/SDMuUSvJFPRmAcYDL4I=; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-RocketYMMF:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=REny7lNgmvd1sv6PaVE27PQR+pB3D/1lriEq0SwYBKxSUGo3dJ8bMXJjphfwG/rsNHqPEG+nu76GcRJH85Iauj2dQ6ku3w/QApEFiNzCH56/WM5c++E2dortuw1vaNZaWGyibT8RvJKD6TNY3SAY4Fu/5gWtNdeABZF7mevV4FE= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-RocketYMMF:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=PO7h4mm9RbJMJEyZsBliFo1DPxs+i6gQ4tSUOYrjGH+B8Y+6srSBK1jJ58zfLFDjs7uVsJMa4EB0PoPa71p6KuYSUnoMeS8qlYlFUgb/fpnadO51BdBUHyHBD54CCSh+gYKDwD22GJl6i+3lXEO8CtsAByD53NP/nliYbaF7NU0=; X-YMail-OSG: MiF2zA4VM1lalNP_EgJKmVmp1cuvG9wIhWVMFn1kd1UnTUg OR0cqlt5Ejw4hoCEqshuJ Received: from [93.213.30.200] by web140601.mail.bf1.yahoo.com via HTTP; Sat, 29 Jun 2013 15:09:59 PDT X-Rocket-MIMEInfo: 002.001,SSd2ZSBzZWVuIHRoZSBzYW1lIGJhZCBwZXJmb3JtYW5jZSBiZWhhdmlvciB3aGVuIEkgdGVzdGVkIHRoaXMgb24gYSByZWFsIGNsdXN0ZXIuIChJIHRoaW5rIGl0IHdhcyBpbiAwLjk0LjYpCgoKSW5zdGVhZCBvZiBlbi9kaXNhYmxpbmcgdGhlIGJsb2NrY2FjaGUsIEkgdGVzdGVkIHNlcXVlbnRpYWwgYW5kIHJhbmRvbSByZWFkcyBvbiBhIGRhdGEgc2V0IHRoYXQgZG9lcyBub3QgZml0IGludG8gdGhlIChhZ2dyZWdhdGUpIGJsb2NrIGNhY2hlLgpTZXF1ZW50aWFsIHJlYWRzIHdlcmUgZHJhc3RpY2FsbHkgZmEBMAEBAQE- X-RocketYMMF: lhofhansl X-Mailer: YahooMailWebService/0.8.148.557 References: Message-ID: <1372543799.39326.YahooMailNeo@web140601.mail.bf1.yahoo.com> Date: Sat, 29 Jun 2013 15:09:59 -0700 (PDT) From: lars hofhansl Reply-To: lars hofhansl Subject: Re: Poor HBase random read performance To: "dev@hbase.apache.org" In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="969045052-417836057-1372543799=:39326" X-Virus-Checked: Checked by ClamAV on apache.org --969045052-417836057-1372543799=:39326 Content-Type: text/plain; charset=us-ascii I've seen the same bad performance behavior when I tested this on a real cluster. (I think it was in 0.94.6) Instead of en/disabling the blockcache, I tested sequential and random reads on a data set that does not fit into the (aggregate) block cache. Sequential reads were drastically faster than Random reads (7 vs 34 minutes), which can really only be explained with the fact that the next get will with high probability hit an already cached block, whereas in the random read case it likely will not. In the RandomRead case I estimate that each RegionServer brings in between 100 and 200mb/s from the OS cache. Even at 200mb/s this would be quite slow.I understand that performance is bad when index/bloom blocks are not cached, but bringing in data blocks from the OS cache should be faster than it is. So this is something to debug. -- Lars ________________________________ From: Varun Sharma To: "dev@hbase.apache.org" Sent: Saturday, June 29, 2013 12:13 PM Subject: Poor HBase random read performance Hi, I was doing some tests on how good HBase random reads are. The setup is consists of a 1 node cluster with dfs replication set to 1. Short circuit local reads and HBase checksums are enabled. The data set is small enough to be largely cached in the filesystem cache - 10G on a 60G machine. Client sends out multi-get operations in batches to 10 and I try to measure throughput. Test #1 All Data was cached in the block cache. Test Time = 120 seconds Num Read Ops = 12M Throughput = 100K per second Test #2 I disable block cache. But now all the data is in the file system cache. I verify this by making sure that IOPs on the disk drive are 0 during the test. I run the same test with batched ops. Test Time = 120 seconds Num Read Ops = 0.6M Throughput = 5K per second Test #3 I saw all the threads are now stuck in idLock.lockEntry(). So I now run with the lock disabled and the block cache disabled. Test Time = 120 seconds Num Read Ops = 1.2M Throughput = 10K per second Test #4 I re enable block cache and this time hack hbase to only cache Index and Bloom blocks but data blocks come from File System cache. Test Time = 120 seconds Num Read Ops = 1.6M Throughput = 13K per second So, I wonder how come such a massive drop in throughput. I know that HDFS code adds tremendous overhead but this seems pretty high to me. I use 0.94.7 and cdh 4.2.0 Thanks Varun --969045052-417836057-1372543799=:39326--