Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@locus.apache.org Received: (qmail 46094 invoked from network); 21 Jan 2009 21:04:26 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 21 Jan 2009 21:04:26 -0000 Received: (qmail 71249 invoked by uid 500); 21 Jan 2009 21:04:20 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 71232 invoked by uid 500); 21 Jan 2009 21:04:20 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 71205 invoked by uid 99); 21 Jan 2009 21:04:20 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 21 Jan 2009 13:04:20 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 21 Jan 2009 21:04:19 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id A900B234C4A8 for ; Wed, 21 Jan 2009 13:03:59 -0800 (PST) Message-ID: <144060086.1232571839691.JavaMail.jira@brutus> Date: Wed, 21 Jan 2009 13:03:59 -0800 (PST) From: "Erik Holstad (JIRA)" To: hbase-dev@hadoop.apache.org Subject: [jira] Commented: (HBASE-80) [hbase] Add a cache of 'hot' cells MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-80?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12665964#action_12665964 ] Erik Holstad commented on HBASE-80: ----------------------------------- Sorry for not posting on this issue, even thought I have been assigned and everything :) So the basic idea that I have been working on is to make a key/value cache to speed up random reads. Test setup: Used the same test parameters that are used in the BT paper so it would be easy to compare and test have currently only been done on a single machine cluster with one HRegionServer. That setup includes 1column/family and every value is 1000B. Some numbers for testing this extremely simple cache are: Tests done over 10000 reads Random reads without cache: 481 r/s 481 KB/s Random reads with cache: 4019 r/s 4019 KB/s Some other test to compare the difference when using multiple columns/family turned out to give the following numbers: 5 columns/family everything else the same as above. Random reads without cache: 445 r/s 2223 KB/s Random reads without cache: 3588 r/s 17940 KB/s 10 columns/family everything else the same as above. Random reads without cache: 24 r/s 24000 KB/s Random reads without cache: 25 r/s 25000 KB/s For the rest of the test only 100 rows where used to avoid out of memory errors. Like first test but fewer rows: Random reads without cache: 284 r/s 284 KB/s Random reads with cache: 2083 r/s 2083 KB/s Same as above but with 1000 columns/family Random reads without cache: 23 r/s 23000 KB/s Random reads with cache: 76 r/s 76000 KB/s > [hbase] Add a cache of 'hot' cells > ---------------------------------- > > Key: HBASE-80 > URL: https://issues.apache.org/jira/browse/HBASE-80 > Project: Hadoop HBase > Issue Type: Improvement > Components: regionserver > Reporter: stack > Assignee: Erik Holstad > Priority: Minor > Fix For: 0.20.0 > > Attachments: cache.patch > > -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.