Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5A02918E32 for ; Fri, 26 Feb 2016 09:00:33 +0000 (UTC) Received: (qmail 92907 invoked by uid 500); 26 Feb 2016 09:00:27 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 92862 invoked by uid 500); 26 Feb 2016 09:00:27 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 92833 invoked by uid 99); 26 Feb 2016 09:00:26 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 26 Feb 2016 09:00:26 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id B2B882C1F6E for ; Fri, 26 Feb 2016 09:00:26 +0000 (UTC) Date: Fri, 26 Feb 2016 09:00:26 +0000 (UTC) From: "Anoop Sam John (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-15338) Add a option to disable the data block cache for testing the performance of underlying file system MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-15338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15168682#comment-15168682 ] Anoop Sam John commented on HBASE-15338: ---------------------------------------- Again am missing some thing - family.isBlockCacheEnabled(), + conf.getBoolean(CACHE_DATA_ON_READ_KEY, DEFAULT_CACHE_DATA_ON_READ) + && family.isBlockCacheEnabled(), Why we need this new config? Why can not we rely on HCD setting? {code} /** * Returns whether the DATA blocks of this HFile should be cached on read or not (we always * cache the meta blocks, the INDEX and BLOOM blocks). * @return true if blocks should be cached on read, false if not */ public boolean shouldCacheDataOnRead() { return isBlockCacheEnabled() && cacheDataOnRead; } {code} This may be the issue you are saying? This is called from getMetaBlock(). As per the comment, when we read meta blocks, we must cache it. As we do not pass any type we seems may not do that.. That is a bug IMO.. So we better correct that bug (Any other?) and test ur case with HCD setting? > Add a option to disable the data block cache for testing the performance of underlying file system > -------------------------------------------------------------------------------------------------- > > Key: HBASE-15338 > URL: https://issues.apache.org/jira/browse/HBASE-15338 > Project: HBase > Issue Type: Improvement > Components: integration tests > Reporter: Liu Shaohui > Assignee: Liu Shaohui > Priority: Minor > Fix For: 2.0.0 > > Attachments: HBASE-15338-trunk-v1.diff, HBASE-15338-trunk-v2.diff, HBASE-15338-trunk-v3.diff > > > When testing and comparing the performance of different file systems(HDFS, Azure blob storage, AWS S3 and so on) for HBase, it's better to avoid the affect of the HBase BlockCache and get the actually random read latency when data block is read from underlying file system. (Usually, the index block and meta block should be cached in memory in the testing). > So we add a option in CacheConfig to disable the data block cache. > Suggestions are welcomed~ Thanks -- This message was sent by Atlassian JIRA (v6.3.4#6332)