Return-Path: X-Original-To: apmail-hbase-dev-archive@www.apache.org Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2F83117D04 for ; Sat, 11 Apr 2015 01:11:15 +0000 (UTC) Received: (qmail 7686 invoked by uid 500); 11 Apr 2015 01:11:13 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 7514 invoked by uid 500); 11 Apr 2015 01:11:13 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 7336 invoked by uid 99); 11 Apr 2015 01:11:13 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 11 Apr 2015 01:11:13 +0000 Date: Sat, 11 Apr 2015 01:11:13 +0000 (UTC) From: "Andrew Purtell (JIRA)" To: dev@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Resolved] (HBASE-5303) Separate out rows into multiple files during compaction for better block cache utilization MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-5303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell resolved HBASE-5303. ----------------------------------- Resolution: Incomplete Assignee: (was: dhruba borthakur) > Separate out rows into multiple files during compaction for better block cache utilization > ------------------------------------------------------------------------------------------ > > Key: HBASE-5303 > URL: https://issues.apache.org/jira/browse/HBASE-5303 > Project: HBase > Issue Type: Improvement > Components: regionserver > Reporter: dhruba borthakur > > This seems odd at first sight, a compaction is supposed to reduce the number of files. But there is great advantage to creating hbase blocks where adjacent keys are accessed together. One approach would be to make compaction write out hot-rows into one file and the cold-rows into another file. This increases the efficiency of the block cache because only the blocks in the hot-rows file will most frequently be populated in the block cache. -- This message was sent by Atlassian JIRA (v6.3.4#6332)