hbase-issues mailing list archives

From "Vladimir Rodionov (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-14272) Enforce major compaction on stores with KEEP_DELETED_CELLS=true
Date Fri, 04 Sep 2015 23:11:46 GMT

    [ https://issues.apache.org/jira/browse/HBASE-14272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14731580#comment-14731580 ]

Vladimir Rodionov commented on HBASE-14272:

OK, if KEEP_DELETED_CELLS is TRUE/TTL, the only way to remove deleted cells once the TTL expires
when there is only one store file (a fairly rare condition) is to trigger major compaction manually.

If # of store files == 1 and KEEP_DELETED_CELLS = TRUE/TTL, periodic major compaction will never
pick this file up if its locality is above the threshold.
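
To make the single-file case concrete, here is a minimal sketch of the decision as described in this comment. It is a simplified model, not HBase code: the `StoreFile` class, its fields, and the function name are hypothetical, and `min_locality_to_skip` stands in for *hbase.hstore.min.locality.to.skip.major.compact*.

```python
from dataclasses import dataclass


@dataclass
class StoreFile:
    # Hypothetical model of a store file; not an HBase class.
    locality: float                    # fraction of local blocks, 0.0 to 1.0
    has_expired_delete_markers: bool   # deleted cells that are past their TTL


def periodic_major_compaction_picks(f: StoreFile, min_locality_to_skip: float) -> bool:
    """Single-file store: only locality is consulted in this model.

    Expired delete markers are deliberately ignored here, which is
    the gap this issue is about.
    """
    return f.locality < min_locality_to_skip


# A well-localized file with expired delete markers is never picked up.
stale = StoreFile(locality=0.9, has_expired_delete_markers=True)
print(periodic_major_compaction_picks(stale, min_locality_to_skip=0.5))
```

In this model the only remaining way to purge the expired markers from `stale` is a manually triggered major compaction.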

The same is true for *hbase.hstore.time.to.purge.deletes* > 0. If we add these checks to the
CompactionPolicy implementation, we will need to keep a flag somewhere (FILE_IS_CLEAN); otherwise
we will be running major compaction on a single file every time the major compaction interval elapses.
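
The trade-off around a FILE_IS_CLEAN flag can be sketched as follows. This is an illustrative model only, under the assumption that one major compaction purges everything eligible; `StoreFile`, `is_clean`, `should_major_compact`, and `count_compactions` are hypothetical names, not part of HBase.

```python
from dataclasses import dataclass


@dataclass
class StoreFile:
    # Hypothetical model of a store file; not an HBase class.
    locality: float
    is_clean: bool = False   # stands in for the proposed FILE_IS_CLEAN flag


def should_major_compact(f: StoreFile, min_locality: float,
                         keep_deleted_cells: bool) -> bool:
    """Proposed check: compact a single file regardless of locality
    when KEEP_DELETED_CELLS is on and the file is not known to be clean."""
    if f.locality < min_locality:
        return True
    return keep_deleted_cells and not f.is_clean


def count_compactions(f: StoreFile, intervals: int, use_flag: bool,
                      min_locality: float = 0.5) -> int:
    """Count single-file major compactions over a number of intervals."""
    compactions = 0
    for _ in range(intervals):
        if should_major_compact(f, min_locality, keep_deleted_cells=True):
            compactions += 1
            if use_flag:
                # Assumption: the rewrite purged all eligible deleted cells.
                f.is_clean = True
    return compactions


print(count_compactions(StoreFile(locality=0.9), intervals=5, use_flag=True))
print(count_compactions(StoreFile(locality=0.9), intervals=5, use_flag=False))
```

With the flag, the file is rewritten once; without it, the same single file is rewritten at every interval, which is the cost being weighed above.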

Not sure it is worth doing. Close it as Invalid/Won't fix? What do you think [~lhofhansl]?

> Enforce major compaction on stores with KEEP_DELETED_CELLS=true
> ---------------------------------------------------------------
>                 Key: HBASE-14272
>                 URL: https://issues.apache.org/jira/browse/HBASE-14272
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>             Fix For: 2.0.0
>         Attachments: HBASE-14272-v2.patch, HBASE-14272.patch
> Currently, if a store has one (major compacted) file, the only cases in which major compaction
> will be triggered for this file again are when locality is below the threshold defined by
> *hbase.hstore.min.locality.to.skip.major.compact* or when TTL has expired some cells. If the file
> has locality greater than this threshold, it will never be major compacted until the Store's TTL
> kicks in. For a CF with KEEP_DELETED_CELLS on, compaction must always be enabled (even for a
> single file), regardless of locality, when deleted cells have expired (*hbase.hstore.time.to.purge.deletes*).

This message was sent by Atlassian JIRA