hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lars Hofhansl (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-18165) Predicate based deletion during major compactions
Date Mon, 05 Jun 2017 22:36:12 GMT

    [ https://issues.apache.org/jira/browse/HBASE-18165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16037784#comment-16037784

Lars Hofhansl commented on HBASE-18165:

I was thinking Filters indeed. We already have string-to-filter converters for the HBase shell.
Could "simply" store those in an HBase table and read before compactions.

Here's another crazy idea: An HFileReader that produces Cells arithmetically. Now one could
potentially write some metadata into a file and have an HFileReader that produces (say) the
delete marker you want - would need to insert one of these for each affected region. Now everything
would just work, HBase merge-sort would do the right thing.

> Predicate based deletion during major compactions
> -------------------------------------------------
>                 Key: HBASE-18165
>                 URL: https://issues.apache.org/jira/browse/HBASE-18165
>             Project: HBase
>          Issue Type: Brainstorming
>            Reporter: Lars Hofhansl
> In many cases it is expensive to place a delete per version, column, or family.
> HBase should have way to specify a predicate and remove all Cells matching the predicate
during the next compactions (major and minor).
> Nothing more concrete. The tricky part would be to know when it is safe to remove the
predicate, i.e. when we can be sure that all Cells matching the predicate actually have been
> Could potentially use HBASE-12859 for that.

This message was sent by Atlassian JIRA

View raw message