hbase-issues mailing list archives

From "Dave Latham (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-18165) Predicate based deletion during major compactions
Date Mon, 05 Jun 2017 21:10:05 GMT

    [ https://issues.apache.org/jira/browse/HBASE-18165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16037600#comment-16037600 ]

Dave Latham commented on HBASE-18165:
-------------------------------------

Fascinating thought.  We've long toyed with the idea of pushing custom column logic
into compaction via a coprocessor.  However, a well-defined but simpler interface, similar
to Filter, would make that much easier to do.  An interface with methods called at the boundary
of each row and for each cell within the row, allowing filtering, altering, and potentially even
inserting cells (so long as sort order is maintained), would be awesome.
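
To make the shape of such an interface concrete, here is a rough sketch. Nothing in it is an existing HBase API: CompactionCellFilter, SimpleCell, PredicateDeleteFilter, and compact are hypothetical names invented for illustration, and SimpleCell is only a stand-in for a real HBase Cell. It assumes the compaction hands cells to the callback in sorted order and that whatever the callback returns for a cell keeps that order.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.function.Predicate;

public class CompactionFilterSketch {

    /** Hypothetical stand-in for an HBase cell; not org.apache.hadoop.hbase.Cell. */
    static final class SimpleCell {
        final String row;
        final String qualifier;
        final long timestamp;
        final String value;

        SimpleCell(String row, String qualifier, long timestamp, String value) {
            this.row = row;
            this.qualifier = qualifier;
            this.timestamp = timestamp;
            this.value = value;
        }
    }

    /** Hypothetical callback interface: row boundaries plus one call per cell. */
    interface CompactionCellFilter {
        void startRow(String row);

        /** Returns the cells to write in place of the input: empty drops it,
         *  one element keeps or alters it, several insert extra cells. */
        List<SimpleCell> onCell(SimpleCell cell);

        void endRow(String row);
    }

    /** Predicate-based deletion expressed through the callback interface. */
    static final class PredicateDeleteFilter implements CompactionCellFilter {
        private final Predicate<SimpleCell> shouldDelete;

        PredicateDeleteFilter(Predicate<SimpleCell> shouldDelete) {
            this.shouldDelete = shouldDelete;
        }

        @Override public void startRow(String row) { /* no per-row state needed */ }

        @Override public List<SimpleCell> onCell(SimpleCell cell) {
            return shouldDelete.test(cell)
                    ? Collections.<SimpleCell>emptyList()
                    : Collections.singletonList(cell);
        }

        @Override public void endRow(String row) { /* no per-row state needed */ }
    }

    /** Toy driver: walks already-sorted cells the way a compaction scan might. */
    static List<SimpleCell> compact(List<SimpleCell> sortedCells, CompactionCellFilter filter) {
        List<SimpleCell> out = new ArrayList<>();
        String currentRow = null;
        for (SimpleCell cell : sortedCells) {
            if (!cell.row.equals(currentRow)) {
                if (currentRow != null) {
                    filter.endRow(currentRow);
                }
                currentRow = cell.row;
                filter.startRow(currentRow);
            }
            out.addAll(filter.onCell(cell));
        }
        if (currentRow != null) {
            filter.endRow(currentRow);
        }
        return out;
    }

    public static void main(String[] args) {
        List<SimpleCell> cells = Arrays.asList(
                new SimpleCell("r1", "a", 50L, "old"),
                new SimpleCell("r1", "b", 150L, "new"),
                new SimpleCell("r2", "a", 200L, "new"));
        // Example predicate: drop every cell with a timestamp below 100.
        List<SimpleCell> kept = compact(cells, new PredicateDeleteFilter(c -> c.timestamp < 100L));
        for (SimpleCell c : kept) {
            System.out.println(c.row + "/" + c.qualifier + "@" + c.timestamp);
        }
    }
}
```

Returning a list from onCell is just one way to cover filtering, altering, and inserting with a single method; the sketch does not verify that returned cells preserve sort order, which a real implementation would have to enforce.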

> Predicate based deletion during major compactions
> -------------------------------------------------
>
>                 Key: HBASE-18165
>                 URL: https://issues.apache.org/jira/browse/HBASE-18165
>             Project: HBase
>          Issue Type: Brainstorming
>            Reporter: Lars Hofhansl
>
> In many cases it is expensive to place a delete per version, column, or family.
> HBase should have a way to specify a predicate and remove all Cells matching the predicate
> during the next compactions (major and minor).
> Nothing more concrete. The tricky part would be to know when it is safe to remove the
> predicate, i.e. when we can be sure that all Cells matching the predicate have actually been
> removed.
> Could potentially use HBASE-12859 for that.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
