hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ryan rawson (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-1521) Optimize codepath for minor compactions
Date Sun, 14 Jun 2009 19:23:07 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12719321#action_12719321

ryan rawson commented on HBASE-1521:

for now we have a reasonably fast (but not fastest) minor compactor that enforces deletes.
 So we don't have to decide today.

we may want to consider moving that for semantic/data preservation reasons, so we can undo
deletes before a major compact, but I'm ok either way.

> Optimize codepath for minor compactions
> ---------------------------------------
>                 Key: HBASE-1521
>                 URL: https://issues.apache.org/jira/browse/HBASE-1521
>             Project: Hadoop HBase
>          Issue Type: Improvement
>    Affects Versions: 0.20.0
>            Reporter: Jonathan Gray
>             Fix For: 0.20.1
> There are some additional optimizations in the specialized StoreScanner and also in HFile
for minor compactions.  For example, there is some KV disassembling and sanity checking in
HFile even though in a minor compaction these checks have already been run during the flush.
> Another area to discuss is whether we should actually process deletes during minor compactions.
 It's not especially expensive (ScanDeleteTracker is quite simple) but it requires looking
at both the row and the qualifier value of every single KV.  Removing this would drop our
axiom that "Deletes only apply to later storefiles", which is used during Get processing to
have more efficient delete handling.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message