hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ted Yu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-15381) Implement a distributed MOB compaction by procedure
Date Wed, 23 Mar 2016 15:04:25 GMT

    [ https://issues.apache.org/jira/browse/HBASE-15381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15208548#comment-15208548

Ted Yu commented on HBASE-15381:

For step 2:
bq. record the mapping <regionServer, regions> in Zookeeper.

Have you considered other persistence medium ? We should reduce reliance on zookeeper.

For step 8:
bq. Check if all found regions in this region server that are recorded in Zookeeper before
the procedure is started are still online,

What if some regions are transitioned offline during the compaction process ?

bq. adding a new item to metadata is a more reasonable choice

And this is amenable to compatibility.

What if duration of MOB compaction is longer than the interval which ScheduledChore runs ?

> Implement a distributed MOB compaction by procedure
> ---------------------------------------------------
>                 Key: HBASE-15381
>                 URL: https://issues.apache.org/jira/browse/HBASE-15381
>             Project: HBase
>          Issue Type: Improvement
>          Components: mob
>            Reporter: Jingcheng Du
>            Assignee: Jingcheng Du
>         Attachments: mob distributed compaction design.pdf
> In MOB, there is a periodical compaction which runs in HMaster (It can be disabled by
configuration), some small mob files are merged into bigger ones. Now the compaction only
runs in HMaster which is not efficient and might impact the running of HMaster. In this JIRA,
a distributed MOB compaction is introduced, it is triggered by HMaster, but all the compaction
jobs are distributed to HRegionServers.

This message was sent by Atlassian JIRA

View raw message