hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Li Pi (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-5403) Checkpoint the compressed HLog
Date Wed, 15 Feb 2012 20:15:01 GMT

    [ https://issues.apache.org/jira/browse/HBASE-5403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13208752#comment-13208752

Li Pi commented on HBASE-5403:

Rolling the log would just reset the dictionary, which means performance will be degraded
for a bit until the dictionary was built back up again. 

I'm assuming checkpointing would involve dumping the contents of the dictionary at certain
points - but the max size of the dictionary can be quite large, up to 32 megabytes or so in
extreme cases. This has its own problems.
> Checkpoint the compressed HLog
> ------------------------------
>                 Key: HBASE-5403
>                 URL: https://issues.apache.org/jira/browse/HBASE-5403
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Liyin Tang
>            Assignee: Liyin Tang
> Let's assume that HBase replication can be based on replaying the HLog in the replica
> The replica process could be crash during the replay. Obviously, the replica process
need a way to start from the lastest check point in the HLog, even the HLog is compressed.
> So the proposal is to write a series of checkpoints within the HLog. 
> Each each checkpoint will have a header with some special sequence of bytes.
> And between each checkpoints, HLog should use new dictionaries to compress.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message