hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amit Kabra (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-19106) Backup self validation for its correctness.
Date Tue, 31 Oct 2017 09:42:00 GMT

    [ https://issues.apache.org/jira/browse/HBASE-19106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16226521#comment-16226521

Amit Kabra commented on HBASE-19106:

The validation job should restore the data to temp table which will get deleted once the validation
job finishes. 
Also we don't have to restore complete backup but we can restore some part of it, may be 1
out of x hfiles from backups or portion of each hfile, etc. 
Next level of this could be to provide validation percentage from command line to backups
eg 10% , in that case after backups 10% of backups would be validated and only if validation
passes that we call backups are successful, else we fail the backups.

Overall, idea is to somehow check if the backups are restorable and backup taken are the correct

> Backup self validation for its correctness.
> -------------------------------------------
>                 Key: HBASE-19106
>                 URL: https://issues.apache.org/jira/browse/HBASE-19106
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Amit Kabra
> Backups are critical and if they don't work when we need them at the time of restore
than they are not useful. We should do sanity test for each backup job we run that it is restorable
and hence can be trusted.
> A self validation feature can be added for the same to the backups where whenever a backup
is run , once it finishes it will trigger a validation job that will do a sample restoration
of the backed up data and will make sure that it compares well with actual data.

This message was sent by Atlassian JIRA

View raw message