hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vladimir Rodionov (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-15442) HBase Backup Phase 2: Potential data loss and or data duplication in incremental backup
Date Thu, 10 Mar 2016 22:07:40 GMT
Vladimir Rodionov created HBASE-15442:
-----------------------------------------

             Summary: HBase Backup Phase 2: Potential data loss and or data duplication in
incremental backup
                 Key: HBASE-15442
                 URL: https://issues.apache.org/jira/browse/HBASE-15442
             Project: HBase
          Issue Type: Bug
            Reporter: Vladimir Rodionov
            Assignee: Vladimir Rodionov
            Priority: Critical


Suppose we have two tables T1 and T2

# Create full backup T1 with backup id = B1
# Create full backup T2 backupId = B2
# New data arrived into file WAL1
# Create incremental backup of T1 with backupId = B3
# Create incremental backup of T2 with backupid = B4

The directory structure for backup site after this steps

BACKUP_ROOT/WALs/B3
BACKUP_ROOT/WALs/B4
BACKUP_ROOT/T1/B1
BACKUP_ROOT/T2/B2

File WAL1 may end up either in BACKUP_ROOT/WALs/B3 or in both: 
BACKUP_ROOT/WALs/B3 and BACKUP_ROOT/WALs/B4 location. Both are bad: in first case we lose
data for backup B4 in second case we have duplicate copies of WAL1









--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message