hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-14949) Resolve name conflict when splitting if there are duplicated WAL entries
Date Wed, 17 Feb 2016 05:39:18 GMT

    [ https://issues.apache.org/jira/browse/HBASE-14949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15149924#comment-15149924
] 

stack commented on HBASE-14949:
-------------------------------

More on the patch:

    if (!fs.exists(dir) && !fs.mkdirs(dir)) {

We use to have an isCreate flag. We don't have it anymore. Was it always true? (It looks like
it going by your patch).

Should you change formatRecoveredEditsFileName to take the original file name? It looks like
it is called from one other place at least.

So, we write with the name of the WAL in the split file name. Where do we read it back? (I'm
asking you because you probably have your finger on it). I want to see if we handle case of
bare sequenceid as well as this new format. In fact, should we have a test that demonstrates
this?

Otherwise, patch is looking like it would work w/ branch-1 now and does not need migration
as you say. Thanks [~Apache9]


> Resolve name conflict when splitting if there are duplicated WAL entries
> ------------------------------------------------------------------------
>
>                 Key: HBASE-14949
>                 URL: https://issues.apache.org/jira/browse/HBASE-14949
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Heng Chen
>            Assignee: Duo Zhang
>         Attachments: HBASE-14949-v3.patch, HBASE-14949-v4.patch, HBASE-14949.patch, HBASE-14949_v1.patch,
HBASE-14949_v2.patch
>
>
> The AsyncFSHLog introduced in HBASE-14790 may write same WAL entries to different WAL
files. WAL entry itself is idempotent so replay is not a problem but the intermediate file
name and final name when splitting is constructed using the lowest or highest sequence id
of the WAL entries written, so it is possible that different WAL files will have same intermediate
or final file name when splitting. In the currentm implementation, this will cause split fail
or data loss. We need to solve this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message