zookeeper-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jiafu Jiang (Jira)" <j...@apache.org>
Subject [jira] [Created] (ZOOKEEPER-3607) Potential data inconsistency due to the inconsistency between ZKDatabase.committedLog and dataTree in Trunc sync.
Date Wed, 06 Nov 2019 07:34:00 GMT
Jiafu Jiang created ZOOKEEPER-3607:
--------------------------------------

             Summary: Potential data inconsistency due to the inconsistency between ZKDatabase.committedLog
and dataTree in Trunc sync.
                 Key: ZOOKEEPER-3607
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3607
             Project: ZooKeeper
          Issue Type: Bug
          Components: quorum
    Affects Versions: 3.4.14
            Reporter: Jiafu Jiang


I will describe the problem by a detail example.


1. Suppose we have three zk servers: zk1, zk2, and zk3. zk1 and zk2 are online, zk3 is offline,
zk1 is the leader.


2. In TRUNC sync, zk1 sends a TRUNC request to zk2, then sends the remaining proposals in
the committedLog. *When the follower zk2 receives the proposals, it applies them directly
into the datatree, but not the committedLog.* 


3. After the data sync phase, zk1 may continue to send zk2 more committed proposals, and they
will be applied to both the datatree and the committedLog of zk2.

 

4. Then zk1 fails, zk3 restarts successfully, zk2 becomes the leader.

 

5. The leader zk2 sends a TRUNC request to zk3, then the remaining proposals from the committedLog.
But since some proposals, which are from the leader zk1 in TRUNC sync(as I describe above),
are not in the committedLog, they will not be sent to zk3.

 

6. Now data inconsistency happens between zk2 and zk3, since some data may exist in zk2's
datatree, but not zk3's datatree.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message