flink-dev mailing list archives

From "Xiaogang Shi (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-6284) Incorrect sorting of completed checkpoints in ZooKeeperCompletedCheckpointStore
Date Mon, 10 Apr 2017 04:03:42 GMT
Xiaogang Shi created FLINK-6284:
-----------------------------------

             Summary: Incorrect sorting of completed checkpoints in ZooKeeperCompletedCheckpointStore
                 Key: FLINK-6284
                 URL: https://issues.apache.org/jira/browse/FLINK-6284
             Project: Flink
          Issue Type: Bug
            Reporter: Xiaogang Shi


Currently, all completed checkpoints are sorted by their paths when they are recovered in
{{ZooKeeperCompletedCheckpointStore}}. When the latest checkpoint's id is not the largest in
lexical order (e.g., "100" is smaller than "99" in lexical order), Flink will not recover from
the latest completed checkpoint.
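
The effect can be illustrated with a minimal, standalone sketch. It does not use any Flink or ZooKeeper classes; the class and method names ({{LexicalOrderDemo}}, {{latestByName}}) are hypothetical and only mimic "sort by name, take the last entry":

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class LexicalOrderDemo {

    // Hypothetical helper: sorts the ids as plain strings (lexical order)
    // and returns the last one, i.e. what a name-based sort treats as "latest".
    static String latestByName(List<String> ids) {
        List<String> sorted = new ArrayList<>(ids);
        Collections.sort(sorted);
        return sorted.get(sorted.size() - 1);
    }

    public static void main(String[] args) {
        List<String> ids = Arrays.asList("99", "100", "101");
        // Lexical order is "100", "101", "99", so this prints 99
        // although 101 is the latest checkpoint.
        System.out.println(latestByName(ids));
    }
}
```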

The problem can be easily observed by setting the checkpoint ids in {{ZooKeeperCompletedCheckpointStoreITCase#testRecover()}}
to 99, 100 and 101.

To fix the problem, we should explicitly sort the found checkpoints by their checkpoint ids,
instead of relying on {{ZooKeeperStateHandleStore#getAllSortedByName()}}.
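
A rough sketch of such a numeric sort is given below. It is not the actual patch: it assumes, for illustration only, that each path ends in the decimal checkpoint id (e.g. "/100"), and the names {{SortByCheckpointId}}, {{pathToCheckpointId}} and {{sortByCheckpointId}} are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

public class SortByCheckpointId {

    // Assumption: the last path segment is the decimal checkpoint id.
    static long pathToCheckpointId(String path) {
        return Long.parseLong(path.substring(path.lastIndexOf('/') + 1));
    }

    // Returns a copy of the paths ordered by numeric checkpoint id, ascending.
    static List<String> sortByCheckpointId(List<String> paths) {
        List<String> sorted = new ArrayList<>(paths);
        sorted.sort(Comparator.comparingLong(SortByCheckpointId::pathToCheckpointId));
        return sorted;
    }

    public static void main(String[] args) {
        List<String> sorted = sortByCheckpointId(Arrays.asList("/100", "/99", "/101"));
        // The last element is now the checkpoint with the largest id.
        System.out.println(sorted.get(sorted.size() - 1));
    }
}
```

With the ids 99, 100 and 101 from the test case above, the last element after sorting is "/101", which is the latest checkpoint.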



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
