flink-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Stephan Ewen (JIRA)" <j...@apache.org>
Subject [jira] [Created] (FLINK-4808) Allow skipping failed checkpoints
Date Wed, 12 Oct 2016 08:52:20 GMT
Stephan Ewen created FLINK-4808:
-----------------------------------

             Summary: Allow skipping failed checkpoints
                 Key: FLINK-4808
                 URL: https://issues.apache.org/jira/browse/FLINK-4808
             Project: Flink
          Issue Type: New Feature
    Affects Versions: 1.1.2, 1.1.3
            Reporter: Stephan Ewen
             Fix For: 1.2.0


Currently, if Flink cannot complete a checkpoint, it results in a failure and recovery.

To make the impact of less stable storage infrastructure on the performance of Flink less
severe, Flink should be able to tolerate a certain number of failed checkpoints and simply
keep executing.

This should be controllable via a parameter, for example:
{code}
env.getCheckpointConfig().setAllowedFailedCheckpoints(3);
{code}

A value of {{-1}} could indicate an infinite number of checkpoint failures tolerated by Flink.
The default value should still be {{0}}, to keep compatibility with the existing behavior.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message