flink-issues mailing list archives

From "Ufuk Celebi (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-7067) Cancel with savepoint does not restart checkpoint scheduler on failure
Date Thu, 19 Oct 2017 10:21:00 GMT

    [ https://issues.apache.org/jira/browse/FLINK-7067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16210839#comment-16210839 ]

Ufuk Celebi commented on FLINK-7067:
------------------------------------

Hey Till, I think the fix is done from my side. The open question is what we do with the test:
1) Keep the test, although it overuses Mockito (due to how we expose the CheckpointCoordinator etc.)
2) Remove the test
3) Refactor what needs to be refactored and then rewrite the test

I don't have time for 3), but I am fine with either 1) or 2). What do you propose?


> Cancel with savepoint does not restart checkpoint scheduler on failure
> ----------------------------------------------------------------------
>
>                 Key: FLINK-7067
>                 URL: https://issues.apache.org/jira/browse/FLINK-7067
>             Project: Flink
>          Issue Type: Bug
>          Components: State Backends, Checkpointing
>    Affects Versions: 1.3.1
>            Reporter: Ufuk Celebi
>            Assignee: Ufuk Celebi
>            Priority: Blocker
>             Fix For: 1.4.0, 1.3.3
>
>
> The `CancelWithSavepoint` action of the JobManager first stops the checkpoint scheduler, then triggers a savepoint, and cancels the job after the savepoint completes.
> If the savepoint fails, the command should not have any side effects, so we don't cancel the job. The issue is that the checkpoint scheduler is not restarted in that case.
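
The intended behaviour described above could be sketched roughly as follows. This is only an illustration and not the actual JobManager code or the committed fix: the `Scheduler` and `Job` classes, the `cancelWithSavepoint` method, and the use of `CompletableFuture` are all hypothetical stand-ins chosen to keep the example self-contained.

```java
import java.util.concurrent.CompletableFuture;
import java.util.function.Supplier;

public class CancelWithSavepointSketch {

    /** Hypothetical stand-in for the periodic checkpoint scheduler. */
    static class Scheduler {
        private boolean running = true;
        void start() { running = true; }
        void stop() { running = false; }
        boolean isRunning() { return running; }
    }

    /** Hypothetical stand-in for the job; it only records cancellation. */
    static class Job {
        private boolean cancelled = false;
        void cancel() { cancelled = true; }
        boolean isCancelled() { return cancelled; }
    }

    /**
     * Stops the scheduler, triggers the savepoint, and then either cancels
     * the job (savepoint succeeded) or restarts the scheduler (savepoint
     * failed), so that a failed command leaves no side effects behind.
     */
    static CompletableFuture<String> cancelWithSavepoint(
            Scheduler scheduler,
            Job job,
            Supplier<CompletableFuture<String>> triggerSavepoint) {

        // Step 1: no periodic checkpoints while the savepoint is in flight.
        scheduler.stop();

        // Step 2: trigger the savepoint and react to its outcome.
        return triggerSavepoint.get().whenComplete((path, failure) -> {
            if (failure == null) {
                // Step 3a: savepoint completed, so cancel the job.
                job.cancel();
            } else {
                // Step 3b: savepoint failed. Keep the job running and
                // undo step 1 (the step that was missing in the report).
                scheduler.start();
            }
        });
    }

    public static void main(String[] args) {
        Scheduler scheduler = new Scheduler();
        Job job = new Job();

        // Simulate a savepoint that fails.
        CompletableFuture<String> failedSavepoint = new CompletableFuture<>();
        failedSavepoint.completeExceptionally(new RuntimeException("savepoint failed"));

        cancelWithSavepoint(scheduler, job, () -> failedSavepoint);

        System.out.println("scheduler running: " + scheduler.isRunning());
        System.out.println("job cancelled: " + job.isCancelled());
    }
}
```

The restart lives in the failure branch of the same callback that cancels the job on success, so both outcomes are handled in one place and the scheduler cannot stay stopped while the job keeps running.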



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
