flink-issues mailing list archives

From "Flink Jira Bot (Jira)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-19359) Restore from Checkpoint fails if checkpoint folders is corrupt/partial
Date Thu, 22 Apr 2021 22:53:02 GMT

    [ https://issues.apache.org/jira/browse/FLINK-19359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17329586#comment-17329586 ]

Flink Jira Bot commented on FLINK-19359:

This issue has been labeled "stale-minor" for 7 days. It is closed now. If you are still affected
by this or would like to raise the priority of this ticket, please re-open it, remove the label
"auto-closed", and raise the ticket priority accordingly.

> Restore from Checkpoint fails if checkpoint folders is corrupt/partial
> ----------------------------------------------------------------------
>                 Key: FLINK-19359
>                 URL: https://issues.apache.org/jira/browse/FLINK-19359
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Checkpointing
>    Affects Versions: 1.8.0
>            Reporter: Arpith Prakash
>            Priority: Minor
>              Labels: stale-minor
>         Attachments: Checkpoints.png
> I'm using Flink 1.8.0 and have enabled externalized checkpoints to an HDFS location.
> We have seen a few scenarios where a checkpoint folder contains the checkpoint files but is
> missing the "*_metadata*" file. If we attempt to restore the application from such a path, it
> fails with the exception "Could not find *_metadata* file". There is a similar discussion on
> the Flink user mailing list with the subject "Zookeeper connection loss causing checkpoint
> corruption". I've attached a sample snapshot showing what the folder structure looks like.
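
As a possible workaround for the situation described above, a pre-restore check could skip
checkpoint folders that lack the metadata file. The following is only a minimal sketch, not
part of Flink: the helper name latest_complete_checkpoint is hypothetical, and it assumes the
checkpoint root is reachable as a local path (for HDFS, an HDFS client would be needed
instead). It relies on Flink naming checkpoint folders "chk-<id>" and writing the metadata to
a file called "_metadata" inside each folder.

```python
import re
from pathlib import Path
from typing import Optional

# Flink names each retained checkpoint folder "chk-<id>" and stores the
# checkpoint metadata in a file called "_metadata" inside it.
CHK_DIR = re.compile(r"^chk-(\d+)$")


def latest_complete_checkpoint(job_checkpoint_root: str) -> Optional[Path]:
    """Hypothetical helper: return the newest chk-<id> folder that contains
    a _metadata file, or None if no such folder exists."""
    root = Path(job_checkpoint_root)
    if not root.is_dir():
        return None

    candidates = []
    for entry in root.iterdir():
        match = CHK_DIR.match(entry.name)
        # Folders without _metadata are treated as partial and skipped.
        if match and entry.is_dir() and (entry / "_metadata").is_file():
            candidates.append((int(match.group(1)), entry))

    if not candidates:
        return None
    # Pick the folder with the highest numeric checkpoint id.
    return max(candidates, key=lambda c: c[0])[1]
```

The returned path could then be passed as the restore path when resubmitting the job. Note
that this only detects a missing "_metadata" file; it does not verify that the remaining
state files in the folder are intact.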

This message was sent by Atlassian Jira