flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Flink Jira Bot (Jira)" <j...@apache.org>
Subject [jira] [Updated] (FLINK-19359) Restore from Checkpoint fails if checkpoint folders is corrupt/partial
Date Thu, 22 Apr 2021 22:53:02 GMT

     [ https://issues.apache.org/jira/browse/FLINK-19359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Flink Jira Bot updated FLINK-19359:
    Labels: auto-closed  (was: stale-minor)

> Restore from Checkpoint fails if checkpoint folders is corrupt/partial
> ----------------------------------------------------------------------
>                 Key: FLINK-19359
>                 URL: https://issues.apache.org/jira/browse/FLINK-19359
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Checkpointing
>    Affects Versions: 1.8.0
>            Reporter: Arpith Prakash
>            Priority: Minor
>              Labels: auto-closed
>         Attachments: Checkpoints.png
> I'm using Flink 1.8.0 version and have enabled externalized checkpoint to hdfs location,
we have seen few scenarios where checkpoint folders will have checkpoint files but only missing
"*_metadata*" file. If we attempt to restore application from this path, application fails
with exception "Could not find *_metadata* file. There is similar discussion in Flink user
mailing list with subject  "Zookeeper connection loss causing checkpoint corruption" around
it. I've attached sample snapshot on how folder structure looks as well.

This message was sent by Atlassian Jira

View raw message