flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robin Cassan <robin.cas...@contentsquare.com>
Subject Making job fail on Checkpoint Expired?
Date Thu, 02 Apr 2020 13:37:20 GMT
Hi all,

I am wondering if there is a way to make a flink job fail (not cancel it)
when one or several checkpoints have failed due to being expired (taking
longer than the timeout) ?
I am using Flink 1.9.2 and have set `
*setTolerableCheckpointFailureNumber(1)*` which doesn't do the trick.
Looking into the CheckpointFailureManager.java class, it looks like this
only works when the checkpoint failure reason is `*CHECKPOINT_DECLINED*`,
but the number of failures isn't incremented on `*CHECKPOINT_EXPIRED*`.
Am I missing something?

Thanks!

Mime
View raw message