flink-user mailing list archives

From Karthik Deivasigamani <karthi...@gmail.com>
Subject Checkpoint was declined (tasks not ready)
Date Fri, 06 Oct 2017 14:52:38 GMT
I'm noticing a weird issue with our Flink streaming job. We use the async
I/O operator to make an HTTP call, and in certain cases, when the async
task times out, it throws an exception that causes the job to restart.

java.lang.Exception: An async function call terminated with an
exception. Failing the AsyncWaitOperator.
	at org.apache.flink.streaming.api.operators.async.Emitter.output(Emitter.java:136)
	at org.apache.flink.streaming.api.operators.async.Emitter.run(Emitter.java:83)
	at java.lang.Thread.run(Thread.java:745)
Caused by: java.util.concurrent.ExecutionException:
java.util.concurrent.TimeoutException: Async function call has timed
	at org.apache.flink.runtime.concurrent.impl.FlinkFuture.get(FlinkFuture.java:110)
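
As an aside on the failure mode in the trace above: one common way to keep a
timed-out call from surfacing as an exception is to map the timeout to a
fallback value. The following is only a minimal sketch in plain
java.util.concurrent, not Flink's async I/O API. The class name
AsyncTimeoutSketch, the method withFallbackOnTimeout, and the "TIMED_OUT"
marker are hypothetical names chosen for illustration, and whether a fallback
value is acceptable for the job described here is an assumption.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not part of Flink: turns a timeout into a fallback value.
class AsyncTimeoutSketch {

    // Returns a future that completes with the call's result, or with
    // `fallback` if the call has not completed within `timeoutMillis`.
    // Failures of the call itself are still propagated exceptionally.
    static CompletableFuture<String> withFallbackOnTimeout(
            CompletableFuture<String> call,
            long timeoutMillis,
            String fallback,
            ScheduledExecutorService timer) {
        CompletableFuture<String> result = new CompletableFuture<>();

        // Forward the outcome of the real call, whichever way it ends.
        call.whenComplete((value, error) -> {
            if (error == null) {
                result.complete(value);
            } else {
                result.completeExceptionally(error);
            }
        });

        // If the timer fires first, complete with the fallback instead.
        // complete() is a no-op when the result is already set.
        timer.schedule(() -> {
            result.complete(fallback);
        }, timeoutMillis, TimeUnit.MILLISECONDS);

        return result;
    }

    public static void main(String[] args) {
        ScheduledExecutorService timer = Executors.newSingleThreadScheduledExecutor();
        try {
            // Stands in for an HTTP call that never answers.
            CompletableFuture<String> stuck = new CompletableFuture<>();
            System.out.println(withFallbackOnTimeout(stuck, 50, "TIMED_OUT", timer).join());

            // Stands in for an HTTP call that answers immediately.
            CompletableFuture<String> ok = CompletableFuture.completedFuture("200 OK");
            System.out.println(withFallbackOnTimeout(ok, 50, "TIMED_OUT", timer).join());
        } finally {
            timer.shutdownNow();
        }
    }
}
```

In a real job the equivalent logic would have to live inside the async
function itself. How that interacts with the operator's own timeout depends
on the Flink version and is not shown here.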

After the job restarts (we have a fixed restart strategy), we notice that the
checkpoints start failing continuously with this message:
Checkpoint was declined (tasks not ready)

[image: Inline image 1]

But we can see that the job is running: it's processing data, the
accumulators we have are getting incremented, and so on. Yet checkpointing
keeps failing with the "tasks not ready" message.

I wanted to reach out to the community to see whether anyone else has
experienced this issue before.
