flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Julio Biason <julio.bia...@azion.com>
Subject Re: Trying to figure out why a slot takes a long time to checkpoint
Date Fri, 14 Sep 2018 23:03:39 GMT
(Just an addendum: Although it's not a huge problem -- we can always
increase the checkpoint timeout time -- this anomalous situation makes me
think there is something wrong in our pipeline or in our cluster, and that
is what is making the checkpoint creation go crazy.)

On Fri, Sep 14, 2018 at 8:00 PM, Julio Biason <julio.biason@azion.com>
wrote:

> Hey guys,
>
> On our pipeline, we have a single slot that it's taking longer to create
> the checkpoint compared to other slots and we are wondering what could be
> causing it.
>
> The operator in question is the window metric -- the only element in the
> pipeline that actually uses the state. While the other slots take 7 mins to
> create the checkpoint, this one -- and only this one -- takes 55mins.
>
> Is there something I should look at to understand what's going on?
>
> (We are storing all checkpoints in HDFS, in case that helps.)
>
> --
> *Julio Biason*, Sofware Engineer
> *AZION*  |  Deliver. Accelerate. Protect.
> Office: +55 51 3083 8101 <callto:+555130838101>  |  Mobile: +55 51
> <callto:+5551996209291>*99907 0554*
>



-- 
*Julio Biason*, Sofware Engineer
*AZION*  |  Deliver. Accelerate. Protect.
Office: +55 51 3083 8101 <callto:+555130838101>  |  Mobile: +55 51
<callto:+5551996209291>*99907 0554*

Mime
View raw message