From user-return-34072-archive-asf-public=cust-asf.ponee.io@flink.apache.org  Sun Apr 12 07:32:27 2020
Return-Path: <user-return-34072-archive-asf-public=cust-asf.ponee.io@flink.apache.org>
X-Original-To: archive-asf-public@cust-asf.ponee.io
Delivered-To: archive-asf-public@cust-asf.ponee.io
Received: from mail.apache.org (hermes.apache.org [207.244.88.153])
	by mx-eu-01.ponee.io (Postfix) with SMTP id 49841180608
	for <archive-asf-public@cust-asf.ponee.io>; Sun, 12 Apr 2020 09:32:27 +0200 (CEST)
Received: (qmail 38602 invoked by uid 500); 12 Apr 2020 07:32:25 -0000
Mailing-List: contact user-help@flink.apache.org; run by ezmlm
Precedence: bulk
List-Help: <mailto:user-help@flink.apache.org>
List-Unsubscribe: <mailto:user-unsubscribe@flink.apache.org>
List-Post: <mailto:user@flink.apache.org>
List-Id: <user.flink.apache.org>
Delivered-To: mailing list user@flink.apache.org
Received: (qmail 38590 invoked by uid 99); 12 Apr 2020 07:32:25 -0000
Received: from ui-eu-02.ponee.io (HELO localhost) (116.202.110.96)
    by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 12 Apr 2020 07:32:25 +0000
In-Reply-To: 
 <CY4PR20MB12239E0E9F628F3DFDFF8899DAC30@CY4PR20MB1223.namprd20.prod.outlook.com>
From: Shachar Carmeli <carmeli.dev@gmail.com>
MIME-Version: 1.0
x-ponymail-agent: PonyMail Composer/0.2
References: 
 <CY4PR20MB12239E0E9F628F3DFDFF8899DAC30@CY4PR20MB1223.namprd20.prod.outlook.com> 
 <pony-8ce66dedde08e8543255928c096e3a3ab7c378cb-80c5066eec714ade1ec59ca9829abcaf698d0676@user.flink.apache.org>
Message-ID: <pony-8ce66dedde08e8543255928c096e3a3ab7c378cb-c297b5c0d272c918bf2f4c5095a3db0d5d7bb4b3@user.flink.apache.org>
x-ponymail-sender: 8ce66dedde08e8543255928c096e3a3ab7c378cb
X-Mailer: LuaSocket 3.0-rc1
To: <user@flink.apache.org>
Subject: Re: Flink incremental checkpointing - how long does data is kept in the share folder
Date: Sun, 12 Apr 2020 07:32:24 -0000
Content-Type: text/plain; charset=utf-8

Thank you for the quick response
Your answer related to the checkpoint folder that contains the _metadata file e.g. chk-1829 
What about the "shared" folder , how do I know which  files in that folder are still relevant and which are left over from a failed checkpoint , they are not directly related to the _metadata checkpoint or am I missing something?


On 2020/04/07 18:37:57, Yun Tang <myasuka@live.com> wrote: 
> Hi Shachar
> 
> Why do we see data that is older from lateness configuration
> There might existed three reasons:
> 
>   1.  RocksDB really still need that file in current checkpoint. If we upload one file named as 42.sst at 2/4 at some old checkpoint, current checkpoint could still include that 42.sst file again if that file is never be compacted since then. This is possible in theory.
>   2.  Your checkpoint size is large and checkpoint coordinator could not remove as fast as possible before exit.
>   3.  That file is created by a crash task manager and not known to checkpoint coordinator.
> 
> How do I know that the files belong to a valid checkpoint and not a checkpoint of a crushed job - so we can delete those files
> You have to call Checkpoints#loadCheckpointMetadata[1] to load latest _metadata in checkpoint directory and compare the file paths with current files in checkpoint directory. The ones are not in the checkpoint meta and older than latest checkpoint could be removed. You could follow this to debug or maybe I could write a tool to help know what files could be deleted later.
> 
> [1] https://github.com/apache/flink/blob/693cb6adc42d75d1db720b45013430a4c6817d4a/flink-runtime/src/main/java/org/apache/flink/runtime/checkpoint/Checkpoints.java#L96
> 
> Best
> Yun Tang
> 
> ________________________________
> From: Shachar Carmeli <carmeli.dev@gmail.com>
> Sent: Tuesday, April 7, 2020 16:19
> To: user@flink.apache.org <user@flink.apache.org>
> Subject: Flink incremental checkpointing - how long does data is kept in the share folder
> 
> We are using Flink 1.6.3 and keeping the checkpoint in CEPH ,retaining only one checkpoint at a time , using incremental and using rocksdb.
> 
> We run windows with lateness of 3 days , which means that we expect that no data in the checkpoint share folder will be kept after 3-4 days ,Still We see that there is data from more than that
> e.g.
> If today is 7/4 there are some files from the 2/4
> 
> Sometime we see checkpoints that we assume (due to the fact that its index number is not in synch) that it belongs to a job that crushed and the checkpoint was not used to restore the job
> 
> My questions are
> 
> Why do we see data that is older from lateness configuration
> How do I know that the files belong to a valid checkpoint and not a checkpoint of a crushed job - so we can delete those files
>