Date: Wed, 14 Sep 2016 08:24:20 +0000 (UTC)
From: "Aljoscha Krettek (JIRA)"
To: issues@flink.apache.org
Reply-To: dev@flink.apache.org
Subject: [jira] [Commented] (FLINK-4410) Split checkpoint times into synchronous and asynchronous part

    [ https://issues.apache.org/jira/browse/FLINK-4410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15489788#comment-15489788 ]

Aljoscha Krettek commented on FLINK-4410:
-----------------------------------------

Hi,

there are actually three different durations that could be reported:
 - the time from the checkpoint coordinator initiating a checkpoint to an operator acknowledging that checkpoint
 - the time that an operator spends in the synchronous part of the checkpoint
 - the time that an operator spends in the asynchronous part of the checkpoint

Regarding synchronous/asynchronous: for this you can look at {{StreamTask.performCheckpoint()}}. At the end of the method, a Thread is started that does the asynchronous work of the checkpoint, and the method returns. Thus, the time up to that point would be the synchronous part, and the time spent in that thread would be the asynchronous part.

> Split checkpoint times into synchronous and asynchronous part
> -------------------------------------------------------------
>
>                 Key: FLINK-4410
>                 URL: https://issues.apache.org/jira/browse/FLINK-4410
>             Project: Flink
>          Issue Type: Improvement
>          Components: Webfrontend
>            Reporter: Ufuk Celebi
>            Priority: Minor
>
> Checkpoint statistics contain the duration of a checkpoint. We should split this time into the synchronous and asynchronous part. This will give more insight into the inner workings of the checkpointing mechanism and help users better understand what's going on.
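
For illustration, the synchronous/asynchronous split described in the comment could be measured roughly as in the sketch below. This is not Flink code: the class CheckpointTimingSketch, its Timings holder, and this performCheckpoint(Runnable, Runnable) signature are hypothetical stand-ins that only mimic the shape described for {{StreamTask.performCheckpoint()}} (do the synchronous work, start a thread for the asynchronous work, return). The third duration, from the coordinator's trigger to the operator's acknowledgement, is not modelled here.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;

public class CheckpointTimingSketch {

    /** Holds the two measured durations, in nanoseconds. */
    public static final class Timings {
        public final long syncNanos;
        public final long asyncNanos;

        Timings(long syncNanos, long asyncNanos) {
            this.syncNanos = syncNanos;
            this.asyncNanos = asyncNanos;
        }
    }

    /**
     * Hypothetical stand-in for a checkpoint method: runs the synchronous
     * part inline, then starts a thread for the asynchronous part and
     * returns without waiting for it.
     */
    public static CompletableFuture<Timings> performCheckpoint(
            Runnable syncPart, Runnable asyncPart) {
        CompletableFuture<Timings> result = new CompletableFuture<>();

        // Synchronous part: everything up to the point where the thread is started.
        long syncStart = System.nanoTime();
        syncPart.run();
        long syncNanos = System.nanoTime() - syncStart;

        // Asynchronous part: the time spent inside the started thread.
        Thread asyncThread = new Thread(() -> {
            long asyncStart = System.nanoTime();
            try {
                asyncPart.run();
                result.complete(new Timings(syncNanos, System.nanoTime() - asyncStart));
            } catch (Throwable t) {
                result.completeExceptionally(t);
            }
        }, "async-checkpoint-sketch");
        asyncThread.start();

        return result; // the caller is not blocked by the asynchronous work
    }

    private static void sleep(long millis) {
        try {
            Thread.sleep(millis);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    public static void main(String[] args) throws Exception {
        // Simulated work; the sleep durations are arbitrary placeholders.
        Timings t = performCheckpoint(() -> sleep(20), () -> sleep(50))
                .get(5, TimeUnit.SECONDS);
        System.out.printf("sync=%d ms, async=%d ms%n",
                TimeUnit.NANOSECONDS.toMillis(t.syncNanos),
                TimeUnit.NANOSECONDS.toMillis(t.asyncNanos));
    }
}
```

The asynchronous duration is taken inside the started thread rather than in the caller, so it covers only the time spent in that thread, matching the definition given in the comment.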