Return-Path: X-Original-To: apmail-flink-user-archive@minotaur.apache.org Delivered-To: apmail-flink-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8D55D18938 for ; Wed, 13 Jan 2016 11:25:24 +0000 (UTC) Received: (qmail 62647 invoked by uid 500); 13 Jan 2016 11:25:24 -0000 Delivered-To: apmail-flink-user-archive@flink.apache.org Received: (qmail 62565 invoked by uid 500); 13 Jan 2016 11:25:24 -0000 Mailing-List: contact user-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@flink.apache.org Delivered-To: mailing list user@flink.apache.org Received: (qmail 62555 invoked by uid 99); 13 Jan 2016 11:25:24 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 13 Jan 2016 11:25:24 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id D1F7F1A01D1 for ; Wed, 13 Jan 2016 11:25:23 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.879 X-Spam-Level: ** X-Spam-Status: No, score=2.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 8jW5aHyBF5qG for ; Wed, 13 Jan 2016 11:25:22 +0000 (UTC) Received: from mail-wm0-f52.google.com (mail-wm0-f52.google.com [74.125.82.52]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 009C6258C7 for ; Wed, 13 Jan 2016 11:25:22 +0000 (UTC) Received: by mail-wm0-f52.google.com with SMTP id l65so289101031wmf.1 for ; Wed, 13 Jan 2016 03:25:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :content-type; bh=kvY7SnhU/Yt3rhoJsYaOdXY3LEVuOYPOn2USSOSdWWQ=; b=jqQ/uo7+UUMrYh83w00vYDCTzbi+TCrMFg0U5lgfZTpv4D8rc+rbK1Dg+XQ2k43zZv SU3XkzrXbeHuojZH7rWFVAgc5q+SaRPDb1oBOjMzBCH9dy6sGiCdli6Wkfk38+ZF57ra K0OZ/U//NfXUphVAqiDgLFwVBun4CVU7b3YPYGZhqWHDMYpC0bK6bWKT3VcUAP8M7x6e BdSZV/3rSElbWI+1Uwh8TQSa/kKPlr67K1TgKmrBVAqcNAQXVotguPSmXQ2vxjCQVDO1 qx6boVyfflML4IB31GCsmbyFcsf7YC0Rmnzbrgirtxgsvl8DCWX57mE4ZEY3UasCFNMC eWWw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:content-type; bh=kvY7SnhU/Yt3rhoJsYaOdXY3LEVuOYPOn2USSOSdWWQ=; b=PXKeRb/wZXB1BVjo3erCJp/Ap8e6KM+MlHfa25L6C4R0GHrdiD4uLqdK0lTMuNW+79 cSQJk2FShLEitvqYJm62qnzaQqe9qF7dneLF/HSGPwKDJlija/7VMuBd/kTWg7+EMBEo ufDzBlxkwhrp/fa8B8riDzBUPTk58D/QCPm/rq41oGS8cSmJslI/3GpjRHHm2HJzCMl3 SSnSfBexxojAYG+ZYTJIA/dMTQyQTim/AQLmZPT1PvSOAjggT60+vYNS56kgYBbgOFeK wtqjlYIqMG5tRKKLQtZ8H+FgDw43zV81I+i2I07ISfptPLIkuo86glD8NuuGFeRxmsdk Jj2g== X-Gm-Message-State: ALoCoQlx5VhXY6a00ta76khmy0DwJgnEW4txwFhileVuZI/LQwhmZCbS8UMzO0tzc6jb4/Qaa7Q1ZvBTSFt9Jf17Eo3yBkZhug== X-Received: by 10.28.173.71 with SMTP id w68mr26632825wme.88.1452684321650; Wed, 13 Jan 2016 03:25:21 -0800 (PST) MIME-Version: 1.0 References: In-Reply-To: From: Don Frascuchon Date: Wed, 13 Jan 2016 11:25:11 +0000 Message-ID: Subject: Re: Checkpoint for exact-once proccessing To: user@flink.apache.org Content-Type: multipart/alternative; boundary=001a11444b90f680ae0529356c72 --001a11444b90f680ae0529356c72 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi Stephan, Thanks for your quickly response. So, consider an operator task with two processed records and no barrier incoming. If the task fail and must be records, the last consistent snapshot will be used, which no includes information about the processed but no checkpointed records. What about this situation? The registers will be resent to failed task after, or will be discarded? How flink manage information about this records for exact-once guarantees? The user function inside operator must be idempotent (i think about some kind of persistence in a sink task) Thanks in advance ! El mi=C3=A9., 13 ene. 2016 a las 11:17, Stephan Ewen () escribi=C3=B3: > Hi! > > I think there is a misunderstanding. There are no identifiers maintained > and no individual records deleted. > > On recovery, all operators reset their state to a consistent snapshot: > https://ci.apache.org/projects/flink/flink-docs-release-0.10/internals/st= ream_checkpointing.html > > > Greetings, > Stephan > > > On Wed, Jan 13, 2016 at 11:08 AM, Don Frascuchon > wrote: > >> Hello, >> >> I'm trying to understand the process of checkpoint processing for >> exact-once in Flink, and I have some doubts. >> >> The documentation says that when there is a failure and the state of an >> operator is restored, the already processed records are deleted based on >> their identifiers. >> >> My doubts is, how these identifiers between two checkpoints are >> maintained? Every time a new input record comes to the stateful operator= , >> Flink persists it before making the checkpoint? Otherwise, there may be >> messages to reprocess after a failure. >> >> Thanks !!! >> > > --001a11444b90f680ae0529356c72 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hi Stephan,

Thanks for your quick= ly response.

So, consider an operator task with two proc= essed records and no barrier incoming. If the task fail and must be records= , the last consistent snapshot will be used, which no includes information = about the processed but no checkpointed =C2=A0records. What about this situ= ation? The registers will be resent to failed task after, or will be discar= ded? How flink manage information about this records for exact-once=C2=A0guarantee= s? The user function inside operator must be idempotent (i think about some= kind of persistence in=C2=A0=C2=A0a sink task)

=
Thanks in advance !


El mi=C3=A9., 13 ene. 2016 a las 11:17, Stephan Ewe= n (<sewen@apache.org>) escrib= i=C3=B3:
Hi!
I think there is a misunderstanding. There are no identifiers maintained = and no individual records deleted.

On recovery, all oper= ators reset their state to a consistent snapshot:=C2=A0https://ci.apache.org/projects/flink/flink-= docs-release-0.10/internals/stream_checkpointing.html


Greetings,
Stephan


On Wed, Jan 13= , 2016 at 11:08 AM, Don Frascuchon <frascuchon@gmail.com>= wrote:
Hello,=C2=A0
I'm trying to understand the process of checkpoint= processing for exact-once in Flink, and I have some doubts.

=
The documentation says that when there is a failure and the stat= e of an operator is restored, the already processed records are deleted bas= ed on their identifiers.

My doubts is, how these i= dentifiers between two checkpoints are maintained? Every time a new input r= ecord comes to the stateful operator, Flink persists it before making the c= heckpoint? Otherwise, there may be messages to reprocess after a failure.

Thanks !!!

--001a11444b90f680ae0529356c72--