Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 9FE98200CDF for ; Wed, 26 Jul 2017 05:52:38 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 9E856168121; Wed, 26 Jul 2017 03:52:38 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E3FD8168119 for ; Wed, 26 Jul 2017 05:52:37 +0200 (CEST) Received: (qmail 65446 invoked by uid 500); 26 Jul 2017 03:52:31 -0000 Mailing-List: contact user-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list user@flink.apache.org Received: (qmail 65436 invoked by uid 99); 26 Jul 2017 03:52:31 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Jul 2017 03:52:31 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 43BBAC02A9 for ; Wed, 26 Jul 2017 03:52:31 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -2.901 X-Spam-Level: X-Spam-Status: No, score=-2.901 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-2.8, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id GEuwL3_lwbEK for ; Wed, 26 Jul 2017 03:52:30 +0000 (UTC) Received: from mail-pg0-f48.google.com (mail-pg0-f48.google.com [74.125.83.48]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 894C65FB84 for ; Wed, 26 Jul 2017 03:52:29 +0000 (UTC) Received: by mail-pg0-f48.google.com with SMTP id v190so78539302pgv.2 for ; Tue, 25 Jul 2017 20:52:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-transfer-encoding:content-language; bh=vprhQRn0u9CfCw0i7GMTksu2ohnZznRQrxWsiTxtbz8=; b=WibsZJnUj1Tp38UKKShZGDIVDqVhZi22MltKQ+q6oBsDJERtX0i2umY+7Wh3ITr05z 5j/YbLEJyVKPArTyiJXjuxSEdmrF3D9g76bZRx7kcC5GEGenwDR4+4bRAtY+dKjwBxld l6Vq+PEgFFNXBCqU1YUt2pImLV8SuD4/JcwKaxt7DE4xq1WhJVu4Q312Lf0AoZGxh5tW LGrzKIG6oDFjLbOinzSauN0rSHaJkLX5wfR5qMYuamXP+PRyMno8TJBsfK77WMt/+doS JtM19p9fqEvLffjy/DcXPlYg9Lrx43pdfdZizJr51F4/aE3Je9EeF+2Z2pwzsy8wA/xk jxDw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=vprhQRn0u9CfCw0i7GMTksu2ohnZznRQrxWsiTxtbz8=; b=cundjGBvJu+HMAYVbahucn+zEBmh+EVKqczbtbhgCxcjdmV3Vy1s1P/vtfzdD5cyIb dIxkF/g+AvMdthxj9tJHp8UXZPYx/9z/ZLowFjKD7OYvqEG7kmP6pE296a2BifwPFBuN yrK8bZNcI3zZw/xmeZWdUb+l6GI0ec8e1zwdq6DkQgvgd2xi/MlHbPFVarsF6ZLJnERC PPtwknR11eOoJDgRw2vkEy0fleB3OUmGhZU6DPOylOyOgx1JhhjKloet6NUY+Ve7G7kY KAHDnm+7WKEQWDPIBCmc1pADbQn1ddjgPa8dCbexm9XOgliNH6S9BuZHhtDg/ic2RFNm 2/Cg== X-Gm-Message-State: AIVw111RBVCURUvcS0k8jP3UyPNsFRikbGpCOO6jCcQ+a9Hwgxwily+h s+sbDp7f9GSwUPwa7z4= X-Received: by 10.99.106.1 with SMTP id f1mr21624822pgc.139.1501041147665; Tue, 25 Jul 2017 20:52:27 -0700 (PDT) Received: from [192.168.0.3] ([113.190.239.92]) by smtp.gmail.com with ESMTPSA id f9sm1860293pgr.92.2017.07.25.20.52.24 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 25 Jul 2017 20:52:25 -0700 (PDT) Subject: Re: Purging Late stream data To: "G.S.Vijay Raajaa" , user References: From: Kien Truong Message-ID: Date: Wed, 26 Jul 2017 10:52:24 +0700 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:55.0) Gecko/20100101 Thunderbird/55.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US archived-at: Wed, 26 Jul 2017 03:52:38 -0000 Hi, One method you can use is using a ProcessFunction. In the process function, you get the timer service through the function context, which can then be used to schedule a task to clean up late data. Check out the docs for ProcessFunction https://ci.apache.org/projects/flink/flink-docs-release-1.3/dev/stream/process_function.html Regards, Kien On 7/26/2017 9:37 AM, G.S.Vijay Raajaa wrote: > Hi, > > I am having 3 streams which is being merged from a union of kafka > topics on a given timestamp. The problem I am facing is that, if there > is a delay  in one of the stream and when the data in that particular > stream arrives at a later point in time, the merge happens in a > delayed fashion. The way I want to solve is that, I want to drop such > data streams which comes after a delay ( say 5sec ). Kindly direct me > on how to go about it? > > Will watermarking (to process in even time) + the allowed lateness > help solve the same? > > Regards, > Vijay Raajaa G S