Return-Path: X-Original-To: apmail-flink-dev-archive@www.apache.org Delivered-To: apmail-flink-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7D1BA17B46 for ; Wed, 26 Aug 2015 10:38:47 +0000 (UTC) Received: (qmail 76370 invoked by uid 500); 26 Aug 2015 10:38:47 -0000 Delivered-To: apmail-flink-dev-archive@flink.apache.org Received: (qmail 76154 invoked by uid 500); 26 Aug 2015 10:38:47 -0000 Mailing-List: contact dev-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@flink.apache.org Delivered-To: mailing list dev@flink.apache.org Received: (qmail 75961 invoked by uid 99); 26 Aug 2015 10:38:47 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 26 Aug 2015 10:38:47 +0000 Date: Wed, 26 Aug 2015 10:38:47 +0000 (UTC) From: "Aljoscha Krettek (JIRA)" To: dev@flink.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (FLINK-2577) Watermarks Stall When a Source Finishes Prematurely MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Aljoscha Krettek created FLINK-2577: --------------------------------------- Summary: Watermarks Stall When a Source Finishes Prematurely Key: FLINK-2577 URL: https://issues.apache.org/jira/browse/FLINK-2577 Project: Flink Issue Type: Bug Components: Streaming Affects Versions: 0.10 Reporter: Aljoscha Krettek Assignee: Aljoscha Krettek Fix For: 0.10 The problem with a streaming source that closes is that downstream operations never notice that it is not running anymore and keep waiting for watermarks from all upstream operations (including the source). This has the effect that watermarks just stop propagating through the topology. I think an easy fix is to change sources to emit a last watermark of +Inf before closing. Because watermarks are always the minimum of all watermarks on the inputs this would have the effect of advancing only depending on the other inputs. The added benefit would be that once all sources emit a +Inf watermark the operator also get's a last +Inf watermark which tells it that all sources are done. Right now, streaming operators (and user code) have no way of telling if there are going to come elements in the future. This is especially problematic in Co-Map (Co-FlatMap) operations where you have one input that feeds a hash-table and the other input is elements that you want to stream by this hash-table. -- This message was sent by Atlassian JIRA (v6.3.4#6332)