Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 822BB200B78 for ; Fri, 2 Sep 2016 17:30:22 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 8098C160AAE; Fri, 2 Sep 2016 15:30:22 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id C8473160A8C for ; Fri, 2 Sep 2016 17:30:21 +0200 (CEST) Received: (qmail 64247 invoked by uid 500); 2 Sep 2016 15:30:20 -0000 Mailing-List: contact dev-help@kafka.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@kafka.apache.org Delivered-To: mailing list dev@kafka.apache.org Received: (qmail 64232 invoked by uid 99); 2 Sep 2016 15:30:20 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 Sep 2016 15:30:20 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 9F2322C1B81 for ; Fri, 2 Sep 2016 15:30:20 +0000 (UTC) Date: Fri, 2 Sep 2016 15:30:20 +0000 (UTC) From: "Matthias J. Sax (JIRA)" To: dev@kafka.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (KAFKA-3478) Finer Stream Flow Control MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Fri, 02 Sep 2016 15:30:22 -0000 [ https://issues.apache.org/jira/browse/KAFKA-3478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15458831#comment-15458831 ] Matthias J. Sax commented on KAFKA-3478: ---------------------------------------- All newly creates "Streams" Jiras are assigned to him by default on creation. This does not mean anything. Just reassign to yourself if you want to pick onw up. > Finer Stream Flow Control > ------------------------- > > Key: KAFKA-3478 > URL: https://issues.apache.org/jira/browse/KAFKA-3478 > Project: Kafka > Issue Type: Sub-task > Components: streams > Reporter: Guozhang Wang > Labels: user-experience > Fix For: 0.10.1.0 > > > Today we have a event-time based flow control mechanism in order to synchronize multiple input streams in a best effort manner: > http://docs.confluent.io/3.0.0/streams/architecture.html#flow-control-with-timestamps > However, there are some use cases where users would like to have finer control of the input streams, for example, with two input streams, one of them always reading from offset 0 upon (re)-starting, and the other reading for log end offset. > Today we only have one consumer config "offset.auto.reset" to control that behavior, which means all streams are read either from "earliest" or "latest". > We should consider how to improve this settings to allow users have finer control over these frameworks. > ===== > A finer flow control could also be used to allow for populating a {{KTable}} (with an "initial" state) before starting the actual processing (this feature was ask for in the mailing list multiple times already). Even if it is quite hard to define, *when* the initial populating phase should end, this might still be useful. There would be the following possibilities: > 1) an initial fixed time period for populating > (it might be hard for a user to estimate the correct value) > 2) an "idle" period, ie, if no update to a KTable for a certain time is > done, we consider it as populated > 3) a timestamp cut off point, ie, all records with an older timestamp > belong to the initial populating phase > 4) a throughput threshold, ie, if the populating frequency falls below > the threshold, the KTable is considered "finished" > 5) maybe something else ?? > The API might look something like this > {noformat} > KTable table = builder.table("topic", 1000); // populate the table without reading any other topics until see one record with timestamp 1000. > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)