Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 51398200CA4 for ; Wed, 24 May 2017 07:45:03 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 50511160BD3; Wed, 24 May 2017 05:45:03 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 96E1F160BC3 for ; Wed, 24 May 2017 07:45:02 +0200 (CEST) Received: (qmail 38312 invoked by uid 500); 24 May 2017 05:45:01 -0000 Mailing-List: contact dev-help@kafka.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@kafka.apache.org Delivered-To: mailing list dev@kafka.apache.org Received: (qmail 38299 invoked by uid 99); 24 May 2017 05:45:01 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 May 2017 05:45:01 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id B43B0190DBF for ; Wed, 24 May 2017 05:45:00 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.979 X-Spam-Level: * X-Spam-Status: No, score=1.979 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=confluent-io.20150623.gappssmtp.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id VKkFiLk0VfaF for ; Wed, 24 May 2017 05:44:57 +0000 (UTC) Received: from mail-vk0-f54.google.com (mail-vk0-f54.google.com [209.85.213.54]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 4200F5F19B for ; Wed, 24 May 2017 05:44:57 +0000 (UTC) Received: by mail-vk0-f54.google.com with SMTP id y190so69919194vkc.1 for ; Tue, 23 May 2017 22:44:56 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=confluent-io.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to; bh=srwym5oRdQQpcWaxekRedVaO+umCGOWemqV2Bx7gv6A=; b=GwG8NwwTy9KYVFGg3U4tfF9h4fJ36k8uqpqJ+tTEWNFBNkq5Y8F6JC6hE4DezwuWLc C6iZKL2c6G/UtM77mMnwGraObs98qp/BWdB9x1WsK7cLNt0rEhZJooY3mqZGbrF567Ju fQtVuObAGRShHmpEQ8Bba2QWlTdOv1YQJwdLMqIzi4FHocANie6EUBoJW5xc2t4PdYXZ VGumNVpgntekiiEMlgOI9ki2+k2VOdVkirXSlQ0qvafULRZeGvOSUBWcem6nsBmIM5lX JzXVYnjt5slG62h+k9FH1LIFlKEunJev2FmH7mSSCUmZqIELkMbPrDW1J+8I3Floc2Pg yFPQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to; bh=srwym5oRdQQpcWaxekRedVaO+umCGOWemqV2Bx7gv6A=; b=qNM8Rnp4gtNepyCsXjL//yo+2cWuzX0/rkWDR2r3XcchzUKjcKWO/WYadzgsWx6HUI m0xMejDCR7xH3cVozE0VR6L3mS1JgNcSVDEIdP1K9x+pH5J96xI9mAZxK2s9WzXKNwb1 3sIwdEDNEOo2KlxriRXM1bXfXjsltFSTOe+oLN3Rd0EFenUqrZrQnhhs9E16MWQ2pgvt ++CuF1s5IGSBPMpxxmtJILUNY9SxGqkZt2ShFvCthN1GFgi4dTNdvzA+QocAQFFSYgnc kigmv4p9934kqPSKF0fiOdbMPKqTR0cQeaChzLpJwhFUB1lPcQwMXeOkOGDxh97JQz3J gqGw== X-Gm-Message-State: AODbwcBVVUiM0QxJ1RpesrlMFC+yRZrVkizx4Pwxj0brDSai+TnxbMxF dNsfi9epV3tdqmuuNoYFSCUJY/EVnPHT X-Received: by 10.31.196.66 with SMTP id u63mr742397vkf.47.1495604695770; Tue, 23 May 2017 22:44:55 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: =?UTF-8?B?WGF2aWVyIEzDqWF1dMOp?= Date: Wed, 24 May 2017 05:44:45 +0000 Message-ID: Subject: Re: [Vote] KIP-150 - Kafka-Streams Cogroup To: "dev@kafka.apache.org" Content-Type: multipart/alternative; boundary="001a114e739a9dbd0b05503e9a3c" archived-at: Wed, 24 May 2017 05:45:03 -0000 --001a114e739a9dbd0b05503e9a3c Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable I don't think we should wait for entries from each stream, since that might limit the usefulness of the cogroup operator. There are instances where it can be useful to compute something based on data from one or more stream, without having to wait for all the streams to produce something for the group. In the example I gave in the discussion, it is possible to compute impression/auction statistics without having to wait for click data, which can typically arrive several minutes late. We could have a separate discussion around adding inner / outer modifiers to each of the streams to decide which fields are optional / required before sending updates if we think that might be useful. On Tue, May 23, 2017 at 6:28 PM Guozhang Wang wrote: > The proposal LGTM, +1 > > One question I have is about when to send the record to the resulted KTab= le > changelog. For example in your code snippet in the wiki page, before you > see the end result of > > 1L, Customer[ > > cart:{Item[no:01], Item[no:03], Item[no:04]}, > purchases:{Item[no:07], Item[no:08]}, > wishList:{Item[no:11]} > ] > > > You will firs see > > 1L, Customer[ > > cart:{Item[no:01]}, > purchases:{}, > wishList:{} > ] > > 1L, Customer[ > > cart:{Item[no:01]}, > purchases:{Item[no:07],Item[no:08]}, > > wishList:{} > ] > > 1L, Customer[ > > cart:{Item[no:01]}, > purchases:{Item[no:07],Item[no:08]}, > > wishList:{} > ] > > ... > > > I'm wondering if it makes more sense to only start sending the update if > the corresponding agg-key has seen at least one input from each of the > input stream? Maybe it is out of the scope of this KIP and we can make it= a > more general discussion in a separate one. > > > Guozhang > > > On Fri, May 19, 2017 at 8:37 AM, Xavier L=C3=A9aut=C3=A9 > wrote: > > > Hi Kyle, I left a few more comments in the discussion thread, if you > > wouldn't mind taking a look > > > > On Fri, May 19, 2017 at 5:31 AM Kyle Winkelman > > > wrote: > > > > > Hello all, > > > > > > I would like to start the vote on KIP-150. > > > > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-150+-+ > > Kafka-Streams+Cogroup > > > > > > Thanks, > > > Kyle > > > > > > > > > -- > -- Guozhang > --001a114e739a9dbd0b05503e9a3c--