Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id BAB16200D5A for ; Thu, 30 Nov 2017 02:05:53 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id B7E11160C19; Thu, 30 Nov 2017 01:05:53 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 0A6D9160C04 for ; Thu, 30 Nov 2017 02:05:52 +0100 (CET) Received: (qmail 13515 invoked by uid 500); 30 Nov 2017 01:05:51 -0000 Mailing-List: contact dev-help@kafka.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@kafka.apache.org Delivered-To: mailing list dev@kafka.apache.org Received: (qmail 13503 invoked by uid 99); 30 Nov 2017 01:05:51 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Nov 2017 01:05:51 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 1DCECC49D8 for ; Thu, 30 Nov 2017 01:05:51 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -99.811 X-Spam-Level: X-Spam-Status: No, score=-99.811 tagged_above=-999 required=6.31 tests=[KB_WAM_FROM_NAME_SINGLEWORD=0.2, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id OA6uVguEsLdD for ; Thu, 30 Nov 2017 01:05:50 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id 221CD5F6C8 for ; Thu, 30 Nov 2017 01:05:47 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 887EFE25BE for ; Thu, 30 Nov 2017 01:05:44 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 346DA21065 for ; Thu, 30 Nov 2017 01:05:43 +0000 (UTC) Date: Thu, 30 Nov 2017 01:05:43 +0000 (UTC) From: "Guozhang Wang (JIRA)" To: dev@kafka.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (KAFKA-6286) Consider reusing materialized store for multi-same-stream join MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Thu, 30 Nov 2017 01:05:53 -0000 Guozhang Wang created KAFKA-6286: ------------------------------------ Summary: Consider reusing materialized store for multi-same-stream join Key: KAFKA-6286 URL: https://issues.apache.org/jira/browse/KAFKA-6286 Project: Kafka Issue Type: Bug Components: streams Reporter: Guozhang Wang Imagine the following streams application: {code} stream1.join(stream2...)..join(stream2) {code} Each join will result in {{stream2}} being materialized into a separate store. Arguably such multi-joins that involves the same stream multiple times is rare, but it worth considering if we can optimize such cases. One thing to note, though, is that in our DSL parser today we do "put into store first, and then query the other store second", which means if we share the same store it would result in duplicates as the matching would already see the newly put records in the second join. -- This message was sent by Atlassian JIRA (v6.4.14#64029)