Mailing-List: contact dev-help@flink.apache.org; run by ezmlm
Reply-To: dev@flink.apache.org
Delivered-To: mailing list dev@flink.apache.org
MIME-Version: 1.0
From: Tyler Akidau
Date: Wed, 02 Aug 2017 06:17:44 +0000
Subject: Re: Towards a spec for robust streaming SQL, Part 2
To: dev@calcite.apache.org
Cc: Dev, dev@beam.apache.org
Content-Type: multipart/alternative; boundary="001a114e56fe7e6b7c0555bf3966"
archived-at: Wed, 02 Aug 2017 06:18:07 -0000

--001a114e56fe7e6b7c0555bf3966
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Thank you all for the comments/input, I appreciate the time you've put into
this. I've responded to a handful of the major ones. There are some more I'd
like to respond to, but I'm out of time for tonight, so more tomorrow.

-Tyler

On Tue, Aug 1, 2017 at 12:24 PM Julian Hyde wrote:

> I have problems with a couple of the axioms: that a SQL object is
> either a table or a stream, but not both; and that a query is bounded
> if and only if it contains no unbounded streams.
>
> I don't have problems with other axioms, such as that a query is either
> bounded or unbounded. And I haven't looked in detail at triggering
> semantics; I don't think there will be major issues, but let's clear
> up the 2 problems above first.
>
> I have added a section "Julian’s thoughts on the fundamentals" to the
> end of the document.
>
> Julian
>
>
> On Tue, Aug 1, 2017 at 6:40 AM, Fabian Hueske wrote:
> > As promised, I went over the document and made some comments.
> > I also added a bit of information about the current SQL support in Flink
> > and its internals.
> >
> > Thanks, Fabian
> >
> > 2017-07-30 13:22 GMT+02:00 Shaoxuan Wang:
> >
> >> Hi Tyler,
> >> Thanks for putting all the effort into a doc. It is really well written
> >> and organized.
> >> I like most of it. The major concern I have is about the "explicit
> >> trigger". I left a few comments on this and would like to know what
> >> the others think about it.
> >>
> >> Regards,
> >> Shaoxuan
> >>
> >> On Sun, Jul 30, 2017 at 4:43 PM, Fabian Hueske wrote:
> >>
> >> > Thanks for the great write up!
> >> >
> >> > I think this is a very good starting point for a detailed discussion about
> >> > features, syntax and semantics of streaming SQL.
> >> > I'll comment on the document in the next days and describe Flink's current
> >> > status, our approaches (or planned approaches) and ask a couple of
> >> > questions.
> >> >
> >> > Thanks, Fabian
> >> >
> >> > 2017-07-28 3:05 GMT+02:00 Julian Hyde:
> >> >
> >> > > Tyler,
> >> > >
> >> > > Thanks for this. I am reading the document thoroughly and will give my
> >> > > feedback in a day or two.
> >> > >
> >> > > Julian
> >> > >
> >> > > > On Jul 25, 2017, at 12:54 PM, Pramod Immaneni <pramod@datatorrent.com> wrote:
> >> > > >
> >> > > > Thanks for the invitation Tyler. I am sure folks who worked on the
> >> > > > Calcite integration and others would be interested.
> >> > > >
> >> > > > On Tue, Jul 25, 2017 at 12:12 PM, Tyler Akidau wrote:
> >> > > >
> >> > > >> +dev@apex.apache.org, since I'm told Apex has a Calcite integration as
> >> > > >> well. If anyone on the Apex side wants to join in on the fun, your input
> >> > > >> would be welcomed!
> >> > > >>
> >> > > >> -Tyler
> >> > > >>
> >> > > >>
> >> > > >> On Mon, Jul 24, 2017 at 4:34 PM Tyler Akidau wrote:
> >> > > >>
> >> > > >>> Hello Flink, Calcite, and Beam dev lists!
> >> > > >>>
> >> > > >>> Linked below is the second document I promised way back in April regarding
> >> > > >>> a collaborative spec for streaming SQL in Beam/Calcite/Flink (& apologies
> >> > > >>> for the delay; I thought I was nearly done a while back and then temporal
> >> > > >>> joins expanded to something much larger than expected).
> >> > > >>>
> >> > > >>> To repeat what it says in the doc, my hope is that it can serve various
> >> > > >>> purposes over its lifetime:
> >> > > >>>
> >> > > >>> - A discussion ground for ironing out any remaining features necessary
> >> > > >>>   for supporting robust streaming semantics in Calcite SQL.
> >> > > >>>
> >> > > >>> - A rough, high-level source of truth for tracking efforts underway in
> >> > > >>>   support of this, currently spanning the Calcite, Flink, and Beam projects.
> >> > > >>>
> >> > > >>> - A written specification of the changes that were made, for the sake
> >> > > >>>   of understanding the delta after the fact.
> >> > > >>>
> >> > > >>> The first and third points are, IMO, the most important.
> >> > > >>> AFAIK, there are
> >> > > >>> a few features missing still that need to be defined (e.g., trigger
> >> > > >>> equivalents via EMIT, robust temporal join support). I'm also proposing a
> >> > > >>> clear distinction of streams and tables, which I think is important, but
> >> > > >>> which I believe is not the approach most folks have been taking in this
> >> > > >>> area. Sorting out these open issues and then having a concise record of the
> >> > > >>> solutions adopted will be important for providing a solid streaming
> >> > > >>> experience and teaching folks how to use it.
> >> > > >>>
> >> > > >>> At any rate, I would much appreciate it if anyone with an interest in this
> >> > > >>> stuff could please take a look and add comments/suggestions/references to
> >> > > >>> related work in flight/etc as appropriate. For now please use
> >> > > >>> comments/suggestions, but if you really want to dive in with edit access,
> >> > > >>> let me know.
> >> > > >>>
> >> > > >>> The doc: http://s.apache.org/streaming-sql-spec
> >> > > >>>
> >> > > >>> -Tyler
> >> > > >>>

--001a114e56fe7e6b7c0555bf3966--