Return-Path: X-Original-To: apmail-spark-dev-archive@minotaur.apache.org Delivered-To: apmail-spark-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 602EE18B44 for ; Sun, 8 Nov 2015 11:33:52 +0000 (UTC) Received: (qmail 36484 invoked by uid 500); 8 Nov 2015 11:33:50 -0000 Delivered-To: apmail-spark-dev-archive@spark.apache.org Received: (qmail 36374 invoked by uid 500); 8 Nov 2015 11:33:50 -0000 Mailing-List: contact dev-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list dev@spark.apache.org Received: (qmail 36363 invoked by uid 99); 8 Nov 2015 11:33:49 -0000 Received: from Unknown (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 08 Nov 2015 11:33:49 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 552FD180183 for ; Sun, 8 Nov 2015 11:33:49 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.001 X-Spam-Level: X-Spam-Status: No, score=0.001 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=cloudera_com.20150623.gappssmtp.com Received: from mx1-us-west.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id uPAeU8D5FVnt for ; Sun, 8 Nov 2015 11:33:36 +0000 (UTC) Received: from mail-wm0-f52.google.com (mail-wm0-f52.google.com [74.125.82.52]) by mx1-us-west.apache.org (ASF Mail Server at mx1-us-west.apache.org) with ESMTPS id 133B3203BD for ; Sun, 8 Nov 2015 11:33:36 +0000 (UTC) Received: by wmnn186 with SMTP id n186so76195940wmn.1 for ; Sun, 08 Nov 2015 03:33:29 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cloudera_com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-type:content-transfer-encoding; bh=ihHsP6nhKhWcwvwYbOhLQhreBfYmShoQzy5vH4U8smk=; b=kaP4E/lIdEmiRrsEBtq9GhU67yypFDMN3gjTxlBFHbW9NJ1Ft0YsiD0FdzJ2ZB2SLl F90Ir6aupPgscROXtb7PNCPAq4HRwZMBEOW03uB3MzCyqrzxLm96kqrog+w+wxRoYzLY VKiPuXvfTp8/hR6aLm9ZnwqpaSEmXTGpuIbqMRbg7yGi5eJ53C3V24Ti6sT/bwn5nUKT ighpwgkrVWYVRs5sT3fNhWGkvNbt1hC2T5ecxyZEpymfc9CookAkB8dpqpLXGjsSqSs+ zApwv6QZwpv8OAQMPXbcCcA2D0lPAH1+Tef7M02nMCr50dKG58HnZvV3O6Kb1oFc1+S+ 1dfw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc:content-type:content-transfer-encoding; bh=ihHsP6nhKhWcwvwYbOhLQhreBfYmShoQzy5vH4U8smk=; b=YNB6f70pdNEXgZsoI1xHQk1o4qcEJlci/xpm7DeZx+Q6nVexYOkswGXES+s6lvi/dy 8B808BBuLIH0tgX50L82yrt4Q9i7jCF3ol/8ROELU3mmZQ+WukJR3LasSDLxJlprkw16 qYwsTIvVmfL+NWVqo1s/z829xh9/ybhvzvAdgUQmVaJU8yjlujRnJeYpCKziC4dGuald LtD71nItzdFEeex9SPOUi/U3Gl+t2/pjQvAw1J9xBKsF7tcDniRCJezTIAgTt+hnnKJl ZrASoXwR5Igr9sY3CIpdplfgczdMa5mrrj7dzdlm9ksrldgCqOVK6I1XYYHiZIMnmDFy 9h/w== X-Gm-Message-State: ALoCoQmH6YKoZtLsfCnE96n072eNqnfj5RFDVsATQ0VFmwaHvHXWXz8OCdooQmdxfXD5N6mbSCzn X-Received: by 10.28.145.134 with SMTP id t128mr19029808wmd.64.1446982408984; Sun, 08 Nov 2015 03:33:28 -0800 (PST) MIME-Version: 1.0 Received: by 10.27.214.3 with HTTP; Sun, 8 Nov 2015 03:33:09 -0800 (PST) In-Reply-To: References: <563CA2C0.6090702@nanthrax.net> From: Sean Owen Date: Sun, 8 Nov 2015 11:33:09 +0000 Message-ID: Subject: Re: Ready to talk about Spark 2.0? To: Romi Kuntsman Cc: dev Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Major releases can change APIs, yes. Although Flink is pretty similar in broad design and goals, the APIs are quite different in particulars. Speaking for myself, I can't imagine merging them, as it would either mean significantly changing Spark APIs, or making Flink use Spark APIs. It would mean effectively removing one project which seems infeasible. I am not sure of what you're saying the difference is, but I would not describe Spark as primarily for interactive use. Philosophically, I don't think One Big System to Rule Them All is a good goal. One project will never get it all right even within one niche. It's actually valuable to have many takes on important problems. Hence any problem worth solving gets solved 10 times. Just look at all those SQL engines and logging frameworks... On Sun, Nov 8, 2015 at 10:53 AM, Romi Kuntsman wrote: > A major release usually means giving up on some API backward compatibilit= y? > Can this be used as a chance to merge efforts with Apache Flink > (https://flink.apache.org/) and create the one ultimate open source big d= ata > processing system? > Spark currently feels like it was made for interactive use (like Python a= nd > R), and when used others (batch/streaming), it feels like scripted > interactive instead of really a standalone complete app. Maybe some base > concepts may be adapted? > > (I'm not currently a committer, but as a heavy Spark user I'd love to > participate in the discussion of what can/should be in Spark 2.0) > > Romi Kuntsman, Big Data Engineer > http://www.totango.com > > On Fri, Nov 6, 2015 at 2:53 PM, Jean-Baptiste Onofr=C3=A9 > wrote: >> >> Hi Sean, >> >> Happy to see this discussion. >> >> I'm working on PoC to run Camel on Spark Streaming. The purpose is to ha= ve >> an ingestion and integration platform directly running on Spark Streamin= g. >> >> Basically, we would be able to use a Camel Spark DSL like: >> >> >> from("jms:queue:foo").choice().when(predicate).to("job:bar").when(predic= ate).to("hdfs:path").otherwise("file:path").... >> >> Before a formal proposal (I have to do more work there), I'm just >> wondering if such framework can be a new Spark module (Spark Integration= for >> instance, like Spark ML, Spark Stream, etc). >> >> Maybe it could be a good candidate for an addition in a "major" release >> like Spark 2.0. >> >> Just my $0.01 ;) >> >> Regards >> JB >> >> >> On 11/06/2015 01:44 PM, Sean Owen wrote: >>> >>> Since branch-1.6 is cut, I was going to make version 1.7.0 in JIRA. >>> However I've had a few side conversations recently about Spark 2.0, and >>> I know I and others have a number of ideas about it already. >>> >>> I'll go ahead and make 1.7.0, but thought I'd ask, how much other >>> interest is there in starting to plan Spark 2.0? is that even on the >>> table as the next release after 1.6? >>> >>> Sean >> >> >> -- >> Jean-Baptiste Onofr=C3=A9 >> jbonofre@apache.org >> http://blog.nanthrax.net >> Talend - http://www.talend.com >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: dev-unsubscribe@spark.apache.org >> For additional commands, e-mail: dev-help@spark.apache.org >> > --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@spark.apache.org For additional commands, e-mail: dev-help@spark.apache.org