Return-Path: X-Original-To: apmail-incubator-general-archive@www.apache.org Delivered-To: apmail-incubator-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DBEC718413 for ; Sun, 20 Dec 2015 02:49:35 +0000 (UTC) Received: (qmail 58917 invoked by uid 500); 20 Dec 2015 02:49:35 -0000 Delivered-To: apmail-incubator-general-archive@incubator.apache.org Received: (qmail 58688 invoked by uid 500); 20 Dec 2015 02:49:35 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 58676 invoked by uid 99); 20 Dec 2015 02:49:34 -0000 Received: from Unknown (HELO spamd3-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 20 Dec 2015 02:49:34 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd3-us-west.apache.org (ASF Mail Server at spamd3-us-west.apache.org) with ESMTP id 569E31804BD for ; Sun, 20 Dec 2015 02:49:34 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd3-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 4.151 X-Spam-Level: **** X-Spam-Status: No, score=4.151 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=3, KAM_LIVE=1, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd3-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd3-us-west.apache.org [10.40.0.10]) (amavisd-new, port 10024) with ESMTP id uEuzJbKd6okX for ; Sun, 20 Dec 2015 02:49:21 +0000 (UTC) Received: from mail-ig0-f174.google.com (mail-ig0-f174.google.com [209.85.213.174]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 6F15C42BD7 for ; Sun, 20 Dec 2015 02:49:21 +0000 (UTC) Received: by mail-ig0-f174.google.com with SMTP id to18so16259811igc.0 for ; Sat, 19 Dec 2015 18:49:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=Hlr2jofs3Sj+2eGUZGH+kRJ+HZVOTwbX5OeE5pzp/EQ=; b=ueCXRhbB+Z9Aw8LWNJpn8XDNdIJ2fjT+7T9fsW1MJCVrIsBn4nJ3tyunHiuR8kp0Ph bi6+9YRm4mzmHT1WFEYeh5OxhbrWjwa+CoIzZtYWz0COQrFE5qqf5gsM6Te3nEbmbm5Z SNgaenB/bkFr8OBrwXwyI1MQbRGLV1BTsAXDXrWBwq5Dqig0stYZi0CoxJnIrebOWI83 2WyEiJ6GHerjgrARkKAQkP4nrM5JmXdK9BFHHnOZbS4KAXCQORPc9O3OI3164n4D8a8G Z3h3tNRJIA6DHBL+xB8ro8NEvw7Lfv41RSpD2JbY6/lqtXZbSWj2m+kJ/hr58Sda0MlM EX1Q== MIME-Version: 1.0 X-Received: by 10.50.79.196 with SMTP id l4mr11126178igx.59.1450579755061; Sat, 19 Dec 2015 18:49:15 -0800 (PST) Received: by 10.64.145.164 with HTTP; Sat, 19 Dec 2015 18:49:14 -0800 (PST) In-Reply-To: References: Date: Sat, 19 Dec 2015 18:49:14 -0800 Message-ID: Subject: Re: [RESULT][VOTE] Accept Torii into Apache Incubator From: Luciano Resende To: "general@incubator.apache.org" Content-Type: multipart/alternative; boundary=089e01183f4804dacd05274b6b60 --089e01183f4804dacd05274b6b60 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable On Wed, Dec 2, 2015 at 5:24 PM, Luciano Resende wrote: > Vote has passed with 7 binding +1 from : Hitesh Shah, Luciano Resende, > Sam Ruby, Chris A Mattmann, Jim Jagielski, Reynold Xin, Steve Loughran > and 2 non-binding +1 from Sree V, Luke Han. > > There is an issue with the project name, see discussion at [1]. We will b= e > identifying a new name for the project before we start creating the proje= ct > infrastructure. I will update the vote thread with the project new name f= or > historical reasons. > > [1] > https://www.mail-archive.com/general@incubator.apache.org/msg52224.html > > Thank you. > > Just an Update on the vote thread, we have chosen the name Toree which currently seems available (see [1] for some more details) . I'll start working on the podling infrastructure setup soon. Thank You [1] https://www.mail-archive.com/general%40incubator.apache.org/msg52527.html > On Thu, Nov 26, 2015 at 7:33 AM, Luciano Resende > wrote: > >> After initial discussion (under the name Spark-Kernel), please vote on >> the acceptance of Torii Project for incubation at the Apache Incubator. >> The full proposal is >> available at the end of this message and on the wiki at : >> >> https://wiki.apache.org/incubator/ToriiProposal >> >> Please cast your votes: >> >> [ ] +1, bring Torii into Incubator >> [ ] +0, I don't care either way >> [ ] -1, do not bring Torii into Incubator, because... >> >> Due to long weekend holiday in US, I will leave the vote open until >> December 1st. >> >> >> =3D Torii =3D >> >> =3D=3D Abstract =3D=3D >> Torii provides applications with a mechanism to interactively and >> remotely access Apache Spark. >> >> =3D=3D Proposal =3D=3D >> Torii enables interactive applications to access Apache Spark clusters. >> More specifically: >> * Applications can send code-snippets and libraries for execution by >> Spark >> * Applications can be deployed separately from Spark clusters and >> communicate with the Torii using the provided Torii client >> * Execution results and streaming data can be sent back to calling >> applications >> * Applications no longer have to be network connected to the workers on >> a Spark cluster because the Torii acts as each application=E2=80=99s pro= xy >> * Work has started on enabling Torii to support languages in addition t= o >> Scala, namely Python (with PySpark), R (with SparkR), and SQL (with >> SparkSQL) >> >> =3D=3D Background & Rationale =3D=3D >> Apache Spark provides applications with a fast and general purpose >> distributed computing engine that supports static and streaming data, >> tabular and graph representations of data, and an extensive library of >> machine learning libraries. Consequently, a wide variety of applications >> will be written for Spark and there will be interactive applications tha= t >> require relatively frequent function evaluations, and batch-oriented >> applications that require one-shot or only occasional evaluation. >> >> Apache Spark provides two mechanisms for applications to connect with >> Spark. The primary mechanism launches applications on Spark clusters usi= ng >> spark-submit ( >> http://spark.apache.org/docs/latest/submitting-applications.html); this >> requires developers to bundle their application code plus any dependenci= es >> into JAR files, and then submit them to Spark. A second mechanism is an >> ODBC/JDBC API ( >> http://spark.apache.org/docs/latest/sql-programming-guide.html#distribut= ed-sql-engine) >> which enables applications to issue SQL queries against SparkSQL. >> >> Our experience when developing interactive applications, such as analyti= c >> applications integrated with Notebooks, to run against Spark was that th= e >> spark-submit mechanism was overly cumbersome and slow (requiring JAR >> creation and forking processes to run spark-submit), and the SQL interfa= ce >> was too limiting and did not offer easy access to components other than >> SparkSQL, such as streaming. The most promising mechanism provided by >> Apache Spark was the command-line shell ( >> http://spark.apache.org/docs/latest/programming-guide.html#using-the-she= ll) >> which enabled us to execute code snippets and dynamically control the ta= sks >> submitted to a Spark cluster. Spark does not provide the command-line >> shell as a consumable service but it provided us with the starting point >> from which we developed Torii. >> >> >> =3D=3D Current Status =3D=3D >> Torii was first developed by a small team working on an internal-IBM >> Spark-related project in July 2014. In recognition of its likely general >> utility to Spark users and developers, in November 2014 the Torii projec= t >> was moved to GitHub and made available under the Apache License V2. >> >> =3D=3D Meritocracy =3D=3D >> The current developers are familiar with the meritocratic open source >> development process at Apache. As the project has gathered interest at >> GitHub the developers have actively started a process to invite addition= al >> developers into the project, and we have at least one new developer who = is >> ready to contribute code to the project. >> >> =3D=3D Community =3D=3D >> We started building a community around Torii project when we moved it to >> GitHub about one year ago. Since then we have grown to about 70 people, = and >> there are regular requests and suggestions from the community. We believ= e >> that providing Apache Spark application developers with a general-purpos= e >> and interactive API holds a lot of community potential, especially >> considering possible tie-in=E2=80=99s with Notebooks and data science co= mmunity. >> >> =3D=3D Core Developers =3D=3D >> The core developers of the project are currently all from IBM, from the >> IBM Emerging Technology team and from IBM=E2=80=99s recently formed Spar= k >> Technology Center. >> >> =3D=3D Alignment =3D=3D >> Apache, as the home of Apache Spark, is the most natural home for the >> Torii project because it was designed to work with Apache Spark and to >> provide capabilities for interactive applications and data science tools >> not provided by Spark itself. >> >> The Torii also has an affinity with Jupyter (jupyter.org) because it >> uses the Jupyter protocol for communications, and so Jupyter Notebooks c= an >> directly use the Torii as a kernel for communicating with Apache Spark. >> However, we believe that the Torii provides a general-purpose mechanism >> enabling a wider variety of applications than just Notebooks to access >> Spark, and so the Torii=E2=80=99s greatest affinity is with Apache and A= pache >> Spark. >> >> =3D=3D Known Risks =3D=3D >> >> =3D=3D=3D Orphaned products =3D=3D=3D >> We believe the Torii project has a low-risk of abandonment due to >> interest in its continuing existence from several parties. More >> specifically, the Torii provides a capability that is not provided by >> Apache Spark today but it enables a wider range of applications to lever= age >> Spark. For example, IBM uses (and is considering) the Torii in several >> offerings including its IBM Analytics for Apache Spark product in the >> Bluemix Cloud. There are also a couple of other commercial users who are >> using or considering its use in their offerings. Furthermore, Jupyter >> Notebooks are used by data scientists and Spark is gaining popularity as= an >> analytic engine for them. Jupyter Notebooks are very easily enabled with >> the Torii and so there is another constituency for it. >> >> =3D=3D=3D Inexperience with Open Source =3D=3D=3D >> The Torii project has been running as an open-source project (albeit wit= h >> only IBM committers) for the past several months. The project has an act= ive >> issue tracker and due to the interest indicated by the nature and volume= of >> requests and comments, the team has publicly stated it is beginning to >> build a process so they can accept third-party contributions to the proj= ect. >> >> =3D=3D=3D Relationships with Other Apache Products =3D=3D=3D >> The Torii has a clear affinity with the Apache Spark project because it >> is designed to provide capabilities for interactive applications and da= ta >> science tools not provided by Spark itself. The Torii can be a back-end = for >> the Zeppelin project currently incubating at Apache. There is interest f= rom >> the Torii community to develop this capability and an experimental branc= h >> has been started. >> >> =3D=3D=3D Homogeneous Developers =3D=3D=3D >> The current group of developers working on Torii are all from IBM >> although the group is in the process of expanding its membership to incl= ude >> members of the GitHub community who are not from IBM and who have been >> active in the Torii community in GutHub. >> >> =3D=3D=3D Reliance on Salaried Developers =3D=3D=3D >> The initial committers are full-time employees at IBM although not all >> work on the project full-time. >> >> =3D=3D=3D Excessive Fascination with the Apache Brand =3D=3D=3D >> We believe the Torii benefits Apache Spark application developers, and w= e >> are interested in an Apache Torii project to benefit these developers by >> engaging a larger community, facilitating closer ties with the existing >> Spark project, and yes, gaining more visibility for the Torii as a solut= ion. >> >> =3D=3D=3D Documentation =3D=3D=3D >> Comprehensive documentation including =E2=80=9CGetting Started=E2=80=9D,= API >> specifications and a Roadmap are available from the GitHub project, see >> https://github.com/ibm-et/Torii/wiki. >> >> =3D=3D=3D Initial Source =3D=3D=3D >> The source code resides at https://github.com/ibm-et/Torii. >> >> =3D=3D=3D External Dependencies =3D=3D=3D >> The Torii depends upon a number of Apache projects: >> * Spark >> * Hadoop >> * Ivy >> * Commons >> >> The Torii also depends upon a number of other open source projects: >> * ZeroMQ (LGPL with Static Linking Exception, >> http://zeromq.org/area:licensing) >> * Akka (MIT) >> * JOpt Simple (MIT) >> * Spring Framework Core (Apache v2) >> * Play (Apache v2) >> * SLF4J (MIT) >> * Scala >> * Scalatest (Apache v2) >> * Scalactic (Apache v2) >> * Mockito (MIT) >> >> =3D=3D Required Resources =3D=3D >> >> =3D=3D=3D Mailing lists =3D=3D=3D >> >> * private@torii.incubator.apache.org (with moderated subscriptions) >> * commits@torii.incubator.apache.org >> * dev@torii.incubator.apache.org >> >> =3D=3D=3D Git Repository =3D=3D=3D >> >> * https://git-wip-us.apache.org/repos/asf/incubator-torii.git >> >> =3D=3D=3D Issue Tracking =3D=3D=3D >> >> * A JIRA issue tracker: https://issues.apache.org/jira/browse/TORII >> >> =3D=3D Initial Committers =3D=3D >> >> * Leugim Bustelo (lbustelo AT us DOT ibm DOT com) >> * Jakob Odersky (odersky AT us DOT ibm DOT com) >> * Luciano Resende (lresende AT apache DOT org) >> * Robert Senkbeil (rcsenkbe AT us DOT ibm DOT com) >> * Corey Stubbs (cstubbs AT us DOT ibm DOT com) >> * Miao Wang (wangmiao AT us DOT ibm DOT com) >> * Sean Welleck (swelleck AT us DOT ibm DOT com) >> >> =3D=3D=3D Affiliations =3D=3D=3D >> All of the initial committers are employed by IBM. >> >> =3D=3D Sponsors =3D=3D >> >> =3D=3D=3D Champion =3D=3D=3D >> * Sam Ruby (rubys AT apache DOT org) >> >> =3D=3D=3D Nominated Mentors =3D=3D=3D >> * Luciano Resende (lresende AT apache DOT org) >> * Reynold Xin (rxin AT apache DOT org) >> * Hitesh Shah (hitesh AT apache DOT org) >> * Julien Le Dem (julien AT apache DOT org) >> >> =3D=3D=3D Sponsoring Entity =3D=3D=3D >> >> We would like to propose the Apache Incubator to sponsor this project. >> >> >> -- >> Luciano Resende >> http://people.apache.org/~lresende >> http://twitter.com/lresende1975 >> http://lresende.blogspot.com/ >> > > > > -- > Luciano Resende > http://people.apache.org/~lresende > http://twitter.com/lresende1975 > http://lresende.blogspot.com/ > --=20 Luciano Resende http://people.apache.org/~lresende http://twitter.com/lresende1975 http://lresende.blogspot.com/ --089e01183f4804dacd05274b6b60--