Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id EE81B2009C5 for ; Mon, 16 May 2016 21:14:54 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id ED15B160A16; Mon, 16 May 2016 19:14:54 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E3AE1160131 for ; Mon, 16 May 2016 21:14:53 +0200 (CEST) Received: (qmail 25295 invoked by uid 500); 16 May 2016 19:14:52 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 25284 invoked by uid 99); 16 May 2016 19:14:52 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 16 May 2016 19:14:52 +0000 Received: from mail-oi0-f43.google.com (mail-oi0-f43.google.com [209.85.218.43]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 754FC1A008B for ; Mon, 16 May 2016 19:14:52 +0000 (UTC) Received: by mail-oi0-f43.google.com with SMTP id x201so283698089oif.3 for ; Mon, 16 May 2016 12:14:52 -0700 (PDT) X-Gm-Message-State: AOPr4FUK5eb8hIN6LW6GPzpDFcz81dKszEDV/UjnFqyltUTUiwEWnre/8Izt/aqfy0E7IUGzFN+JJ+SscUjLiA== X-Received: by 10.157.56.1 with SMTP id i1mr14391493otc.62.1463426091704; Mon, 16 May 2016 12:14:51 -0700 (PDT) MIME-Version: 1.0 Received: by 10.202.215.70 with HTTP; Mon, 16 May 2016 12:14:12 -0700 (PDT) In-Reply-To: References: From: Andrew Purtell Date: Mon, 16 May 2016 12:14:12 -0700 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [DISCUSS] PredictionIO incubation proposal To: "general@incubator.apache.org" Content-Type: multipart/alternative; boundary=001a114153fa5a184c0532fa70c8 archived-at: Mon, 16 May 2016 19:14:55 -0000 --001a114153fa5a184c0532fa70c8 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable The process for transferring the rights to the name PredictionIO has started at Salesforce. I'm optimistic but can't guarantee an outcome as I am not empowered to make such a decision wearing any hat. I think we can proceed with the proposal using the PredictionIO mark conditionally as the desired podling name. Completing the transfer or finding another mark would be the earliest activity the podling would undertake working through their PODLINGNAMESEARCH ticket. Does that sound reasonable? On Sun, May 15, 2016 at 6:29 PM, John D. Ament wrote: > I just want to confirm, Salesforce plans to transfer the rights to the na= me > "PredictionIO" to the ASF? Or is the podling expected to take a new name? > > John > > On Fri, May 13, 2016 at 4:42 PM Andrew Purtell > wrote: > > > Greetings, > > > > It is my pleasure to > > =E2=80=8B =E2=80=8B > > propose the PredictionIO project for incubation at the Apache Software > > Foundation. > > =E2=80=8B =E2=80=8B > > PredictionIO is a > > =E2=80=8B popular=E2=80=8B > > open > > =E2=80=8B =E2=80=8B > > source Machine Learning Server built on top of a state-of-the-art open > > source stack, including several Apache technologies, that > > =E2=80=8B =E2=80=8B > > enables developers to manage and deploy production-ready predictive > > services for various kinds of machine learning tasks > > =E2=80=8B, with more than 400 production deployments around the world a= nd a > growing > > contributor community. =E2=80=8B > > > > > > The text of the proposal is included below and is also available at > > https://wiki.apache.org/incubator/PredictionIO > > > > Best regards, > > Andrew Purtell > > > > > > =3D PredictionIO Proposal =3D > > > > =3D=3D=3D Abstract =3D=3D=3D > > PredictionIO is an open source Machine Learning Server built on top of > > state-of-the-art open source stack, that enables developers to manage a= nd > > deploy production-ready predictive services for various kinds of machin= e > > learning tasks. > > > > =3D=3D=3D Proposal =3D=3D=3D > > The PredictionIO platform consists of the following components: > > > > * PredictionIO framework - provides the machine learning stack for > > building, evaluating and deploying engines with machine learning > > algorithms. It uses Apache Spark for processing. > > > > * Event Server - the machine learning analytics layer for unifying > events > > from multiple platforms. It can use Apache HBase or any JDBC backends > > as its data store. > > > > The PredictionIO community also maintains a > > =E2=80=8B =E2=80=8B > > Template Gallery, a place to > > publish and download (free or proprietary) engine templates for differe= nt > > types of machine learning applications, and is a complemental part of t= he > > project. At this point we exclude the Template Gallery from the proposa= l, > > as it has a separate set of contributors and we=E2=80=99re not familiar= with an > > Apache approved mechanism to maintain such a gallery. > > > > You can find the Template Gallery at https://templates.prediction.io/ > > > > =3D=3D=3D Background =3D=3D=3D > > PredictionIO was started with a mission to democratize and bring machin= e > > learning to the masses. > > > > Machine learning has traditionally been a luxury for big companies like > > Google, Facebook, and Netflix. There are ML libraries and tools lying > > around the internet but the effort of putting them all together as a > > production-ready infrastructure is a very resource-intensive task that = is > > remotely reachable by individuals or small businesses. > > > > PredictionIO is a production-ready, full stack machine learning system > that > > allows organizations of any scale to quickly deploy machine learning > > capabilities. It comes with official and community-contributed machine > > learning engine templates that are easy to customize. > > > > =3D=3D=3D Rationale =3D=3D=3D > > As usage and number of contributors to PredictionIO has grown bigger an= d > > more diverse, we have sought for an independent framework for the proje= ct > > to keep thriving. We believe the Apache foundation is a great fit. > Joining > > Apache would ensure that tried and true processes and procedures are in > > place for the growing number of organizations interested in contributin= g > > to PredictionIO. PredictionIO is also a good fit for the Apache > foundation. > > PredictionIO was built on top of several Apache projects (HBase, Spark, > > Hadoop). We are familiar with the Apache process and believe that the > > democratic and meritocratic nature of the foundation aligns with the > > project goals. > > > > =3D=3D=3D Initial Goals =3D=3D=3D > > The initial milestones will be to move the existing codebase to Apache > and > > integrate with the Apache development process. Once this is accomplishe= d, > > we plan for incremental development and releases that follow the Apache > > guidelines, as well as growing our developer and user communities. > > > > =3D=3D=3D Current Status =3D=3D=3D > > PredictionIO has undergone nine minor releases and many patches. > > PredictionIO is being used in production by Salesforce.com as well as > many > > other organizations and apps. The PredictionIO codebase is currently > > hosted at GitHub, which will form the basis of the Apache git repositor= y. > > > > =3D=3D=3D=3D Meritocracy =3D=3D=3D=3D > > We plan to invest in supporting a meritocracy. We will discuss the > > requirements in an open forum. We intend to invite additional developer= s > > to participate. We will encourage and monitor community participation s= o > > that privileges can be extended to those that contribute. > > > > =3D=3D=3D=3D Community =3D=3D=3D=3D > > Acceptance into the Apache foundation would bolster the already strong > > user and developer community around PredictionIO. That community includ= es > > many contributors from various other companies, and an active mailing > list > > composed of hundreds of users. > > > > =3D=3D=3D=3D Core Developers =3D=3D=3D=3D > > The core developers of our project are listed in our contributors and > > initial PPMC below. Though many are employed at Salesforce.com, there a= re > > also engineers from ActionML, and independent developers. > > > > =3D=3D=3D Alignment =3D=3D=3D > > The ASF is the natural choice to host the PredictionIO project as its > goal > > is democratizing Machine Learning by making it more easily accessible t= o > > every user/developer. PredictionIO is built on top of several top level > > Apache projects as outlined above. > > > > =3D=3D=3D Known Risks =3D=3D=3D > > > > =3D=3D=3D=3D Orphaned products =3D=3D=3D=3D > > PredictionIO has a solid and growing community. It is deployed on > > production environments by companies of all sizes to run various kinds = of > > predictive engines. > > > > In addition to the community contribution to PredictionIO framework, th= e > > community is also actively contributing new engines to the Template > > Gallery as well as SDKs and documentation for the project. Salesforce i= s > > committed to utilize and advance the PredictionIO code base and support > > its user community. > > > > =3D=3D=3D=3D Inexperience with Open Source =3D=3D=3D=3D > > PredictionIO has existed as a healthy open source project for almost tw= o > > years and is the most starred Scala project on GitHub. All of the > proposed > > committers have contributed to ASF and Linux Foundation open source > > projects. Several current committers on Apache projects and Apache > Members > > are involved in this proposal and intend to provide mentorship. > > > > =3D=3D=3D=3D Homogeneous Developers =3D=3D=3D=3D > > The initial list of committers includes developers from several > > institutions, including Salesforce, ActionML, Channel4, USC as well as > > unaffiliated developers. > > > > =3D=3D=3D=3D Reliance on Salaried Developers =3D=3D=3D=3D > > Like most open source projects, PredictionIO receives substantial suppo= rt > > from salaried developers. PredictionIO development is partially support= ed > > by Salesforce.com, but there are many contributors from various other > > companies, and an active mailing list composed of hundreds of users. We > > will continue our efforts to ensure stewardship of the project to be > > independent of salaried developers by meritocratically promoting those > > contributors to committers. > > > > =3D=3D=3D=3D Relationships with Other Apache Product =3D=3D=3D=3D > > PredictionIO relies heavily on top level apache projects such as Apache > > Spark, HBase and Hadoop. However it brings a distinguished functionalit= y, > > rather than just an abstraction - Machine Learning in a plug-and-play > > fashion. > > > > Compared to Apache Mahout, which focuses on the development of a wide > > variety of algorithms, PredictionIO offers a platform to manage the who= le > > machine learning workflow, including data collection, data preparation, > > modeling, deployment and management of predictive services in productio= n > > environments. > > > > =3D=3D=3D=3D An Excessive Fascination with the Apache Brand =3D=3D=3D= =3D > > PredictionIO is already a widely known open source project. This propos= al > > is not for the purpose of generating publicity. Rather, the primary > > benefits to joining Apache are those outlined in the Rationale section. > > > > =3D=3D=3D Documentation =3D=3D=3D > > PredictionIO boasts rich and live documentation, included in the code > repo > > (docs/manual directory), is built with Middleman, and publicly hosted a= t > > https://docs.prediction.io > > > > =3D=3D=3D Initial Source and Intellectual Property Submission Plan =3D= =3D=3D > > Currently, the PredictionIO codebase is distributed under the Apache 2.= 0 > > License and hosted on GitHub: > https://github.com/PredictionIO/PredictionIO > > > > =3D=3D=3D External Dependencies =3D=3D=3D > > PredictionIO has the following external dependencies: > > * Apache Hadoop 2.4.0 (optional, required only if YARN and HDFS are > > needed) > > * Apache Spark 1.3.0 for Hadoop 2.4 > > * Java SE Development Kit 8 > > * and one of the following sets: > > =E2=80=8B =E2=80=8B > > * PostgreSQL 9.1 > > > > =E2=80=8B =E2=80=8B > > or > > > > =E2=80=8B =E2=80=8B > > * MySQL 5.1 > > =E2=80=8B =E2=80=8B > > or > > > > =E2=80=8B =E2=80=8B > > * Apache HBase 0.98.6 > > > > =E2=80=8B =E2=80=8B > > * Elasticsearch 1.4.0 > > > > Upon acceptance to the incubator, we would begin a thorough analysis of > > all transitive dependencies to verify this information and introduce > > license checking into the build and release process by integrating with > > Apache RAT. > > > > =3D=3D=3D Cryptography =3D=3D=3D > > PredictionIO does not include cryptographic code. We utilize standard > > JCE and JSSE APIs provided by the Java Runtime Environment. > > > > =3D=3D=3D Required Resources =3D=3D=3D > > We request that following resources be created for the project to use > > > > =3D=3D=3D=3D Mailing lists =3D=3D=3D=3D > > > > predictionio-private@incubator.apache.org (with moderated subscriptions= ) > > > > predictionio-dev > > > > predictionio-user > > > > predictionio-commits > > > > We will migrate the existing PredictionIO mailing lists. > > > > =3D=3D=3D=3D Git repository =3D=3D=3D=3D > > The PredictionIO team would like to use Git for source control, due to > our > > current use of GitHub. > > > > git://git.apache.org/incubator-predictionio > > > > =3D=3D=3D=3D Documentation =3D=3D=3D=3D > > https://predictionio.incubator.apache.org/docs/ > > > > =3D=3D=3D=3D JIRA instance =3D=3D=3D=3D > > PredictionIO currently uses the GitHub issue tracking system associated > > with its repository: https://github.com/PredictionIO/PredictionIO/issue= s > . > > We will migrate to Apache JIRA. > > > > JIRA PREDICTIONIO > > https://issues.apache.org/jira/browse/PREDICTIONIO > > > > =3D=3D=3D=3D Other Resources =3D=3D=3D=3D > > * TravisCI for builds and test running. > > > > * PredictionIO's documentation, included in the code repo (docs/manual > > directory), is built with Middleman and publicly hosted > > https://docs.prediction.io > > > > * A blog to drive adoption and excitement at https://blog.prediction.io > > > > =3D=3D=3D Initial Committers =3D=3D=3D > > > > * Pat Ferrell > > > > * Tamas Jambor > > > > * Justin Yip > > > > * Xusen Yin > > > > * Lee Moon Soo > > > > * Donald Szeto > > > > * Kenneth Chan > > > > * Tom Chan > > > > * Simon Chan > > > > * Marco Vivero > > > > * Matthew Tovbin > > > > * Yevgeny Khodorkovsky > > > > * Felipe Oliveira > > > > * Vitaly Gordon > > > > =3D=3D=3D Affiliations =3D=3D=3D > > > > * Pat Ferrell - ActionML > > > > * Tamas Jambor - Channel4 > > > > * Justin Yip - independent > > > > * Xusen Yin - USC > > > > * Lee Moon Soo - NFLabs > > > > * Donald Szeto - Salesforce > > > > * Kenneth Chan - Salesforce > > > > * Tom Chan - Salesforce > > > > * Simon Chan - Salesforce > > > > * Marco Vivero - Salesforce > > > > * Matthew Tovbin - Salesforce > > > > * Yevgeny Khodorkovsky - Salesforce > > > > * Felipe Oliveira - Salesforce > > > > * Vitaly Gordon - Salesforce > > > > =3D=3D=3D Sponsors =3D=3D=3D > > > > =3D=3D=3D=3D Champion =3D=3D=3D=3D > > > > Andrew Purtell > > > > =3D=3D=3D=3D Nominated Mentors =3D=3D=3D=3D > > > > * Andrew Purtell > > > > * James Taylor > > > > * Lars Hofhansl > > > > * Suneel Marthi > > > > * Xiangrui Meng > > > > * Luciano Resende > > > > =3D=3D=3D=3D Sponsoring Entity =3D=3D=3D=3D > > > > Apache Incubator PMC > > > --=20 Best regards, - Andy Problems worthy of attack prove their worth by hitting back. - Piet Hein (via Tom White) --001a114153fa5a184c0532fa70c8--