Return-Path: X-Original-To: apmail-incubator-general-archive@www.apache.org Delivered-To: apmail-incubator-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 305A919086 for ; Fri, 25 Mar 2016 16:27:38 +0000 (UTC) Received: (qmail 18990 invoked by uid 500); 25 Mar 2016 16:27:37 -0000 Delivered-To: apmail-incubator-general-archive@incubator.apache.org Received: (qmail 18784 invoked by uid 500); 25 Mar 2016 16:27:37 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 18770 invoked by uid 99); 25 Mar 2016 16:27:36 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 25 Mar 2016 16:27:36 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 88008C059B for ; Fri, 25 Mar 2016 16:27:36 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.798 X-Spam-Level: X-Spam-Status: No, score=0.798 tagged_above=-999 required=6.31 tests=[FSL_HELO_BARE_IP_2=1.499, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id J9d7MmemgWcb for ; Fri, 25 Mar 2016 16:27:33 +0000 (UTC) Received: from relayvx12c.securemail.intermedia.net (relayvx12c.securemail.intermedia.net [64.78.52.187]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 0F66A5F479 for ; Fri, 25 Mar 2016 16:27:32 +0000 (UTC) Received: from securemail.intermedia.net (localhost [127.0.0.1]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by emg-ca-1-2.localdomain (Postfix) with ESMTPS id 9410C53E56 for ; Fri, 25 Mar 2016 09:27:25 -0700 (PDT) Subject: Re: [VOTE] Accept Airflow into the Incubator MIME-Version: 1.0 x-echoworx-msg-id: c49fa4b7-7cec-4267-b9c9-264d31fe2f30 x-echoworx-emg-received: Fri, 25 Mar 2016 09:27:25.549 -0700 x-echoworx-message-code-hashed: f35b5a6d52f905682dd72aaca49e703754a9b08f2f251605a38510ec945de15f x-echoworx-action: delivered Received: from 10.254.155.17 ([10.254.155.17]) by emg-ca-1-2 (JAMES SMTP Server 2.3.2) with SMTP ID 630 for ; Fri, 25 Mar 2016 09:27:25 -0700 (PDT) Received: from MBX080-W4-CO-1.exch080.serverpod.net (unknown [10.224.117.101]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by emg-ca-1-2.localdomain (Postfix) with ESMTPS id 5040F53E56 for ; Fri, 25 Mar 2016 09:27:25 -0700 (PDT) Received: from MBX080-W4-CO-2.exch080.serverpod.net (10.224.117.102) by MBX080-W4-CO-1.exch080.serverpod.net (10.224.117.101) with Microsoft SMTP Server (TLS) id 15.0.1130.7; Fri, 25 Mar 2016 09:27:24 -0700 Received: from MBX080-W4-CO-2.exch080.serverpod.net ([10.224.117.102]) by mbx080-w4-co-2.exch080.serverpod.net ([10.224.117.102]) with mapi id 15.00.1130.005; Fri, 25 Mar 2016 09:27:24 -0700 From: Chris Nauroth To: "general@incubator.apache.org" Thread-Topic: [VOTE] Accept Airflow into the Incubator Thread-Index: AQHRhkJ0mHQ2Jecwbk2Qf0BtK6Dlt59qaqOA Date: Fri, 25 Mar 2016 16:27:23 +0000 Message-ID: References: In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-exchange-messagesentrepresentingtype: 1 x-ms-exchange-transport-fromentityheader: Hosted x-originating-ip: [50.181.140.32] x-source-routing-agent: Processed Content-Type: text/plain; charset="Windows-1252" Content-ID: <79218973705F7540A3AED09ACA1453FD@exch080.serverpod.net> Content-Transfer-Encoding: quoted-printable +1 (binding) --Chris Nauroth On 3/24/16, 8:00 PM, "Siddharth Anand" wrote: >Following the discussion earlier: > https://s.apache.org/AirflowDiscussion > >I would like to call a VOTE for accepting Airflow as a new incubator >project. > >The proposal is available at: >https://wiki.apache.org/incubator/AirflowProposal > >The proposal is also included at the bottom of this email. > >Vote is open until at least Tues, 29 March 2016, 23:59:00 PDT >[ ] +1 accept Airflow into the Apache Incubator >[ ] =B10 >[ ] -1 because... > >+1 (non-binding) > >Thanks, >-s (Sid) > > >=3D=3D Abstract =3D=3D > >Airflow is a workflow automation and scheduling system that can be >used to author and manage data pipelines. > >=3D=3D Proposal =3D=3D > >Airflow provides a system for authoring and managing workflows a.k.a. >data pipelines a.k.a. DAGs (Directed Acyclic Graphs). The developer >authors DAGs in Python using an Airflow-provided framework. He/She >then executes the DAG using Airflow=B9s scheduler or registers the DAG >for event-based execution. A web-based UI provides the developer with >a range of options for managing and viewing his/her data pipelines. >Background > >Airflow was developed at Airbnb to enable easier authorship and >management of DAGs than were possible with existing solutions such as >Oozie and Azkaban. For starters, both Oozie and Azkaban rely on one or >more XML or property files to be bundled together to define a >workflow. This separation of code and config can present a challenge >to understanding the DAG - in Azkaban, a DAG=B9s structure is reflected >by its file system tree and one can find himself/herself traversing >the file system when inspecting or changing the structure of the DAG. >Airflow workflows, on the other hand, are simply and elegantly defined >in Python code, often a single file. Airflow merges the powerful >Web-based management aspects of projects like Azkaban and Oozie with >the simplicity and elegance of defining workflows in Python. Airflow, >less than a year old in terms of its Open Source launch, is currently >used in production environments in more than 30 companies and boasts >an active contributor list of more than 100 developers, the vast >majority of which (>95%) are outside of Airbnb. > >We would like to share it with the ASF and begin developing a >community of developers and users within Apache. > >=3D=3D Rationale =3D=3D > >Many organizations (>30) already benefit from running Airflow to >manage data pipelines. Our 100+ contributors continue to provide >integrations with 3rd party systems through the implementation of new >hooks and operators, both of which are used in defining the tasks that >compose workflows. > >=3D=3D Current Status =3D=3D > >=3D=3D=3D Meritocracy =3D=3D=3D > >Our intent with this incubator proposal is to start building a diverse >developer community around Airflow following the Apache meritocracy >model. Since Airflow was open-sourced in mid-2015, we have had fast >adoption and contributions by multiple organizations the world over. >We plan to continue to support new contributors and we will work to >actively promote those who contribute significantly to the project to >committers. > >=3D=3D=3D Community =3D=3D=3D > >Airflow is currently being used in over 30 companies. We hope to >extend our contributor base significantly and invite all those who are >interested in building large-scale distributed systems to participate. > >=3D=3D=3D Core Developers =3D=3D=3D > >Airflow is currently being developed by four engineers: Maxime >Beauchemin, Siddharth Anand, Bolke de Bruin, and Chris Riccomini. >Chris is a member of the Apache Samza PMC and a contributor to various >Apache projects, including Apache Kafka and Apache YARN. Maxime, >Siddharth, and Bolke have contributed to Airflow. > >=3D=3D=3D Alignment =3D=3D=3D >The ASF is the natural choice to host the Airflow project as its goal >of encouraging community-driven open-source projects fits with our >vision for Airflow. > >=3D=3D Known Risks =3D=3D > >=3D=3D=3D Orphaned Products =3D=3D=3D > >The core developers plan to work part time on the project. There is >very little risk of Airflow being abandoned as all of our companies >rely on it. > >=3D=3D=3D Inexperience with Open Source =3D=3D=3D > >All of the core developers have experience with open source >development. Chris is a member of the Apache Samza PMC and a >contributor to various Apache projects, including Apache Kafka and >Apache YARN. Bolke is contributor on multiple open source projects and >a few Apache projects as well, including Apache Hive, Apache Hadoop, >and Apache Ranger. > >=3D=3D=3D Homogeneous Developers =3D=3D=3D > >The current core developers are all from different companies. Our >community of 100 contributors hail from over 30 different companies >from across the world. > >=3D=3D=3D Reliance on Salaried Developers =3D=3D=3D > >Currently, the only developer paid to work on this project is Maxime. > >=3D=3D=3D Relationships with Other Apache Products =3D=3D=3D > >Airflow is deeply integrated with Apache products. It currently >provides hooks and operators to enable workflows to leverage Apache >Pig, Apache Hive, Apache Spark, Apache Sqoop, Apache Hadoop, etc=8A We >plan to add support for other Apache projects in the future. > >=3D=3D=3D An Excessive Fascination with the Apache Brand =3D=3D=3D > >While we respect the reputation of the Apache brand and have no doubts >that it will attract contributors and users, our interest is primarily >to give Airflow a solid home as an open source project following an >established development model. We have also given reasons in the >Rationale and Alignment sections. > >=3D=3D Documentation =3D=3D >http://wiki.apache.org/incubator/AirflowProposal > >=3D=3D Initial Source =3D=3D >https://github.com/airbnb/airflow > >=3D=3D Source and Intellectual Property Submission Plan =3D=3D > >As soon as Airflow is approved to join Apache Incubator, Airbnb will >execute a Software Grant Agreement and the source code will be >transitioned onto ASF infrastructure. The code is already licensed >under the Apache Software License, version 2.0. We know of no legal >encumberments that would inhibit the transfer of source code to the >ASF. > >=3D=3D External Dependencies =3D=3D > >The dependencies all have Apache compatible licenses. > > *=20 >[[https://bitbucket.org/zzzeek/alembic/src/9538c3e1a71c946a53f8762e68e94cf >bcb9f932f/LICENSE?fileviewer=3Dfile-view-default|alembic >(MIT)]] > * [[https://github.com/boto/boto/blob/develop/LICENSE|boto (MIT)]] > * [[https://github.com/celery/celery/blob/master/LICENSE|celery (BSD)]] > * [[https://github.com/mher/chartkick.py/blob/master/LICENSE|chartkick >(MIT)]] > *=20 >[[https://github.com/pyca/cryptography/blob/master/LICENSE.APACHE|cryptogr >aphy >(Apache 2.0/BSD)]] > *=20 >[[https://bitbucket.org/ned/coveragepy/src/b74c40b2c107db17f0775be5ec6c44f >5e1cf5cbf/LICENSE.txt?fileviewer=3Dfile-view-default|coverage >(Apache 2.0)]] > *=20 >[[https://github.com/coagulant/coveralls-python/blob/master/LICENCE|covera >lls >(MIT)]] > * [[https://pypi.python.org/pypi/croniter|croniter (MIT)]] > * [[https://github.com/uqfoundation/dill/blob/master/LICENSE|dill (BSD)]] > * [[https://github.com/docker/docker-py/blob/master/LICENSE|docker-py >(Apache 2.0)]] > *=20 >[[https://bitbucket.org/fabian/filechunkio/src/84289d7599a207f575cb28db719 >dd9d44e880208/LICENCE?fileviewer=3Dfile-view-default|filechunkio >(MIT)]] > *=20 >[[https://bitbucket.org/tarek/flake8/src/a209fb69350c572c9b2d7b4b09c7657be >153be5e/LICENSE?fileviewer=3Dfile-view-default|flake8 >(MIT)]] > * [[https://github.com/mitsuhiko/flask/blob/master/LICENSE|flask (BSD)]] > *=20 >[[https://github.com/flask-admin/flask-admin/blob/master/LICENSE|flask-adm >in >(BSD)]] > *=20 >[[https://github.com/thadeusb/flask-cache/blob/master/LICENSE|flask-cache >(BSD)]] > *=20 >[[https://github.com/maxcountryman/flask-login/blob/master/LICENSE|flask-l >ogin >(MIT)]] > * [[https://github.com/mher/flower/blob/master/LICENSE|flower (BSD)]] > *=20 >[[https://github.com/PythonCharmers/python-future/blob/master/LICENSE.txt| >future >(MIT)]] > * [[https://github.com/benoitc/gunicorn/blob/master/LICENSE|gunicorn >(MIT)]] > *=20 >[[https://github.com/youngwookim/hive-thrift-py/blob/master/setup.py|hive- >thrift-py >(Apache 2.0)]] > * [[https://github.com/ipython/ipython/blob/master/COPYING.rst|ipython >(BSD)]] > * [[https://github.com/mitsuhiko/jinja2/blob/master/LICENSE|jinja2 >(BSD)]] > *=20 >[[https://github.com/waylan/Python-Markdown/blob/master/LICENSE.md|markdow >n >(BSD)]] > * [[https://github.com/pydata/pandas/blob/master/LICENSE|pandas (BSD)]] > * [[https://pypi.python.org/pypi/Pygments|pygments (BSD)]] > * pyhive > * pydruid > * PyOpenSSL > * PySmbClient > * python-dateutil > * redis > * requests > * setproctitle > * statsd > * sphinx > * sphinx-argparse > * sphinx_rtd_theme > * Sphinx-PyPI-upload > * sqlalchemy (MIT) > * thrift > * jaydebeapi > * mysqlclient > * unicodecsv > * slackclient > * ldap3 > * Flask-WTF > * lxml > * [[https://github.com/bgamble/pykerberos/blob/master/LICENSE|pykerberos >(Apache 2.0)]] > * [[https://github.com/pyca/bcrypt/blob/master/LICENSE|bcrypt (Apache >2.0)]] > *=20 >[[https://github.com/maxcountryman/flask-bcrypt/blob/master/LICENSE|flask- >bcrypt >(BSD)]] > * [[https://github.com/testing-cabal/mock/blob/master/LICENSE.txt|mock >(BSD)]] > * [[https://github.com/mtth/hdfs/blob/master/LICENSE|hdfs (MIT)]] > >=3D=3D Cryptography =3D=3D > >None > >=3D=3D Required Resources =3D=3D > >=3D=3D=3D Mailing Lists =3D=3D=3D > > * private@airflow.incubator.apache.org (moderated) > * dev@airflow.incubator.apache.org > * commits@airflow.incubator.apache.org > >=3D=3D=3D Subversion Directory =3D=3D=3D > >Git is the preferred source control system: git://git.apache.org/Airflow > >=3D=3D=3D Issue Tracking =3D=3D=3D > >JIRA Airflow (Airflow) > >=3D=3D=3D Other Resources =3D=3D=3D > >The existing code already has unit tests, so we would like a Travis >instance to run them whenever a new patch is submitted. This can be >added after project creation. > >=3D=3D Initial Committers =3D=3D > > * Maxime Beauchemin > * Siddharth Anand > * Chris Riccomini > * Bolke de Bruin > * Arthur Wiedmer > * Dan Davydov > * Jeremiah Lowin > * Patrick Leo Tardif > >=3D=3D Affiliations =3D=3D > > * Maxime Beauchemin (Airbnb) > * Siddharth Anand (Agari) > * Chris Riccomini (WePay) > * Bolke de Bruin (ING) > * Arthur Wiedmer (Airbnb) > * Dan Davydov (Airbnb) > * Jeremiah Lowin (Kokino) > * Patrick Leo Tardif (Airbnb) > >=3D=3D Sponsors =3D=3D > >=3D=3D=3D Champion =3D=3D=3D > >Chris Riccomini (WePay, Apache Samza PMC) > >=3D=3D=3D Nominated Mentors =3D=3D=3D > > * Chris Nauroth (HortonWorks, Apache Hadoop Committer/PMC Member, >Apache ZooKeeper Committer, Apache Software Foundation Member) > * Hitesh Shah (HortonWorks, Apache Hadoop Committer/PMC Member, >Apache Ambari Committer/PMC Member, Apache Tez Committer/PMC Member, >Apache Software Foundation Member) > * Jakob Homan (OfferUp, Apache Hadoop Committer/PMC Member, Apache >Kafka Committer/PMC Member, Apache Samza Committer/PMC Member, Apache >Giraph Committer/PMC Member, Apache Software Foundation Member) > >=3D=3D=3D Sponsoring Entity =3D=3D=3D > >We are requesting the Incubator to sponsor this project. --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org For additional commands, e-mail: general-help@incubator.apache.org