Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 23FAA200BFC for ; Sat, 14 Jan 2017 09:37:28 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 22760160B35; Sat, 14 Jan 2017 08:37:28 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id EDE1C160B28 for ; Sat, 14 Jan 2017 09:37:26 +0100 (CET) Received: (qmail 52247 invoked by uid 500); 14 Jan 2017 08:37:25 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 52235 invoked by uid 99); 14 Jan 2017 08:37:25 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 14 Jan 2017 08:37:25 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 97FB01A023A for ; Sat, 14 Jan 2017 08:37:24 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.399 X-Spam-Level: * X-Spam-Status: No, score=1.399 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_REPLY=1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id WsKKq89PzUYA for ; Sat, 14 Jan 2017 08:37:21 +0000 (UTC) Received: from mail-io0-f180.google.com (mail-io0-f180.google.com [209.85.223.180]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 5CB555F29A for ; Sat, 14 Jan 2017 08:37:21 +0000 (UTC) Received: by mail-io0-f180.google.com with SMTP id j18so60634690ioe.2 for ; Sat, 14 Jan 2017 00:37:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-transfer-encoding; bh=f61SvCLdadrnzbHU7EJh8rUQ3mGxTgMMgXs1CFrzktQ=; b=bA0OW3h2CCJabJjcfbmFIt1n/Vs0CjEWEB0Lw0MjV16mFI0hM6h6mOIYgBhdPUbcKO kUcsYgn3lDKjVRBiPjTEMAWcbs64NMeJajLrqs5HlJ5hybkOeLaq+6iVNz6KmjDtHaxd /O1hgLDvRlTzY0W448E2P7IzlYj1sslNgYN8EOtkL4yg/iivDbBff4K1CMbNJzIKePPC YxiF2urI678g0j69gi1kwJOfg2EkGm6hPWwjrDjuNAFQpkS5YIxgedwZquIZ9NuB5uZ7 tF1NrnlPvBuYM7CS29uMbGMhflcaIJY0SZ63+RzLzesYG8Rfon0AJfUoRJOs3E9LdEv1 Fk5w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-transfer-encoding; bh=f61SvCLdadrnzbHU7EJh8rUQ3mGxTgMMgXs1CFrzktQ=; b=GXPhNmR3XSwu+5Eq8RYdJwSqfiQuG5O1HgHMx80orx5SC+HomwsNdgVpc5EAukSVOC li2HddnYi9tWLO3+pRYLR0/caH4rrQ0LJwkQmfqHfWDG8BHpOlktYO3jwNs5NjnLwwiG XkK9SfRLX0OofG4bRtO4dxC7eZl4ihCNfaVysyiDX1cmDWg9i4HdWUGrMU7He1lJmCtL OQpy4s/r8VVgOqBCiXUjO54xEQAj0x5cr9sdEShJZkKWxmc9pxLz22eOmkNr83+wSGU2 7k1Det+XDChTuGQHalINhXsKKebznVV1Qf+5YABHpK/O6YgMz0Mjmg9K4uZBn5lOOz93 1gWA== X-Gm-Message-State: AIkVDXLUMqfcCB22Yx/iBnuuCaeuiy+fD2AUxusCaizN+H5dP5XoZIMQupTdwZcKNhJ+PXwVUkKB4v/c9aFf3w== X-Received: by 10.107.18.12 with SMTP id a12mr26213924ioj.155.1484383034962; Sat, 14 Jan 2017 00:37:14 -0800 (PST) MIME-Version: 1.0 Received: by 10.36.41.75 with HTTP; Sat, 14 Jan 2017 00:36:54 -0800 (PST) In-Reply-To: References: From: YiZhi Liu Date: Sat, 14 Jan 2017 16:36:54 +0800 Message-ID: Subject: Re: [DISCUSS] Proposing MXNet for the Apache Incubator To: general@incubator.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable archived-at: Sat, 14 Jan 2017 08:37:28 -0000 Confirmed, and please update my affiliation to 'Qihoo 360'. Thanks. 2017-01-14 16:08 GMT+08:00 =E6=A2=81=E5=BE=B7=E6=BE=8E : > Hi, > > I=E2=80=99ve been working on the MXNet-ScalaPkg for a while with Yizhi Li= u > (@javelinjs). > Please sign me up as a committer of MxNet. > > GitHub ID: Ldpe2G > Email: liangdepeng@gmail.com > Affiliations: Sun Yat-sen University > > 2017-01-14 13:49 GMT+08:00 Henri Yandell : > >> Thanks for all the feedback and interested parties :) >> >> My aim is to propose a vote on Monday, unless someone raises an issue >> before then. >> >> Hen >> >> On Thu, Jan 5, 2017 at 9:12 PM, Henri Yandell wrote: >> >> > Hello Incubator, >> > >> > I'd like to propose a new incubator Apache MXNet podling. >> > >> > The existing MXNet project (http://mxnet.io - 1.5 years old, 15 >> > committers, 200 contributors) is very interested in joining Apache. MX= Net >> > is an open-source deep learning framework that allows you to define, >> train, >> > and deploy deep neural networks on a wide array of devices, from cloud >> > infrastructure to mobile devices. >> > >> > The wiki proposal page is located here: >> > >> > https://wiki.apache.org/incubator/MXNetProposal >> > >> > I've included the text below in case anyone wants to focus on parts of= it >> > in a reply. >> > >> > Looking forward to your thoughts, and for lots of interested Apache >> > members to volunteer to mentor the project in addition to Sebastian an= d >> > myself. >> > >> > Currently the list of committers is based on the current active coders= , >> so >> > we're also very interested in hearing from anyone else who is interest= ed >> in >> > working on the project, be they current or future contributor! >> > >> > Thanks, >> > >> > Hen >> > On behalf of the MXNet project >> > >> > --------- >> > >> > =3D MXNet: Apache Incubator Proposal =3D >> > >> > =3D=3D Abstract =3D=3D >> > >> > MXNet is a Flexible and Efficient Library for Deep Learning >> > >> > =3D=3D Proposal =3D=3D >> > >> > MXNet is an open-source deep learning framework that allows you to >> define, >> > train, and deploy deep neural networks on a wide array of devices, fro= m >> > cloud infrastructure to mobile devices. It is highly scalable, allowin= g >> for >> > fast model training, and supports a flexible programming model and >> multiple >> > languages. MXNet allows you to mix symbolic and imperative programming >> > flavors to maximize both efficiency and productivity. MXNet is built o= n a >> > dynamic dependency scheduler that automatically parallelizes both >> symbolic >> > and imperative operations on the fly. A graph optimization layer on to= p >> of >> > that makes symbolic execution fast and memory efficient. The MXNet >> library >> > is portable and lightweight, and it scales to multiple GPUs and multip= le >> > machines. >> > >> > =3D=3D Background =3D=3D >> > >> > Deep learning is a subset of Machine learning and refers to a class of >> > algorithms that use a hierarchical approach with non-linearities to >> > discover and learn representations within data. Deep Learning has >> recently >> > become very popular due to its applicability and advancement of domain= s >> > such as Computer Vision, Speech Recognition, Natural Language >> Understanding >> > and Recommender Systems. With pervasive and cost effective cloud >> computing, >> > large labeled datasets and continued algorithmic innovation, Deep >> Learning >> > has become the one of the most popular classes of algorithms for machi= ne >> > learning practitioners in recent years. >> > >> > =3D=3D Rational =3D=3D >> > >> > The adoption of deep learning is quickly expanding from initial deep >> > domain experts rooted in academia to data scientists and developers >> working >> > to deploy intelligent services and products. Deep learning however has >> many >> > challenges. These include model training time (which can take days to >> > weeks), programmability (not everyone writes Python or C++ and like >> > symbolic programming) and balancing production readiness (support for >> > things like failover) with development flexibility (ability to program >> > different ways, support for new operators and model types) and speed o= f >> > execution (fast and scalable model training). Other frameworks excel = on >> > some but not all of these aspects. >> > >> > >> > =3D=3D Initial Goals =3D=3D >> > >> > MXNet is a fairly established project on GitHub with its first code >> > contribution in April 2015 and roughly 200 contributors. It is used by >> > several large companies and some of the top research institutions on t= he >> > planet. Initial goals would be the following: >> > >> > 1. Move the existing codebase(s) to Apache >> > 1. Integrate with the Apache development process/sign CLAs >> > 1. Ensure all dependencies are compliant with Apache License version = 2.0 >> > 1. Incremental development and releases per Apache guidelines >> > 1. Establish engineering discipline and a predictable release cadence= of >> > high quality releases >> > 1. Expand the community beyond the current base of expert level users >> > 1. Improve usability and the overall developer/user experience >> > 1. Add additional functionality to address newer problem types and >> > algorithms >> > >> > >> > =3D=3D Current Status =3D=3D >> > >> > =3D=3D=3D Meritocracy =3D=3D=3D >> > >> > The MXNet project already operates on meritocratic principles. Today, >> > MXNet has developers worldwide and has accepted multiple major patches >> from >> > a diverse set of contributors within both industry and academia. We wo= uld >> > like to follow ASF meritocratic principles to encourage more developer= s >> to >> > contribute in this project. We know that only active and committed >> > developers from a diverse set of backgrounds can make MXNet a successf= ul >> > project. We are also improving the documentation and code to help new >> > developers get started quickly. >> > >> > =3D=3D=3D Community =3D=3D=3D >> > >> > Acceptance into the Apache foundation would bolster the growing user a= nd >> > developer community around MXNet. That community includes around 200 >> > contributors from academia and industry. The core developers of our >> project >> > are listed in our contributors below and are also represented by logos= on >> > the mxnet.io site including Amazon, Baidu, Carnegie Mellon University, >> > Turi, Intel, NYU, Nvidia, MIT, Microsoft, TuSimple, University of >> Alberta, >> > University of Washington and Wolfram. >> > >> > =3D=3D=3D Core Developers =3D=3D=3D >> > >> > (with GitHub logins) >> > >> > * Tianqi Chen (@tqchen) >> > * Mu Li (@mli) >> > * Junyuan Xie (@piiswrong) >> > * Bing Xu (@antinucleon) >> > * Chiyuan Zhang (@pluskid) >> > * Minjie Wang (@jermainewang) >> > * Naiyan Wang (@winstywang) >> > * Yizhi Liu (@javelinjs) >> > * Tong He (@hetong007) >> > * Qiang Kou (@thirdwing) >> > * Xingjian Shi (@sxjscience) >> > >> > =3D=3D=3D Alignment =3D=3D=3D >> > >> > ASF is already the home of many distributed platforms, e.g., Hadoop, >> Spark >> > and Mahout, each of which targets a different application domain. MXNe= t, >> > being a distributed platform for large-scale deep learning, focuses on >> > another important domain for which there still lacks a scalable, >> > programmable, flexible and super fast open-source platform. The recent >> > success of deep learning models especially for vision and speech >> > recognition tasks has generated interests in both applying existing de= ep >> > learning models and in developing new ones. Thus, an open-source platf= orm >> > for deep learning backed by some of the top industry and academic play= ers >> > will be able to attract a large community of users and developers. MXN= et >> is >> > a complex system needing many iterations of design, implementation and >> > testing. Apache's collaboration framework which encourages active >> > contribution from developers will inevitably help improve the quality = of >> > the system, as shown in the success of Hadoop, Spark, etc. Equally >> > important is the community of users which helps identify real-life >> > applications of deep learning, and helps to evaluate the system's >> > performance and ease-of-use. We hope to leverage ASF for coordinating = and >> > promoting both communities, and in return benefit the communities with >> > another useful tool. >> > >> > =3D=3D Known Risks =3D=3D >> > >> > =3D=3D=3D Orphaned products =3D=3D=3D >> > >> > Given the current level of investment in MXNet and the stakeholders us= ing >> > it - the risk of the project being abandoned is minimal. Amazon, for >> > example, is in active development to use MXNet in many of its services >> and >> > many large corporations use it in their production applications. >> > >> > =3D=3D=3D Inexperience with Open Source =3D=3D=3D >> > >> > MXNet has existed as a healthy open source project for more than a yea= r. >> > During that time, the project has attracted 200+ contributors. >> > >> > =3D=3D=3D Homogenous Developers =3D=3D=3D >> > >> > The initial list of committers and contributors includes developers fr= om >> > several institutions and industry participants (see above). >> > >> > =3D=3D=3D Reliance on Salaried Developers =3D=3D=3D >> > >> > Like most open source projects, MXNet receives a substantial support f= rom >> > salaried developers. A large fraction of MXNet development is supporte= d >> by >> > graduate students at various universities in the course of research >> degrees >> > - this is more a =E2=80=9Cvolunteer=E2=80=9D relationship, since in mo= st cases students >> > contribute vastly more than is necessary to immediately support resear= ch. >> > In addition, those working from within corporations are devoting >> > significant time and effort in the project - and these come from sever= al >> > organizations. >> > >> > =3D=3D=3D A Excessive Fascination with the Apache Brand =3D=3D=3D >> > >> > We choose Apache not for publicity. We have two purposes. First, we ho= pe >> > that Apache's known best-practices for managing a mature open source >> > project can help guide us. For example, we are feeling the growing pa= ins >> > of a successful open source project as we attempt a major refactor of = the >> > internals while customers are using the system in production. We seek >> > guidance in communicating breaking API changes and version revisions. >> > Also, as our involvement from major corporations increases, we want to >> > assure our users that MXNet will stay open and not favor any particula= r >> > platform or environment. These are some examples of the know-how and >> > discipline we're hoping Apache can bring to our project. >> > >> > Second, we want to leverage Apache's reputation to recruit more >> developers >> > to create a diverse community. >> > >> > =3D=3D=3D Relationship with Other Apache Products =3D=3D=3D >> > >> > Apache Mahout and Apache Spark's MLlib are general machine learning >> > systems. Deep learning algorithms can thus be implemented on these two >> > platforms as well. However, in practice, the overlap will be minimal. >> Deep >> > learning is so computationally intensive that it often requires >> specialized >> > GPU hardware to accomplish tasks of meaningful size. Making efficient >> use >> > of GPU hardware is complex because the hardware is so fast that the >> > supporting systems around it must be carefully optimized to keep the G= PU >> > cores busy. Extending this capability to distributed multi-GPU and >> > multi-host environments requires great care. This is a critical >> > differentiator between MXNet and existing Apache machine learning >> systems. >> > >> > Mahout and Spark ML-LIB follow models where their nodes run >> synchronously. >> > This is the fundamental difference to MXNet who follows the parameter >> > server framework. MXNet can run synchronously or asynchronously. In >> > addition, MXNet has optimizations for training a wide range of deep >> > learning models using a variety of approaches (e.g., model parallelism >> and >> > data parallelism) which makes MXNet much more efficient (near-linear >> > speedup on state of the art models). MXNet also supports both imperati= ve >> > and symbolic approaches providing ease of programming for deep learnin= g >> > algorithms. >> > >> > Other Apache projects that are potentially complimentary: >> > >> > Apache Arrow - read data in Apache Arrow=E2=80=98s internal format fro= m MXNet, >> > that would allow users to run ETL/preprocessing in Spark, save the >> results >> > in Arrow=E2=80=99s format and then run DL algorithms on it. >> > >> > Apache Singa - MXNet and Singa are both deep learning projects, and ca= n >> > benefit from a larger deep learning community at Apache. >> > >> > =3D=3D Documentation =3D=3D >> > >> > Documentation has recently migrated to http://mxnet.io. We continue t= o >> > refine and improve the documentation. >> > >> > =3D=3D Initial Source =3D=3D >> > >> > We currently use Github to maintain our source code, >> > https://github.com/MXNet >> > >> > =3D=3D Source and Intellectual Property Submission Plan =3D=3D >> > >> > MXNet Code is available under Apache License, Version 2.0. We will wor= k >> > with the committers to get CLAs signed and review previous contributio= ns. >> > >> > =3D=3D External Dependencies =3D=3D >> > >> > * required by the core code base: GCC or CLOM, Clang, any BLAS librar= y >> > (ATLAS, OpenBLAS, MKL), dmlc-core, mshadow, ps-lite (which requires >> > lib-zeromq), TBB >> > * required for GPU usage: cudnn, cuda >> > * required for python usage: Python 2/3 >> > * required for R module: R, Rcpp (GPLv2 licensing) >> > * optional for image preparation and preprocessing: opencv >> > * optional dependencies for additional features: torch7, numba, cytho= n >> > (in NNVM branch) >> > >> > Rcpt and lib-zeromq are expected to be licensing discussions. >> > >> > =3D=3D Cryptography =3D=3D >> > >> > Not Applicable >> > >> > =3D=3D Required Resources =3D=3D >> > >> > =3D=3D=3D Mailing Lists =3D=3D=3D >> > >> > There is currently no mailing list. >> > >> > =3D=3D=3D Issue Tracking =3D=3D=3D >> > >> > Currently uses GitHub to track issues. Would like to continue to do so= . >> > >> > =3D=3D Committers and Affiliations =3D=3D >> > >> > * Tianqi Chen (UW) >> > * Mu Li (AWS) >> > * Junyuan Xie (AWS) >> > * Bing Xu (Apple) >> > * Chiyuan Zhang (MIT) >> > * Minjie Wang (UYU) >> > * Naiyan Wang (Tusimple) >> > * Yizhi Liu (Mediav) >> > * Tong He (Simon Fraser University) >> > * Qiang Kou (Indiana U) >> > * Xingjian Shi (HKUST) >> > >> > =3D=3D Sponsors =3D=3D >> > >> > =3D=3D=3D Champion =3D=3D=3D >> > >> > Henri Yandell (bayard at apache.org) >> > >> > =3D=3D=3D Nominated Mentors =3D=3D=3D >> > >> > Sebastian Schelter (ssc@apache.org) >> > >> > >> > =3D=3D=3D Sponsoring Entity =3D=3D=3D >> > >> > We are requesting the Incubator to sponsor this project. >> > >> > >> --=20 Yizhi Liu DMLC member Technical Manager Qihoo 360 Inc, Shanghai, China --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org For additional commands, e-mail: general-help@incubator.apache.org