incubator-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Hitesh Shah <hit...@apache.org>
Subject Re: [VOTE] Accept Horn into the ASF incubator
Date Tue, 01 Sep 2015 18:20:22 GMT
+1

— Hitesh

On Aug 31, 2015, at 4:13 PM, Edward J. Yoon <edwardyoon@apache.org> wrote:

> Hi folks,
> 
> I would like to call a vote to accept Horn, as a new Apache Incubator
> project. The full proposal is available at the end of this mail and as
> a https://wiki.apache.org/incubator/HornProposal (the changes from
> initial discussion draft are addition of 2 committers from cldi-kaist
> team and Rich as a mentor).
> 
> The VOTE is open for at least the next 72 hours:
> 
> [ ] +1 Accept Horn into the Apache Incubator
> [ ] 0
> [ ] -1 Do not accept Horn into the Apache Incubator bc ..
> 
> I'd like to get the voting started w/ my own +1
> 
> Thanks!
> 
> == Abstract ==
> 
> Horn [hɔ:n] (korean meaning of Horn is a "Spirit") is a neuron-centric
> programming APIs and execution framework for large-scale deep
> learning, built on top of Apache Hama.
> 
> == Proposal ==
> 
> It is a goal of the Horn to provide a neuron-centric programming APIs
> which allows user to easily define the characteristic of artificial
> neural network model and its structure, and its execution framework
> that leverages the heterogeneous resources on Hama and Hadoop YARN
> cluster.
> 
> == Background ==
> 
> The initial ANN code was developed at Apache Hama project by a
> committer, Yexi Jiang (Facebook) in 2013. The motivation behind this
> work is to build a framework that provides more intuitive programming
> APIs like Google's MapReduce or Pregel and supports applications
> needing large model with huge memory consumptions in distributed way.
> 
> == Rationale ==
> 
> While many of deep learning open source softwares such as Caffe,
> DeepDist, DL4j, and NeuralGiraph are still data or model parallel
> only, we aim to support both data and model parallelism and also
> fault-tolerant system design. The basic idea of data and model
> parallelism is use of the remote parameter server to parallelize model
> creation and distribute training across machines, and the BSP
> framework of Apache Hama for performing asynchronous mini-batches.
> Within single BSP job, each task group works asynchronously using
> region barrier synchronization instead of global barrier
> synchronization, and trains large-scale neural network model using
> assigned data sets in BSP paradigm. Thus, we achieve data and model
> parallelism. This architecture is inspired by Google's !DistBelief
> (Jeff Dean et al, 2012).
> 
> == Initial Goals ==
> 
> Some current goals include:
> 
> * builds new community
> * provides more intuitive programming APIs
> * needs both data and model parallelism support
> * must run natively on both Hama and Hadoop2
> * needs also GPUs and InfiniBand support (FPGAs if possible)
> 
> == Current Status ==
> 
> === Meritocracy ===
> 
> The core developers understand what it means to have a process based
> on meritocracy. We will provide continuous efforts to build an
> environment that supports this, encouraging community members to
> contribute.
> 
> === Community ===
> 
> A small community has formed within the Apache Hama project community,
> universities, and companies such as deep learning startup, instant
> messenger service company, and mobile manufacturing company. And many
> people are interested in the large-scale deep learning platform
> itself. By bringing Horn into Apache, we believe that the community
> will grow even bigger.
> 
> === Core Developers ===
> 
> Edward J. Yoon, Thomas Jungblut, Jungin Lee, and Minho Kim
> 
> == Known Risks ==
> 
> === Orphaned Products ===
> 
> Apache Hama is already a core open source component at Samsung
> Electronics, and Horn also will be used by Samsung Electronics and
> Cldi Inc., and so there is no direct risk for this project to be
> orphaned.
> 
> === Inexperience with Open Source ===
> 
> Some are very new and the others have experience using and/or working
> on Apache open source projects.
> 
> === Homogeneous Developers ===
> 
> The initial committers are from different organizations such as,
> Microsoft, Samsung Electronics, Seoul National University, Technical
> University of Munich, KAIST, LINE plus, and Cldi Inc.
> 
> === Reliance on Salaried Developers ===
> 
> Few will be worked as a full-time open source developer. Other
> developers will also start working on the project in their spare time.
> 
> === Relationships with Other Apache Products ===
> 
> * Horn is based on Apache Hama
> * Apache Zookeeper is used for distributed locking service
> * Natively run on Apache Hadoop and Mesos
> * Horn can be somewhat overlapped with Singa podling (If possible,
> we'd also like to use Singa or Caffe to do the heavy lifting part).
> 
> === An Excessive Fascination with the Apache Brand ===
> 
> Horn itself will hopefully have benefits from Apache, in terms of
> attracting a community and establishing a solid group of developers,
> but also the relation with Apache Hadoop, Zookeeper, and Hama. These
> are the main reasons for us to send this proposal.
> 
> == Documentation ==
> 
> Initial plan about Horn can be found at
> http://blog.udanax.org/2015/06/googles-distbelief-clone-project-on.html
> 
> == Initial Source ==
> 
> The initial source code has been release as part of Apache Hama
> project developed under Apache Software Foundation. The source code is
> currently hosted at
> https://svn.apache.org/repos/asf/hama/trunk/ml/src/main/java/org/apache/hama/ml/ann/
> 
> == Cryptography ==
> 
> Not applicable.
> 
> == Required Resources ==
> 
> === Mailing Lists ===
> 
> * horn-private
> * horn-dev
> 
> === Subversion Directory ===
> 
> * Git is the preferred source control system: git://git.apache.org/horn
> 
> === Issue Tracking ===
> 
> * a JIRA issue tracker, HORN
> 
> == Initial Committers ==
> 
> * Thomas Jungblut (tjungblut AT apache DOT org)
> * Edward J. Yoon (edwardyoon AT apache DOT org)
> * Dongjin Lee (dongjin.lee.kr AT gmail DOT com)
> * Minho Kim (minwise.kim AT samsung DOT com)
> * Jungin Lee (jilee AT clid DOT io)
> * Kyunghyun Paeng (khpaeng AT kaist DOT ac DOT kr)
> * Chia-Hung Lin (chl501 AT apache DOT org)
> * Behroz Sikander (behroz.sikander AT tum DOT de)
> * Kisuk Lee (ks881115 AT gmail DOT com)
> 
> == Affiliations ==
> 
> * Thomas Jungblut (Microsoft)
> * Edward J. Yoon (Samsung Electronics)
> * Donjin Lee (LINE Plus)
> * Minho Kim (Samsung Electronics)
> * Jungin Lee (Cldi Inc.)
> * Kyunghyun Paeng (KAIST)
> * Chia-Hung Lin (Self)
> * Behroz Sikander (Technical University of Munich)
> * Kisuk Lee (Seoul National University)
> 
> == Sponsors ==
> 
> === Champion ===
> 
> * Edward J. Yoon <ASF member, Samsung Electronics>
> 
> === Nominated Mentors ===
> 
> * Luciano Resende <ASF member, IBM>
> * Robin Anil <ASF member, Tock>
> * Edward J. Yoon <ASF member, Samsung Electronics>
> * Rich Bowen <ASF member, Red Hat>
> 
> === Sponsoring Entity ===
> 
> The Apache Incubator
> 
> -- 
> Best Regards, Edward J. Yoon
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> For additional commands, e-mail: general-help@incubator.apache.org
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Mime
View raw message