incubator-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Seetharam Venkatesh <venkat...@innerzeal.com>
Subject Re: [VOTE] Accept Horn into the ASF incubator
Date Wed, 02 Sep 2015 03:27:57 GMT
+1

On Tue, Sep 1, 2015 at 11:20 AM Hitesh Shah <hitesh@apache.org> wrote:

> +1
>
> — Hitesh
>
> On Aug 31, 2015, at 4:13 PM, Edward J. Yoon <edwardyoon@apache.org> wrote:
>
> > Hi folks,
> >
> > I would like to call a vote to accept Horn, as a new Apache Incubator
> > project. The full proposal is available at the end of this mail and as
> > a https://wiki.apache.org/incubator/HornProposal (the changes from
> > initial discussion draft are addition of 2 committers from cldi-kaist
> > team and Rich as a mentor).
> >
> > The VOTE is open for at least the next 72 hours:
> >
> > [ ] +1 Accept Horn into the Apache Incubator
> > [ ] 0
> > [ ] -1 Do not accept Horn into the Apache Incubator bc ..
> >
> > I'd like to get the voting started w/ my own +1
> >
> > Thanks!
> >
> > == Abstract ==
> >
> > Horn [hɔ:n] (korean meaning of Horn is a "Spirit") is a neuron-centric
> > programming APIs and execution framework for large-scale deep
> > learning, built on top of Apache Hama.
> >
> > == Proposal ==
> >
> > It is a goal of the Horn to provide a neuron-centric programming APIs
> > which allows user to easily define the characteristic of artificial
> > neural network model and its structure, and its execution framework
> > that leverages the heterogeneous resources on Hama and Hadoop YARN
> > cluster.
> >
> > == Background ==
> >
> > The initial ANN code was developed at Apache Hama project by a
> > committer, Yexi Jiang (Facebook) in 2013. The motivation behind this
> > work is to build a framework that provides more intuitive programming
> > APIs like Google's MapReduce or Pregel and supports applications
> > needing large model with huge memory consumptions in distributed way.
> >
> > == Rationale ==
> >
> > While many of deep learning open source softwares such as Caffe,
> > DeepDist, DL4j, and NeuralGiraph are still data or model parallel
> > only, we aim to support both data and model parallelism and also
> > fault-tolerant system design. The basic idea of data and model
> > parallelism is use of the remote parameter server to parallelize model
> > creation and distribute training across machines, and the BSP
> > framework of Apache Hama for performing asynchronous mini-batches.
> > Within single BSP job, each task group works asynchronously using
> > region barrier synchronization instead of global barrier
> > synchronization, and trains large-scale neural network model using
> > assigned data sets in BSP paradigm. Thus, we achieve data and model
> > parallelism. This architecture is inspired by Google's !DistBelief
> > (Jeff Dean et al, 2012).
> >
> > == Initial Goals ==
> >
> > Some current goals include:
> >
> > * builds new community
> > * provides more intuitive programming APIs
> > * needs both data and model parallelism support
> > * must run natively on both Hama and Hadoop2
> > * needs also GPUs and InfiniBand support (FPGAs if possible)
> >
> > == Current Status ==
> >
> > === Meritocracy ===
> >
> > The core developers understand what it means to have a process based
> > on meritocracy. We will provide continuous efforts to build an
> > environment that supports this, encouraging community members to
> > contribute.
> >
> > === Community ===
> >
> > A small community has formed within the Apache Hama project community,
> > universities, and companies such as deep learning startup, instant
> > messenger service company, and mobile manufacturing company. And many
> > people are interested in the large-scale deep learning platform
> > itself. By bringing Horn into Apache, we believe that the community
> > will grow even bigger.
> >
> > === Core Developers ===
> >
> > Edward J. Yoon, Thomas Jungblut, Jungin Lee, and Minho Kim
> >
> > == Known Risks ==
> >
> > === Orphaned Products ===
> >
> > Apache Hama is already a core open source component at Samsung
> > Electronics, and Horn also will be used by Samsung Electronics and
> > Cldi Inc., and so there is no direct risk for this project to be
> > orphaned.
> >
> > === Inexperience with Open Source ===
> >
> > Some are very new and the others have experience using and/or working
> > on Apache open source projects.
> >
> > === Homogeneous Developers ===
> >
> > The initial committers are from different organizations such as,
> > Microsoft, Samsung Electronics, Seoul National University, Technical
> > University of Munich, KAIST, LINE plus, and Cldi Inc.
> >
> > === Reliance on Salaried Developers ===
> >
> > Few will be worked as a full-time open source developer. Other
> > developers will also start working on the project in their spare time.
> >
> > === Relationships with Other Apache Products ===
> >
> > * Horn is based on Apache Hama
> > * Apache Zookeeper is used for distributed locking service
> > * Natively run on Apache Hadoop and Mesos
> > * Horn can be somewhat overlapped with Singa podling (If possible,
> > we'd also like to use Singa or Caffe to do the heavy lifting part).
> >
> > === An Excessive Fascination with the Apache Brand ===
> >
> > Horn itself will hopefully have benefits from Apache, in terms of
> > attracting a community and establishing a solid group of developers,
> > but also the relation with Apache Hadoop, Zookeeper, and Hama. These
> > are the main reasons for us to send this proposal.
> >
> > == Documentation ==
> >
> > Initial plan about Horn can be found at
> > http://blog.udanax.org/2015/06/googles-distbelief-clone-project-on.html
> >
> > == Initial Source ==
> >
> > The initial source code has been release as part of Apache Hama
> > project developed under Apache Software Foundation. The source code is
> > currently hosted at
> >
> https://svn.apache.org/repos/asf/hama/trunk/ml/src/main/java/org/apache/hama/ml/ann/
> >
> > == Cryptography ==
> >
> > Not applicable.
> >
> > == Required Resources ==
> >
> > === Mailing Lists ===
> >
> > * horn-private
> > * horn-dev
> >
> > === Subversion Directory ===
> >
> > * Git is the preferred source control system: git://git.apache.org/horn
> >
> > === Issue Tracking ===
> >
> > * a JIRA issue tracker, HORN
> >
> > == Initial Committers ==
> >
> > * Thomas Jungblut (tjungblut AT apache DOT org)
> > * Edward J. Yoon (edwardyoon AT apache DOT org)
> > * Dongjin Lee (dongjin.lee.kr AT gmail DOT com)
> > * Minho Kim (minwise.kim AT samsung DOT com)
> > * Jungin Lee (jilee AT clid DOT io)
> > * Kyunghyun Paeng (khpaeng AT kaist DOT ac DOT kr)
> > * Chia-Hung Lin (chl501 AT apache DOT org)
> > * Behroz Sikander (behroz.sikander AT tum DOT de)
> > * Kisuk Lee (ks881115 AT gmail DOT com)
> >
> > == Affiliations ==
> >
> > * Thomas Jungblut (Microsoft)
> > * Edward J. Yoon (Samsung Electronics)
> > * Donjin Lee (LINE Plus)
> > * Minho Kim (Samsung Electronics)
> > * Jungin Lee (Cldi Inc.)
> > * Kyunghyun Paeng (KAIST)
> > * Chia-Hung Lin (Self)
> > * Behroz Sikander (Technical University of Munich)
> > * Kisuk Lee (Seoul National University)
> >
> > == Sponsors ==
> >
> > === Champion ===
> >
> > * Edward J. Yoon <ASF member, Samsung Electronics>
> >
> > === Nominated Mentors ===
> >
> > * Luciano Resende <ASF member, IBM>
> > * Robin Anil <ASF member, Tock>
> > * Edward J. Yoon <ASF member, Samsung Electronics>
> > * Rich Bowen <ASF member, Red Hat>
> >
> > === Sponsoring Entity ===
> >
> > The Apache Incubator
> >
> > --
> > Best Regards, Edward J. Yoon
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> > For additional commands, e-mail: general-help@incubator.apache.org
> >
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> For additional commands, e-mail: general-help@incubator.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message