incubator-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Vasas Szabolcs <va...@apache.org>
Subject Re: [VOTE] Accept Marvin-AI into Apache Incubator
Date Thu, 23 Aug 2018 14:13:51 GMT
+1

On 2018/08/23 10:26:43, "Tan,Zhongyi" <tanzhongyi@baidu.com> wrote: 
> +1 (not binding)
> 
> 在 2018/8/23 上午7:47, "Willem Jiang" <willem.jiang@gmail.com> 写入:
> 
> >+1 (binding)
> >
> >
> >Willem Jiang
> >
> >Twitter: willemjiang
> >Weibo: 姜宁willem
> >
> >On Wed, Aug 22, 2018 at 1:43 AM, Luciano Resende <luckbr1975@gmail.com>
> >wrote:
> >
> >> After the initial discussion, please vote on the acceptance of Marvin-AI
> >> Project for incubation at the Apache Incubator. The full proposal is
> >> available at the end of this message and on the wiki at :
> >>
> >> https://wiki.apache.org/incubator/Marvin-AI
> >>
> >> Please cast your votes:
> >>
> >> [ ] +1, bring Marvin-AI into Incubator
> >> [ ] +0, I don't care either way
> >> [ ] -1, do not bring Marvin-AI into Incubator, because...
> >>
> >> The vote is open for the next 72 hours and only votes from the
> >> Incubator PMC are binding.
> >>
> >> ===
> >>
> >> = Marvin-AI =
> >>
> >> == Abstract ==
> >>
> >> Marvin-AI is an open-source artificial intelligence (AI) platform that
> >> helps data scientists, prototype and productionalize complex solutions
> >>with
> >> a scalable, low-latency, language-agnostic, and standardized
> >>architecture
> >> while simplifies the process of exploration and modeling.
> >>
> >> == Proposal ==
> >>
> >> Marvin helps non-experienced developers create industry-grade AI
> >> applications. It has three core components:  a development environment
> >>to
> >> be used during data exploration and hypothesis validation (Toolbox), a
> >> library which should be extended to create Marvin engines, and a Scala
> >> application server which interprets engines (Engine Executor).
> >> A basic premise of Marvin is that it should be language-agnostic, able
> >>to
> >> interpret engines implemented in different programming languages.
> >>
> >> == Background ==
> >>
> >> The Marvin AI project was initiated as an internal project at B2W
> >>Digital
> >> (Brazil), the largest e-commerce company in Latin America. Nowadays, it
> >>is
> >> used by all data scientists within the B2W team. Oftentimes, data
> >> scientists don't have an extensive background in software engineering,
> >>yet
> >> are in charge of creating AI applications that need to scale to high
> >> throughput and provide millisecond-level response times. At B2W, Marvin
> >>AI
> >> plays an important role in this process, abstracting advanced software
> >> engineering procedures, allowing data scientists to focus on their
> >> knowledge domain.
> >>
> >> == Rationale ==
> >>
> >> With recent advances in computer architecture and a corresponding
> >>increase
> >> in the amount of data generated by always-connected devices, AI
> >>algorithms
> >> offer a solution to problems that have long troubled modern
> >>corporations.
> >> Since AI developers come from various fields, such as statistics,
> >>physics,
> >> and math, there exists a strong need for platforms which enable them to
> >> move from prototypes to enterprise applications. Although some tools
> >>claim
> >> to offer this service, in reality, there is no reliable open-source
> >> solution.
> >>
> >> == Initial Goals ==
> >>
> >> The initial goals will most likely be to merge the existing codebase
> >>into a
> >> single repository, migrate it to Apache, and then integrate with the
> >>Apache
> >> development process. Furthermore, we plan for incremental development
> >>and
> >> releases, as per Apache guidelines.
> >>
> >> == Current Status ==
> >>
> >> === Meritocracy ===
> >>
> >> Marvin already works under principles of meritocracy. Today, Marvin
> >>already
> >> has some contributors that are part of other institutions. Although
> >>there
> >> is no formal process defined to become a committer, contributors that
> >>make
> >> major changes/improvements to the platform are naturally granted write
> >> access to the repository.
> >>
> >>
> >> === Community ===
> >>
> >> Acceptance into the Apache foundation would substantially boost both
> >> Marvin's user and developer communities. The current community includes
> >>a
> >> few experienced developers that have either academic or professional
> >> experience with AI. The community is largely comprised of data
> >>scientists
> >> working at B2W and other companies such as Cloudera, MIT, Qume Labs,
> >> Laguro.com, and CBYK. Also, there is a  meetup group of hundreds of
> >>users
> >> who meet regularly to exchange ideas about Marvin and, more generally,
> >>AI.
> >>
> >> Reference to the group: https://www.meetup.com/marvin-ai/members/
> >>
> >> === Core Developers ===
> >>
> >> The core developers for Marvin are listed in the contributor's list and
> >> initial PPMC below. These lists include B2W employees, MIT students,
> >>UFSCAR
> >> researchers, independent contributors, and some employees of other
> >> companies like Cloudera, Qume Labs, Laguro.com, and CBYK.
> >>
> >> === Alignment ===
> >>
> >> The initial committers strongly believe that by being part of the Apache
> >> Software Foundation, Marvin AI will be part of a comprehensive suite
> >>for AI
> >> applications that can process big data and enable enterprises to extract
> >> value from their data lakes. Also, we hope that by integrating with
> >>other
> >> Apache projects such as Apache Spark, Apache Hadoop; that this will
> >>foster
> >> additional collaboration between these projects furthering the already
> >> existing integration points and expanding the community of contributors.
> >>
> >>
> >> == Known Risks ==
> >>
> >> === Orphaned products ===
> >>
> >> Given the current maturity of Marvin and how well it has been received
> >>at
> >> technical conferences, the risk of the project being abandoned is
> >>minimal.
> >> AI is not academia-exclusive anymore, and as enterprises start to add
> >> data-science pipelines to their applications, demand for Marvin will
> >>only
> >> increase.
> >>
> >> === Inexperience with Open Source ===
> >>
> >> Marvin AI has been an open-source project since October 2017. The
> >>project
> >> was started in a company where open-source culture is foundational. B2W
> >> Digital runs the largest e-commerce in Latin America on top of
> >>open-source
> >> projects.
> >>
> >> === Reliance on Salaried Developers ===
> >>
> >> Marvin AI receives substantial efforts from salaried developers -- a
> >>few of
> >> which were hired by companies to work exclusively for the project -- but
> >> the majority devote "after-hours" or spare time to this project. Some
> >> developers are graduate students that contribute in their free time at
> >> school.
> >>
> >> === Relationships with Other Apache Products ===
> >>
> >> Marvin integrates with several Apache products, such as Hadoop (HDFS)
> >>and
> >> Spark. Marvin shares some similar features with PredictionIO,
> >>specifically
> >> the model application server and a design pattern that was inspired by
> >>the
> >> DASE. Despite these similarities, Marvin is catered towards a different
> >> clientele (data scientists), and for that reason, it includes many
> >>critical
> >> features that are not provided by PredictionIO.
> >>
> >> === An Excessive Fascination with the Apache Brand ===
> >>
> >> While the ASF brand will undoubtedly help Marvin become a successful
> >> project, Marvin is already gaining traction at companies around the
> >>globe.
> >>
> >> == Documentation ==
> >>
> >> http://www.marvin-ai.org
> >>
> >>
> >> == Initial Source ==
> >>
> >> The current codebase is available at http://github.com/marvin-ai. This
> >>is
> >> practically the same code that will be migrating to the Apache
> >>Foundation,
> >> the notable difference being that the multiple repositories will be
> >>merged
> >> into a single repository (if necessary).
> >>
> >> These are the main repositories and a very simplified explanation about
> >> each one:
> >>
> >> '''Main repositories'''
> >>
> >>  * marvin-ai/marvin-python-toolbox - Data Science toolbox that helps in
> >> the
> >> creation of new ML engines
> >>  * marvin-ai/marvin-engine-executor - Component responsible for
> >> interpreting, serving and managing Marvin engines
> >>  * marvin-ai/marvin-public-engines - Marvin engine examples to help new
> >> Marvin users to build engines
> >>  * marvin-ai/marvin-platform-book - Documentation in GitHub book site
> >> format
> >>
> >> '''Secondary repositories (Experimental and Initial)'''
> >>  * marvin-ai/marvin-vagrant-dev - Development environment that uses
> >> VirtualBox and vagrant to non mac and Linux users;
> >>  * marvin-ai/marvin-paper - Source code (latex format) of the first
> >>Marvin
> >> paper published in PAPIS.io conference in Boston.
> >>  * marvin-ai/marvin-cluster-admin - Admin module responsible to manage
> >> Marvin cluster;
> >>  * marvin-ai/marvin-automl - AutoML module responsible to help data
> >> scientist to build machine learning models with a very simple visual
> >> interface;
> >>
> >>
> >> == External Dependencies ==
> >>
> >> It is very likely that all our dependencies are using either the Apache
> >>or
> >> MIT license. Upon acceptance to the incubator, we would begin a thorough
> >> analysis of all transitive dependencies to verify this fact and
> >>introduce
> >> license checking into the build and release process.
> >>
> >> == Required Resources ==
> >>
> >> === Mailing lists ===
> >>
> >>   * private@marvin.incubator.apache.org (with moderated subscriptions)
> >>   * dev@marvin.incubator.apache.org
> >>   * commits@marvin.incubator.apache.org
> >>
> >>
> >> === Git Repositories ===
> >>
> >>   * https://git-wip-us.apache.org/repos/asf/incubator-marvin.git
> >>
> >> === Issue Tracking ===
> >>
> >>   * JIRA (MARVIN)
> >>
> >> == Initial Committers ==
> >>
> >>  * Lucas Bonatto Miguel <lucasbonatto@gmail.com> - Qume Labs
> >>(California -
> >> USA)
> >>  * Daniel Takabayashi <daniel.takabayashi@gmail.com> - B2W Digital (São
> >> Paulo - BR) / Laguro.com (California - USA)
> >>  * Bruno Piraja <bruno.piraja@b2wdigital.com> - B2W Digital (São Paulo
-
> >> BR)
> >>  * Zhang Yifei <zhang.yifei@b2wdigital.com> - B2W Digital (São Paulo
-
> >>BR)
> >>  * Harrison Wang <hwang123@mit.edu> - MIT (USA)
> >>  * Brody West <brodyw@mit.edu> - MIT (USA)
> >>  * Rafael Novello <rafael.novello@b2wdigital.com> - B2W Digital (São
> >>Paulo
> >> - BR)
> >>  * Willian Leite <willian.leite@cbyk.com.br> - CBYK (São Paulo - BR)
> >>  * Danilo Nunes <nunesdanilo@gmail.com> - Qume Labs (California - USA)
> >>  * Alan Silva <alan.silva@cloudera.com> Cloudera (USA)
> >>  * Jeremy Elster <jeremy.elster@b2wdigital.com> - B2W Digital (São
> >>Paulo -
> >> BR)
> >>
> >>
> >> == Sponsors ==
> >>
> >> === Champion ===
> >>
> >>  * Luciano Resende - (lresende)
> >>
> >> === Nominated Mentors ===
> >>
> >>  * Luciano Resende - (lresende)
> >>  * Jim Jagielski - (jim)
> >>  * William Colen - (colen)
> >>
> >> === Sponsoring Entity ===
> >> We would like to propose the Apache Incubator to sponsor this project.
> >>
> >>
> >> --
> >> Luciano Resende
> >> http://twitter.com/lresende1975
> >> http://lresende.blogspot.com/
> >>
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> For additional commands, e-mail: general-help@incubator.apache.org
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Mime
View raw message