Return-Path: X-Original-To: apmail-incubator-general-archive@www.apache.org Delivered-To: apmail-incubator-general-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 69ECBD2FD for ; Sat, 13 Oct 2012 14:39:01 +0000 (UTC) Received: (qmail 88936 invoked by uid 500); 13 Oct 2012 14:39:00 -0000 Delivered-To: apmail-incubator-general-archive@incubator.apache.org Received: (qmail 88703 invoked by uid 500); 13 Oct 2012 14:39:00 -0000 Mailing-List: contact general-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@incubator.apache.org Delivered-To: mailing list general@incubator.apache.org Received: (qmail 88695 invoked by uid 99); 13 Oct 2012 14:39:00 -0000 Received: from minotaur.apache.org (HELO minotaur.apache.org) (140.211.11.9) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 13 Oct 2012 14:39:00 +0000 Received: from localhost (HELO mail-ie0-f175.google.com) (127.0.0.1) (smtp-auth username olamy, mechanism plain) by minotaur.apache.org (qpsmtpd/0.29) with ESMTP; Sat, 13 Oct 2012 14:39:00 +0000 Received: by mail-ie0-f175.google.com with SMTP id c13so6090252ieb.6 for ; Sat, 13 Oct 2012 07:38:59 -0700 (PDT) Received: by 10.50.168.69 with SMTP id zu5mr4827871igb.23.1350139139353; Sat, 13 Oct 2012 07:38:59 -0700 (PDT) MIME-Version: 1.0 Received: by 10.231.115.215 with HTTP; Sat, 13 Oct 2012 07:38:39 -0700 (PDT) In-Reply-To: References: From: Olivier Lamy Date: Sat, 13 Oct 2012 16:38:39 +0200 Message-ID: Subject: Re: [VOTE] Accept Helix into Apache Incubator To: general@incubator.apache.org Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable +1 (binding). Looks very interesting. Let me know if you need an other mentor for the incubation. 2012/10/10 kishore g : > Hi, > > I would like to call a vote for accepting Helix for incubation in the > Apache Incubator. I have pasted the full proposal below. > > Please cast your vote: > > [ ] +1, bring Helix into Incubator > [ ] +0, I don't care either way, > [ ] -1, do not bring Helix into Incubator, because ... > > This vote will be open for 72 hours and only votes from the Incubator > PMC are binding. > > Thanks, > Kishore G > > > =3D=3D Abstract =3D=3D > Helix is a cluster management system for managing partitioned and > replicated resources in distributed data systems. > > =3D=3D Proposal =3D=3D > Helix provides an abstraction that separates coordination and > management tasks from functional tasks of a distributed system. The > developer defines the system behavior via a state machine, the > transitions between those states, and constraints on states and > transitions that govern the system=92s valid settings. Helix ensures the > distributed system satisfies the state machine, controlling state > changes as appropriate during common operational activities such as > upgrades, component failures, bootstrapping, running maintenance > tasks, and adding capacity. > > =3D=3D Background =3D=3D > Helix was developed at LinkedIn to manage large clusters for several > diverse applications, including a distributed, partitioned, > replicated, highly available document store with a master-slave model, > a search service with multiple replicas that are updated atomically > and in near real-time, and a change data capture service for reliably > transporting database changes to caches, other dependent databases and > indexes. > > These services use Helix to reliably manage dozens of clusters in > multiple data centers. These services meet stringent SLAs at large > scale for mission-critical production applications such as search, > social gestures, and profiles. > Helix has proven to be flexible for a wide variety of system > configurations and operational patterns, is easy to integrate, with > pluggable interfaces enabling custom behavior. It depends on Apache > Zookeeper for coordination and tracking of system state across the > cluster, as well as providing fault tolerance. > Helix is written in Java. It was developed internally at LinkedIn to > meet our particular use cases, but will be useful to many > organizations facing a similar need to manage large clusters. > Therefore, we would like to share it the ASF and begin developing a > community of developers and users within Apache. > > =3D=3D Rationale =3D=3D > Many organizations can benefit from a generalized cluster management > system such as Helix. While our distributed data systems use-cases for > a very large website like LinkedIn has driven the design of Helix, its > uses are varied and we expect many new use cases to emerge. > > =3D=3D Current Status =3D=3D > =3D=3D=3D Meritocracy =3D=3D=3D > Our intent with this incubator proposal is to start building a diverse > developer community around Helix following the Apache meritocracy > model. Since Helix was initially developed in late 2011, we have had > fast adoption and contributions by multiple teams at LinkedIn. > We plan to continue support for new contributors and work with those > who contribute significantly to the project to make them committers. > > =3D=3D=3D Community =3D=3D=3D > Helix is currently being used internally at LinkedIn and is in > production in that company for customer-facing features. Recent public > presentations of Helix and its goals garnered much interest from > potential contributors. We hope to extend our contributor base > significantly and invite all those who are interested in building > large-scale distributed systems to participate. > To further this goal, we use GitHub issue tracking and branching faciliti= es. > > =3D=3D=3D Core Developers =3D=3D=3D > Helix is currently being developed by three engineers at LinkedIn: > Kishore Gopalakrishna, Shi Lu and Jason Zheng, and Adam Silberstein, > an engineer at Trifacta. Kishore, the lead developer and architect, > has experience within Apache as an S4 committer. Shi developed the > partition to node mapping and rebalancing algorithm, cluster admin > APIs, and the health check framework. Jason developed the cluster > controller and most of the test framework. Adam developed the rich > alerting framework that enables cluster-wide, =93intelligent=93 alerts. > > =3D=3D=3D Alignment =3D=3D=3D > The ASF is the natural choice to host the Helix project as its goal of > encouraging community-driven open-source projects fits with our vision > for Helix. Many projects that can benefit from Helix will rely on > Apache ZooKeeper for cluster state management, and can far more easily > achieve their operational goals by using Helix. > > =3D=3D Known Risks =3D=3D > =3D=3D=3D Orphaned Products =3D=3D=3D > The core developers plan to work full time on the project. There is > very little risk of Helix being abandoned as it is a critical part of > LinkedIn's internal infrastructure and is in production use. > > =3D=3D=3D Inexperience with Open Source =3D=3D=3D > Only one of the core developers has experience with open source > development. Kishore has been actively involved with the ASF as a > committer and lead developer of S4. > > =3D=3D=3D Homogeneous Developers =3D=3D=3D > The current core developers are all from LinkedIn. However, we hope to > establish a developer community that includes contributors from > several corporations and we are actively encouraging new contributors > via the mailing lists and public presentations of Helix. > > =3D=3D=3D Reliance on Salaried Developers =3D=3D=3D > Currently, the developers are paid to do work on Helix. However, once > the project has a community built around it, we expect to get > committers, developers and community from outside the current core > developers. However, because LinkedIn relies on Helix internally, the > reliance on salaried developers is unlikely to change. > > =3D=3D=3D Relationships with Other Apache Products =3D=3D=3D > Helix uses Apache ZooKeeper to coordinate its state amongst the > managed cluster components and for leader election to provide fault > tolerance, and uses Apache Maven for build management. > > =3D=3D=3D An Excessive Fascination with the Apache Brand =3D=3D=3D > While we respect the reputation of the Apache brand and have no doubts > that it will attract contributors and users, our interest is primarily > to give Helix a solid home as an open source project following an > established development model. We have also given reasons in the > Rationale and Alignment sections. > > =3D=3D Documentation =3D=3D > Information about Helix can be found at > [https://github.com/linkedin/helix/wiki]. The following links provide > more information about the project: > > * Home Page: http://linkedin.github.com/helix/ > * Github source: https://github.com/linkedin/helix > * Documentation: https://github.com/linkedin/helix/wiki > * Javadocs: http://linkedin.github.com/helix/apidocs/ > > =3D=3D Initial Source =3D=3D > Helix has been under development at LinkedIn since April 2011. It is > currently hosted on github under the Apache license 2 at > [https://github.com/linkedin/helix] > Helix is written in Java. Its source tree is entirely self-contained > and relies on Maven as its build system and dependency resolution > mechanism. > > =3D=3D External Dependencies =3D=3D > The dependencies all have Apache compatible licenses. > * log4j > * zookeeper > * xstream > * jackson-core-asl > * jackson-mapper-asl > * commons-io > * commons-cli > * commons-math > * zkclient > * camel-josql > * camel-core > * gentlyweb-utils > * josql > * commons-management > * commons-logging-api > * org.restlet > * com.noelios.restlet > * net.sf.jsqlparser > > Non-Apache build tools that are used by Helix are as follows: > * Cobertura: GNU GPLv2 > Note that Cobertura is optional and is only used for calculating unit > test coverage. > > > =3D=3D Cryptography =3D=3D > Not applicable. > > =3D=3D Required Resources =3D=3D > =3D=3D=3D Mailing Lists =3D=3D=3D > * helix-private for private PMC discussions (with moderated subscription= s) > * helix-dev > * helix-commits > * helix-user > > =3D=3D=3D Git Directory =3D=3D=3D > Since Git is now available to be used as primary repo type, Helix > would be available in the git repository instead of svn. > [https://git.apache.org/helix.git] > > =3D=3D=3D Issue Tracking =3D=3D=3D > JIRA Helix (HELIX) > > =3D=3D=3D Other Resources =3D=3D=3D > The existing code already has unit and integration tests, so we would > like a Jenkins instance to run them whenever a new patch is submitted. > This can be added after project creation. > > =3D=3D Initial Committers =3D=3D > * Kishore Gopalakrishna > * Shi Lu > * Zhen Zheng > * Adam Silberstein > * Kapil Surlaker > * Bob Schulman > * Swaroop Jagadish > * Rahul Aggarwal > * Terence Yim > * Santiago Perez > > > =3D=3D Affiliations =3D=3D > * Kishore Gopalakrishna (LinkedIn) > * Shi Lu (LinkedIn) > * Jason Zheng (LinkedIn) > * Adam Silberstein (Trifacta) > * Kapil Surlaker (LinkedIn) > * Bob Schulman (LinkedIn) > * Swaroop Jagadish (LinkedIn) > * Rahul Aggarwal (LinkedIn) > * Terence Yim (LinkedIn) > * Santiago Perez (LinkedIn) > > =3D=3D Sponsors =3D=3D > =3D=3D=3D Champion =3D=3D=3D > * Patrick Hunt (Apache Member) > > =3D=3D=3D Nominated Mentors =3D=3D=3D > * Patrick Hunt (Apache Member) > * Mahadev Konar (Apache Member) > * Owen O'Malley (Apache Member) > > =3D=3D=3D Sponsoring Entity =3D=3D=3D > We are requesting the Incubator to sponsor this project. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org > For additional commands, e-mail: general-help@incubator.apache.org > --=20 Olivier Lamy Talend: http://coders.talend.com http://twitter.com/olamy | http://linkedin.com/in/olamy --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org For additional commands, e-mail: general-help@incubator.apache.org