hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alan Gates <ga...@yahoo-inc.com>
Subject Begin a discussion about Pig as a top level project
Date Fri, 19 Mar 2010 17:59:16 GMT
You have probably heard by now that there is a discussion going on in  
the Hadoop PMC as to whether a number of the subprojects (Hbase, Avro,  
Zookeeper, Hive, and Pig) should move out from under the Hadoop  
umbrella and become top level Apache projects (TLP).  This discussion  
has picked up recently since the Apache board has clearly communicated  
to the Hadoop PMC that it is concerned that Hadoop is acting as an  
umbrella project with many disjoint subprojects underneath it.  They  
are concerned that this gives Apache little insight into the health  
and happenings of the subproject communities which in turn means  
Apache cannot properly mentor those communities.

The purpose of this email is to start a discussion within the Pig  
community about this topic.  Let me cover first what becoming TLP  
would mean for Pig, and then I'll go into what options I think we as a  
community have.

Becoming a TLP would mean that Pig would itself have a PMC that would  
report directly to the Apache board.  Who would be on the PMC would be  
something we as a community would need to decide.  Common options  
would be to say all active committers are on the PMC, or all active  
committers who have been a committer for at least a year.  We would  
also need to elect a chair of the PMC.  This lucky person would have  
no additional power, but would have the additional responsibility of  
writing quarterly reports on Pig's status for Apache board meetings,  
as well as coordinating with Apache to get accounts for new   
committers, etc.  For more information see http://www.apache.org/foundation/how-it-works.html#roles

Becoming a TLP would not mean that we are ostracized from the Hadoop  
community.  We would continue to be invited to Hadoop Summits, HUGs,  
etc.  Since all Pig developers and users are by definition Hadoop  
users, we would continue to be a strong presence in the Hadoop  

I see three ways that we as a community can respond to this:

1) Say yes, we want to be a TLP now.
2) Say yes, we want to be a TLP, but not yet.  We feel we need more  
time to mature.  If we choose this option we need to be able to  
clearly articulate how much time we need and what we hope to see  
change in that time.
3) Say no, we feel the benefits for us staying with Hadoop outweigh  
the drawbacks of being a disjoint subproject.  If we choose this, we  
need to be able to say exactly what those benefits are and why we feel  
they will be compromised by leaving the Hadoop project.

There may other options that I haven't thought of.  Please feel free  
to suggest any you think of.

Questions?  Thoughts?  Let the discussion begin.


View raw message