incubator-cvs mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Incubator Wiki] Update of "PigProposal" by OlgaN
Date Mon, 17 Sep 2007 16:20:51 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The following page has been changed by OlgaN:
http://wiki.apache.org/incubator/PigProposal

------------------------------------------------------------------------------
  
  The Pig project consists of high-level languages for expressing data analysis programs,
coupled with infrastructure for evaluating these programs. The salient property of Pig programs
is that their structure is amenable to substantial parallelization, which in turns enables
them to handle very large data sets.
  
- At the present time, Pig's infrastructure layer consists of a compiler that produces sequences
of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g.,
the Hadoop project). Pig's language layer currently consists of a textual language called
Pig Latin, which has the following key properties:
+ At the present time, Pig's infrastructure layer consists of a compiler that produces sequences
of Map-Reduce programs, for which large-scale parallel implementations already exist (e.g.,
the Hadoop subproject). Pig's language layer currently consists of a textual language called
Pig Latin, which has the following key properties:
  
   1. ''Ease of programming''. It is trivial to achieve parallel execution of simple, "embarrassingly
parallel" data analysis tasks. Complex tasks comprised of multiple interrelated data transformations
are explicitly encoded as data flow sequences, making them easy to write, understand, and
maintain.
   2. ''Optimization opportunities''. The way in which tasks are encoded permits the system
to optimize their execution automatically, allowing the user to focus on semantics rather
than efficiency.
@@ -41, +41 @@

  
  === Meritocracy ===
  
- Pig was started as a project that was developed by Yahoo! research team. Recently we have
added a development team that works in harmony with the research team with both teams actively
and successfully contributing to the project. We are planning to create the environment that
encourages meritocracy and is consistent with the meritocracy principles of Apache. Within
the team we have people actively participating in the Hadoop project.
+ Pig was started as a project that was developed by Yahoo! research team. Recently we have
added a development team that works in harmony with the research team with both teams actively
and successfully contributing to the project. We are planning to create the environment that
encourages meritocracy and is consistent with the meritocracy principles of Apache. Within
the team we have people actively participating in the Hadoop subproject.
  
  === Community ===
  
@@ -62, +62 @@

  == Known Risks ==
  === Orphaned products ===
  
- All current contributors are part of Yahoo which is a major player in the space and is committed
to grid computing. Also we expect high degree of synergy with Hadoop project.
+ All current contributors are part of Yahoo which is a major player in the space and is committed
to grid computing. Also we expect high degree of synergy with Hadoop subproject.
  
  === Inexperience with Open Source ===
  
@@ -78, +78 @@

  
  === Relationships with Other Apache Products ===
  
- Pig is built on top of Hadoop and we expect deep collaboration with Hadoop project.
+ Pig is built on top of Hadoop and we expect deep collaboration with Hadoop subproject.
  
  === An Excessive Fascination with the Apache Brand ===
  

---------------------------------------------------------------------
To unsubscribe, e-mail: cvs-unsubscribe@incubator.apache.org
For additional commands, e-mail: cvs-help@incubator.apache.org


Mime
View raw message