hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jay Tang (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-1331) Owl Hadoop Table Management Service
Date Fri, 26 Mar 2010 18:14:27 GMT
Owl Hadoop Table Management Service

                 Key: PIG-1331
                 URL: https://issues.apache.org/jira/browse/PIG-1331
             Project: Pig
          Issue Type: New Feature
            Reporter: Jay Tang

This JIRA is a proposal to create a Hadoop table management service: Owl. Today, MapReduce
and Pig applications interacts directly with HDFS directories and files and must deal with
low level data management issues such as storage format, serialization/compression schemes,
data layout, and efficient data accesses, etc, often with different solutions. Owl aims to
provide a standard way to addresses this issue and abstracts away the complexities of reading/writing
huge amount of data from/to HDFS.

Owl has a data access API that is modeled after the traditional Hadoop !InputFormt and a management
API to manipulate Owl objects.  This JIRA is related to Pig-823 (Hadoop Metadata Service)
as Owl has an internal metadata store.  Owl integrates with different storage module like
Zebra with a pluggable architecture.

 Initially, the proposal is to submit Owl as a Pig contrib project.  Over time, it makes sense
to move it to a Hadoop subproject.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message