hadoop-pig-dev mailing list archives

From "Jeff Hammerbacher (JIRA)" <j...@apache.org>
Subject [jira] Issue Comment Edited: (PIG-823) Hadoop Metadata Service
Date Fri, 29 May 2009 18:22:45 GMT

    [ https://issues.apache.org/jira/browse/PIG-823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12714539#action_12714539 ]

Jeff Hammerbacher edited comment on PIG-823 at 5/29/09 11:20 AM:
-----------------------------------------------------------------

Hey,

Hadoop already has a metadata service (well defined at http://svn.apache.org/viewvc/hadoop/hive/trunk/metastore/if/hive_metastore.thrift)
and a SQL implementation in production use at scale at several organizations. Can any of that
work be reused for this purpose? It seems like duplicating effort across subprojects is a
bad idea.

Later,
Jeff

      was (Author: hammer):
    Hey,

Hadoop already had a metadata service (well defined at http://svn.apache.org/viewvc/hadoop/hive/trunk/metastore/if/hive_metastore.thrift)
and a SQL implementation in production use at scale at several organizations. Can any of that
work be reused for this purpose? It seems like duplicating effort across subprojects is a
bad idea.

Later,
Jeff
  
> Hadoop Metadata Service
> -----------------------
>
>                 Key: PIG-823
>                 URL: https://issues.apache.org/jira/browse/PIG-823
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Olga Natkovich
>
> This JIRA is created to track development of a metadata system for Hadoop. The goal
of the system is to allow users and applications to register data stored on HDFS, search for
the data available on HDFS, and associate metadata such as schema, statistics, etc. with a
particular data unit or a data set stored on HDFS. The initial goal is to provide a fairly
generic, low-level abstraction that any user or application on HDFS can use to store and retrieve
metadata. Over time, higher-level abstractions closely tied to particular applications or
tools can be developed.
> Over time, it would make sense for the metadata service to become a subproject within
Hadoop. For now, the proposal is to make it a contrib to Pig since Pig SQL is likely to be
the first user of the system.
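
To make the three operations in the description concrete (register data, search for it, associate metadata with it), here is a minimal in-memory sketch. It is only an illustration under stated assumptions: the issue does not define an API, so `MetadataRegistry`, `DataUnit`, and every method name below are hypothetical, and a real service would persist its state and talk to HDFS rather than hold a dictionary.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class DataUnit:
    """A registered HDFS path plus free-form metadata (schema, statistics, ...)."""
    path: str
    metadata: Dict[str, Any] = field(default_factory=dict)


class MetadataRegistry:
    """Hypothetical in-memory stand-in for the proposed metadata service."""

    def __init__(self) -> None:
        self._units: Dict[str, DataUnit] = {}

    def register(self, path: str) -> DataUnit:
        # Registering a path that is already known returns the existing entry.
        return self._units.setdefault(path, DataUnit(path))

    def associate(self, path: str, key: str, value: Any) -> None:
        # Attach one piece of metadata to an already registered path.
        self._units[path].metadata[key] = value

    def get(self, path: str, key: str, default: Any = None) -> Any:
        # Retrieve one piece of metadata, or the default if it was never set.
        return self._units[path].metadata.get(key, default)

    def search(self, prefix: str) -> List[str]:
        # Return registered paths that start with the given prefix, sorted.
        return sorted(p for p in self._units if p.startswith(prefix))
```

A caller would register a path such as `/data/users`, associate a `schema` entry with it, and later find it again with `search("/data/")`. Keeping metadata as untyped key/value pairs is one way to read "fairly generic, low-level"; typed schemas or statistics would belong to the higher-level abstractions mentioned above.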

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

