hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zheng Shao (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-431) Auto-add table property "select" to be the select statement that created the table
Date Fri, 05 Mar 2010 20:43:27 GMT

    [ https://issues.apache.org/jira/browse/HIVE-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12842014#action_12842014

Zheng Shao commented on HIVE-431:

I guess the information is already in lineage.

I think it's a good idea to keep lineage information away from the core metadata, especially
given that we are going to have column lineage etc.
But we should provide an easy way for users to retrieve the lineage information.

> Auto-add table property "select" to be the select statement that created the table
> ----------------------------------------------------------------------------------
>                 Key: HIVE-431
>                 URL: https://issues.apache.org/jira/browse/HIVE-431
>             Project: Hadoop Hive
>          Issue Type: Wish
>            Reporter: Adam Kramer
> A syntactic copy of the query that was used to fill a table would often be AMAZINGLY
useful for figuring out where the data in the table came from.
> I think the best way to implement this would be to automatically add a table property
which includes the SELECT statement. For partitioned tables, this would need to exist for
each partition...or perhaps use some canonical name like selectquery for unpartitioned tables,
plus selectquery_ds=<DATEID> for partitioned tables.
> This problem is growing as more and more tables in our database are generated by either
"root" or by people who are no longer easy to contact.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message