hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zheng Shao (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-1133) Refactor InputFormat and OutputFormat for Hive
Date Fri, 05 Feb 2010 06:42:27 GMT

    [ https://issues.apache.org/jira/browse/HIVE-1133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12829977#action_12829977
] 

Zheng Shao commented on HIVE-1133:
----------------------------------

Thanks for the note, Bennie. In the future, please assign it to yourself click "submit patch"
so that we know it's ready for review (we will "cancel patch" if we have comments).



> Refactor InputFormat and OutputFormat for Hive
> ----------------------------------------------
>
>                 Key: HIVE-1133
>                 URL: https://issues.apache.org/jira/browse/HIVE-1133
>             Project: Hadoop Hive
>          Issue Type: Improvement
>    Affects Versions: 0.6.0
>            Reporter: Zheng Shao
>
> Currently we ran into several problems of the FileInputFormat/OutputFormat in Hive.
> The requirements are:
> R1. We want to support HBase: HIVE-806
> R2. We want to selectively include files based on file names: HIVE-951
> R3. We want to optionally choose to recurse on the directory structure: HIVE-1083
> R4. We want to pass the filter condition into the storage (very useful for HBase, and
indexed data format)
> R5. We want to pass the column selection information into the storage (already done as
part of the RCFile, but we can do it better)
> We need to structure these requirements and the code structure in a good way to make
it extensible.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message