hadoop-pig-dev mailing list archives

From "Dmitriy V. Ryaboy (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1205) Enhance HBaseStorage-- Make it support loading row key and implement StoreFunc
Date Wed, 17 Mar 2010 15:30:27 GMT

    [ https://issues.apache.org/jira/browse/PIG-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846442#action_12846442 ]

Dmitriy V. Ryaboy commented on PIG-1205:

A brief list of issues that come to mind (these apply to the 0.6 version, but I think things
are substantially the same in 0.7):

1) Extending Utf8Converter means data in HBase is expected to be stored as strings. Conversion
using the HBase Bytes class should be supported instead (or at least in addition).
2) No projection push-down. Even though it is clear which columns are needed, the client
pulls everything and picks out the requested columns only when constructing a tuple.
The columns should be pushed into the Scan.
3) No filter push-down. HBase has a number of efficient filters available, none of which are
used. At a minimum, range constraints on the row key should be supported.
4) No way to pull out the row key (but you are adding that in this ticket, so that's good).
5) No way to control row version / timestamp.
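
To make point 1 concrete, here is a minimal sketch in plain Java of why a string-based converter cannot read binary cells. It does not use Pig or HBase classes: the class EncodingSketch and its methods are hypothetical names made up for this illustration. The binary layout is an assumption modeled on my understanding of what the HBase Bytes class produces for a long (8 bytes, big-endian).

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; not part of Pig or HBase.
class EncodingSketch {

    // Text encoding: the long is stored as its decimal digits in UTF-8,
    // which is what a string-based converter expects to find in a cell.
    static byte[] encodeAsText(long value) {
        return Long.toString(value).getBytes(StandardCharsets.UTF_8);
    }

    // Binary encoding: 8 bytes, big-endian (ByteBuffer's default order).
    // Assumed here to match the layout HBase's Bytes class uses for a long.
    static byte[] encodeAsBinary(long value) {
        return ByteBuffer.allocate(Long.BYTES).putLong(value).array();
    }

    // Decoder matching encodeAsText: parse the bytes as a decimal string.
    static long decodeText(byte[] raw) {
        return Long.parseLong(new String(raw, StandardCharsets.UTF_8));
    }

    // Decoder matching encodeAsBinary: read 8 big-endian bytes.
    static long decodeBinary(byte[] raw) {
        return ByteBuffer.wrap(raw).getLong();
    }

    public static void main(String[] args) {
        byte[] text = encodeAsText(42L);
        byte[] binary = encodeAsBinary(42L);
        System.out.println(text.length + " text bytes, " + binary.length + " binary bytes");
        System.out.println("binary round trip: " + decodeBinary(binary));
        try {
            // A string-only loader effectively does this on every cell.
            decodeText(binary);
        } catch (NumberFormatException e) {
            System.out.println("text decoder cannot read the binary cell");
        }
    }
}
```

Each decoder only works on data written by the matching encoder, which is why a loader limited to string conversion is a problem for tables written by clients that store binary values.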

None of this is rocket science, and in fact I am making good progress on all of these for 0.6,
but the work is unlikely to be finished and ported to 0.7 by Monday.


> Enhance HBaseStorage-- Make it support loading row key and implement StoreFunc
> ------------------------------------------------------------------------------
>                 Key: PIG-1205
>                 URL: https://issues.apache.org/jira/browse/PIG-1205
>             Project: Pig
>          Issue Type: Sub-task
>    Affects Versions: 0.7.0
>            Reporter: Jeff Zhang
>            Assignee: Jeff Zhang
>             Fix For: 0.7.0
>         Attachments: PIG_1205.patch, PIG_1205_2.patch, PIG_1205_3.patch, PIG_1205_4.patch

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
