hadoop-pig-dev mailing list archives

From "Chao Wang (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-1375) [Zebra] To support writing multiple Zebra tables through Pig
Date Mon, 19 Apr 2010 21:34:51 GMT

     [ https://issues.apache.org/jira/browse/PIG-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chao Wang updated PIG-1375:
---------------------------

    Attachment: PIG-1375.patch

Thanks to Xuefu for the feedback.

Updated the patch to incorporate comments 2 and 4.
For comment 1) The indentation change is incidental: it brings some files (those touched by this
feature) in line with Zebra's tab policy of two-space indentation.
For comment 3) The flag idea needs to be justified by further performance profiling.
The cost of the check here should be trivial compared with other operations such as generateKey() and
insert().


 

> [Zebra] To support writing multiple Zebra tables through Pig
> ------------------------------------------------------------
>
>                 Key: PIG-1375
>                 URL: https://issues.apache.org/jira/browse/PIG-1375
>             Project: Pig
>          Issue Type: New Feature
>    Affects Versions: 0.7.0
>            Reporter: Chao Wang
>            Assignee: Chao Wang
>             Fix For: 0.8.0
>
>         Attachments: PIG-1375.patch, PIG-1375.patch, PIG-1375.patch
>
>
> In Zebra, we already support multiple outputs for map/reduce, but we do not support
> this feature when users use Zebra through Pig.
> This jira addresses that gap: we plan to support writing to multiple output tables
> through Pig as well.
> We propose to support the following Pig store statements with multiple outputs:
> store relation into 'loc1,loc2,loc3....' using org.apache.hadoop.zebra.pig.TableStorer('storagehint_string',
> 'complete name of your custom partition class', 'some arguments to partition class');
> /* if partition class arguments are needed */
> store relation into 'loc1,loc2,loc3....' using org.apache.hadoop.zebra.pig.TableStorer('storagehint_string',
> 'complete name of your custom partition class'); /* if no partition class arguments are
> needed */
> Note that users specify up to three arguments: the storage hint string, the complete
> name of the partition class, and the partition class arguments string.
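
To illustrate the idea behind the proposed store statements, here is a minimal Java sketch of how a comma-separated location string could be split into output tables and how a partition class could pick one table per row. It is not Zebra's code or API: MultiOutputSketch, RowPartitioner, parseLocations and route are hypothetical names, rows are plain strings rather than Pig tuples, and the real contract for a custom partition class is defined by Zebra and is not shown in this message.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of multi-output routing; not part of Zebra or Pig. */
final class MultiOutputSketch {

    /** Hypothetical stand-in for a user-supplied partition class. */
    interface RowPartitioner {
        /** Returns the 0-based index of the output table for the given row. */
        int partition(String row, int numOutputs);
    }

    /** Splits a location string such as "loc1,loc2,loc3" into individual paths. */
    static List<String> parseLocations(String locations) {
        List<String> result = new ArrayList<>();
        for (String part : locations.split(",")) {
            String trimmed = part.trim();
            if (!trimmed.isEmpty()) {
                result.add(trimmed);
            }
        }
        return result;
    }

    /** Picks the output location for a row and rejects out-of-range indexes. */
    static String route(String row, List<String> outputs, RowPartitioner partitioner) {
        int index = partitioner.partition(row, outputs.size());
        if (index < 0 || index >= outputs.size()) {
            throw new IllegalArgumentException("partition index out of range: " + index);
        }
        return outputs.get(index);
    }

    public static void main(String[] args) {
        List<String> outputs = parseLocations("loc1,loc2,loc3");
        // Assumed example policy: route each row by its length modulo the table count.
        RowPartitioner byLength = (row, n) -> row.length() % n;
        for (String row : new String[] {"a", "bb", "ccc"}) {
            System.out.println(row + " -> " + route(row, outputs, byLength));
        }
    }
}
```

In this sketch the order of the locations in the string determines which index maps to which table, so a partitioner and the store statement would have to agree on that order.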

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

