hadoop-pig-dev mailing list archives

From "Olga Natkovich (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (PIG-257) Allow usage of custom Hadoop InputFormat in Pig
Date Tue, 22 Dec 2009 18:32:29 GMT

     [ https://issues.apache.org/jira/browse/PIG-257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Olga Natkovich resolved PIG-257.

       Resolution: Fixed
    Fix Version/s: 0.7.0

This is what we are doing with the Load-Store redesign.

> Allow usage of custom Hadoop InputFormat in Pig
> -----------------------------------------------
>                 Key: PIG-257
>                 URL: https://issues.apache.org/jira/browse/PIG-257
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Pi Song
>             Fix For: 0.7.0
> This very cool idea came out of a discussion on the mailing list (thanks, Manish Shah).
> There is a semantic issue: a Hadoop InputFormat generally expects K,V pairs, but Pig expects
> a Tuple. We can solve this by storing K and V as fields in a Tuple.
> Provided that we have rich built-in string/binary manipulation functions, Hadoop users
> shouldn't find it too costly to use Pig. This should definitely help accelerate Pig adoption.
> After a brief look at the current code, this new feature will require changes in the Map
> Reduce execution engine, so I will wait until the type branch is complete before starting
> work on this (if nobody else expresses interest in doing it :) ).
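
The K,V-to-Tuple idea in the description above can be illustrated with a small
sketch. This is not Pig or Hadoop code: it is plain Python, and the names
read_offset_line_records, to_tuple and load are hypothetical, chosen only for
illustration. It assumes a line-oriented input format whose key is the byte
offset of a line and whose value is the line text, and shows each (K, V) record
becoming a two-field tuple.

```python
from typing import Iterator, List, Tuple


def read_offset_line_records(data: bytes) -> Iterator[Tuple[int, str]]:
    """Hypothetical stand-in for a line-oriented input format.

    Yields (key, value) records where the key is the byte offset of the
    line within the input and the value is the line text without its
    line terminator.
    """
    offset = 0
    for raw in data.splitlines(keepends=True):
        yield offset, raw.rstrip(b"\r\n").decode("utf-8")
        offset += len(raw)


def to_tuple(key: int, value: str) -> Tuple[int, str]:
    """Store the key and the value as field 0 and field 1 of one tuple."""
    return (key, value)


def load(data: bytes) -> List[Tuple[int, str]]:
    """Hypothetical loader: turn every (K, V) record into a two-field tuple."""
    return [to_tuple(key, value) for key, value in read_offset_line_records(data)]


if __name__ == "__main__":
    for row in load(b"alpha\nbeta\n"):
        print(row)
```

With this shape, downstream operators can address the key and the value as
ordinary positional fields, which is the point of the proposal; how the real
Load-Store redesign exposes them is not covered by this sketch.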

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
