hadoop-pig-dev mailing list archives

From "Daniel Dai (JIRA)" <j...@apache.org>
Subject [jira] Closed: (PIG-257) Allow usage of custom Hadoop InputFormat in Pig
Date Fri, 14 May 2010 06:46:54 GMT

     [ https://issues.apache.org/jira/browse/PIG-257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daniel Dai closed PIG-257.
--------------------------


> Allow usage of custom Hadoop InputFormat in Pig
> -----------------------------------------------
>
>                 Key: PIG-257
>                 URL: https://issues.apache.org/jira/browse/PIG-257
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Pi Song
>             Fix For: 0.7.0
>
>
> This very cool idea came out of a discussion on the mailing list (thanks, Manish Shah).
> There is a semantic mismatch: a Hadoop InputFormat generally produces key/value (K,V)
> pairs, but Pig expects a Tuple. We can solve this by storing K and V as fields in a Tuple.
> Provided that we have rich built-in string/binary manipulation functions, Hadoop users
> shouldn't find it too costly to use Pig. This should definitely help accelerate Pig
> adoption.
> After a brief look at the current code, this new feature will require changes in the
> MapReduce execution engine, so I will wait until the type branch is complete before
> starting work on this (if nobody else expresses interest in doing it :) ).
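
The description above proposes carrying a Hadoop key/value pair as two fields of a Pig Tuple. A minimal sketch of that mapping follows, in plain Java with no Hadoop or Pig dependency. The class name KeyValueTupleSketch, the method toTuple, and the use of a List<Object> as a stand-in for a Pig Tuple are all hypothetical and chosen only for illustration; this is not the implementation that shipped in Pig 0.7.0.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: a List<Object> stands in for a Pig Tuple,
// and the (key, value) arguments stand in for one record from a Hadoop reader.
final class KeyValueTupleSketch {

    // Place the key in field 0 and the value in field 1 of a two-field tuple.
    static List<Object> toTuple(Object key, Object value) {
        List<Object> fields = new ArrayList<>(2);
        fields.add(key);
        fields.add(value);
        return fields;
    }

    public static void main(String[] args) {
        // Sample records shaped like (byte offset, line of text); the data is made up.
        Object[][] records = {
            {0L, "first line"},
            {11L, "second line"},
        };
        for (Object[] record : records) {
            List<Object> tuple = toTuple(record[0], record[1]);
            System.out.println(tuple);
        }
    }
}
```

Under this assumption, a script downstream would address the key as the first field and the value as the second, which is where built-in string/binary functions would come in.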

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

