hadoop-pig-dev mailing list archives

From "Olga Natkovich (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-257) Allow usage of custom Hadoop InputFormat in Pig
Date Wed, 11 Jun 2008 21:12:45 GMT

    [ https://issues.apache.org/jira/browse/PIG-257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12604342#action_12604342 ]

Olga Natkovich commented on PIG-257:

Pi, how would InputFormat be specified?

> Allow usage of custom Hadoop InputFormat in Pig
> -----------------------------------------------
>                 Key: PIG-257
>                 URL: https://issues.apache.org/jira/browse/PIG-257
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Pi Song
> This very cool idea sprang out of a discussion on the mailing list (thanks, Manish Shah).
> There is a semantic issue: a Hadoop InputFormat generally expects K,V, but Pig expects a
> Tuple. We can solve this by sticking K,V as fields in a Tuple.
> Provided that we've got rich built-in string/binary manipulation functions, Hadoop users
> shouldn't find it too costly to use Pig. This should definitely help accelerate Pig adoption.
> After a brief look at the current code, this new feature will require changes in the
> MapReduce execution engine, so I will wait until the type branch is complete before starting
> work on this (if nobody expresses interest in doing it :) ).
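
The K,V-to-Tuple idea in the description above can be illustrated with a minimal, language-neutral sketch. This is not Pig or Hadoop code: the function name `records_to_tuples` and the sample records are hypothetical, and a plain Python tuple stands in for a Pig Tuple. It only shows the proposed shape, where field 0 holds the key and field 1 holds the value, after which ordinary string functions can work on the value field.

```python
from typing import Any, Iterable, Iterator, Tuple


def records_to_tuples(records: Iterable[Tuple[Any, Any]]) -> Iterator[Tuple[Any, Any]]:
    """Wrap each (key, value) record as a two-field tuple.

    Hypothetical helper for illustration only: field 0 is the key,
    field 1 is the value, mirroring "sticking K,V as fields in a Tuple".
    """
    for key, value in records:
        yield (key, value)


# Hypothetical records, shaped like what a line-oriented input format
# might emit: key = byte offset of the line, value = the line's text.
sample_records = [(0, "alice\t30"), (9, "bob\t25")]

tuples = list(records_to_tuples(sample_records))

# Downstream string manipulation then picks apart the value field.
names = [value.split("\t")[0] for _key, value in tuples]
```

In this sketch the key is carried along even when a script ignores it, so no information from the input format is lost.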

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
