hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Pradeep Kamath (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-1063) Pig does not call checkOutSpecs() on OutputFormat provided by StoreFunc in the multistore case
Date Thu, 29 Oct 2009 23:02:05 GMT

     [ https://issues.apache.org/jira/browse/PIG-1063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Pradeep Kamath updated PIG-1063:
--------------------------------

    Attachment: PIG-1063.patch

Attached patch fixes the issue. The fix is in PigOutputFormat.checkOutputSpecs(), we now check
if there are any stores in the map or reduce plan which provide a non null OutputFormat as
return value of  getStorePreparationClass(). In this case, the checkOutputSpecs() method on
the OutputFormat of the store is called.

> Pig does not call checkOutSpecs() on OutputFormat provided by StoreFunc in the multistore
case
> ----------------------------------------------------------------------------------------------
>
>                 Key: PIG-1063
>                 URL: https://issues.apache.org/jira/browse/PIG-1063
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.4.0
>            Reporter: Pradeep Kamath
>            Assignee: Pradeep Kamath
>         Attachments: PIG-1063.patch
>
>
> A StoreFunc implementation can inform pig of an OutputFormat it uses through the getStoragePreparationClass()
method. In a query with multiple stores which gets optimized into a single mapred job, Pig
does not call the checkOutputSpecs() method on the outputformat. An example of such a script
is:
> {noformat}
> a = load 'input.txt';
> b = filter a by $0 < 10;
> store b into 'output1' using StoreWithOutputFormat();
> c = group a by $0;
> d = foreach c generate group, COUNT(a.$0);
> store d into 'output2' using StoreWithOutputFormat();
> {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message