hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Richard Ding (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-1238) Dump does not respect the schema
Date Tue, 09 Mar 2010 01:47:27 GMT

     [ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Richard Ding updated PIG-1238:

    Attachment: PIG-1238.patch

Pig inserts a new Limit (or Top-K) job with one reducer after a Limit (or Top-K) job with
multiple reducers to ensure the output has the right number of records. 

In the case of Top-K, the new job must also preserve the ordering of the original job. Therefore
the sorting key must be passed to the new job. This patch moves the last ForEach statement
in above script from the original job to the new job so that the sorting keys are not removed
by the first job. 

> Dump does not respect the schema
> --------------------------------
>                 Key: PIG-1238
>                 URL: https://issues.apache.org/jira/browse/PIG-1238
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.6.0
>            Reporter: Ankur
>            Assignee: Richard Ding
>             Fix For: 0.7.0
>         Attachments: PIG-1238.patch
> For complex data type and certain sequence of operations dump produces results with non-existent
field in the relation.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message