hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Johannes Zillmann (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-110) Jobs using the combiner and flatten of group keys produce wrong results
Date Mon, 25 Feb 2008 01:34:52 GMT

     [ https://issues.apache.org/jira/browse/PIG-110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Johannes Zillmann updated PIG-110:
----------------------------------

    Attachment: PIG-110_test_v2.patch

Attached test in version 2 with corrected assertions.

> Jobs using the combiner and flatten of group keys produce wrong results
> -----------------------------------------------------------------------
>
>                 Key: PIG-110
>                 URL: https://issues.apache.org/jira/browse/PIG-110
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.0.0
>            Reporter: Alan Gates
>            Assignee: Antonio Magnaghi
>         Attachments: PIG-110_test.patch, PIG-110_test_v2.patch
>
>
> A job such as the following:
> a = load 'mydata';
> b = group a by ($0, $1);
> c = foreach b generate flatten(group), COUNT($1)
> currently produces wrong results.  Instead of returning the count in the 3rd column,
it returns a repeat of the value in the second column.  If the combiner is forced off, this
does not occur, so I assume this is in the combiner logic somewhere.
> This bug was introduced in revision 617338 (the checkin for PIG-32).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message