pig-dev mailing list archives

From "Daniel Dai (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (PIG-3915) MapReduce queries in Pigmix outputs different results than Pig's
Date Thu, 29 May 2014 05:15:03 GMT

     [ https://issues.apache.org/jira/browse/PIG-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daniel Dai updated PIG-3915:

       Resolution: Fixed
    Fix Version/s:     (was: 0.14.0)
           Status: Resolved  (was: Patch Available)

Ran Pigmix with the patch; all queries run and the numbers look reasonable. Patch committed to
the 0.13 branch and trunk.

It is also worth mentioning that the Pigmix data generator is broken by PIG-3967.

> MapReduce queries in Pigmix outputs different results than Pig's
> ----------------------------------------------------------------
>                 Key: PIG-3915
>                 URL: https://issues.apache.org/jira/browse/PIG-3915
>             Project: Pig
>          Issue Type: Bug
>          Components: tools
>    Affects Versions: 0.12.0
>            Reporter: Keren Ouaknine
>            Assignee: Keren Ouaknine
>             Fix For: 0.13.0
>         Attachments: PIG-3915.2.patch
> Hello,
> The Pigmix benchmark has 17 queries comparing Pig to MapReduce Java. Some of these queries
> do not output the same results in Pig and MapReduce. Looking into the outputs, it seems
> the errors reside in the MapReduce code. For example, L6 produces no output because the
> map function sends the wrong key to the reducer: "query_term" (field 3) instead of
> "timespent" (field 2). Hence an exception is thrown, and the MapReduce query produces no
> output. I am planning to submit a patch once I have fixed all the queries in MapReduce :)
> Thanks,
> Keren  
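
For readers unfamiliar with this class of bug, the following is a minimal sketch of how picking the wrong field index can make a job fail instead of producing output. It is not the Pigmix source and does not use the Hadoop API. The class name, the tab delimiter, the sample records, the zero-based field positions, and the choice of NumberFormatException as the failure are all assumptions made for illustration; only the field names and numbers ("timespent" as field 2, "query_term" as field 3) come from the report above.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the aggregation step; not the actual Pigmix L6 code.
public class WrongFieldSketch {
    // Field positions as named in the report, assumed here to be zero-based.
    static final int TIMESPENT_FIELD = 2;
    static final int QUERY_TERM_FIELD = 3;

    // Sums the numeric value found at fieldIndex across all records.
    // Records are assumed to be tab-delimited (an illustrative choice).
    static long sumField(List<String> records, int fieldIndex) {
        long total = 0;
        for (String record : records) {
            String[] fields = record.split("\t");
            // Long.parseLong throws NumberFormatException on non-numeric text,
            // which is what happens when a text field is selected by mistake.
            total += Long.parseLong(fields[fieldIndex]);
        }
        return total;
    }

    public static void main(String[] args) {
        // Made-up sample rows: user, action, timespent, query_term.
        List<String> records = new ArrayList<>();
        records.add("alice\tclick\t12\tpig latin");
        records.add("bob\tview\t30\thadoop");

        // Correct field: the sum is computed normally.
        System.out.println("timespent total: " + sumField(records, TIMESPENT_FIELD));

        // Wrong field: parsing fails, so no result is produced.
        try {
            sumField(records, QUERY_TERM_FIELD);
        } catch (NumberFormatException e) {
            System.out.println("wrong field selected: " + e.getMessage());
        }
    }
}
```

In a real MapReduce job an uncaught exception of this kind would typically fail the task rather than be caught and printed, which is consistent with the report that the query produced no output.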

This message was sent by Atlassian JIRA
