hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hadoop QA (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1408) Annotate explain plans with aliases
Date Tue, 11 May 2010 05:27:29 GMT

    [ https://issues.apache.org/jira/browse/PIG-1408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12866061#action_12866061
] 

Hadoop QA commented on PIG-1408:
--------------------------------

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12443996/PIG-1408.patch
  against trunk revision 941976.

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 45 new or modified tests.

    +1 javadoc.  The javadoc tool did not generate any warning messages.

    +1 javac.  The applied patch does not increase the total number of javac compiler warnings.

    +1 findbugs.  The patch does not introduce any new Findbugs warnings.

    +1 release audit.  The applied patch does not increase the total number of release audit
warnings.

    -1 core tests.  The patch failed core unit tests.

    +1 contrib tests.  The patch passed contrib unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/323/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/323/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: http://hudson.zones.apache.org/hudson/job/Pig-Patch-h7.grid.sp2.yahoo.net/323/console

This message is automatically generated.

> Annotate explain plans with aliases
> -----------------------------------
>
>                 Key: PIG-1408
>                 URL: https://issues.apache.org/jira/browse/PIG-1408
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: 0.7.0
>            Reporter: Richard Ding
>            Assignee: Richard Ding
>             Fix For: 0.8.0
>
>         Attachments: PIG-1408.patch
>
>
> PIG-1156 added aliases in Pig scripts to the corresponding LogicalOperators and PhysicalOperators.
The aliases in the operators, however, are not displayed in the output created by the explain
command. 
> Since a Pig script can generate many MR jobs, it will be helpful, for debugging purposes,
to annotate the explain output plans with aliases, so that users can correlate the jobs with
the statements in their scripts.
> Here is an example: given the following script
> {code}
> A = load 'input';
> B = group A by $0;
> C = foreach B generate group, flatten(A);
> explain C
> {code}
> The output without alias annotation is 
> {code}
> MapReduce node 1-28
> Map Plan
> Local Rearrange[tuple]{bytearray}(false) - 1-22
> |   |
> |   Project[bytearray][0] - 1-23
> |
> |---Load(file:///test/input:org.apache.pig.builtin.PigStorage) - 1-19--------
> Reduce Plan
> Store(fakefile:org.apache.pig.builtin.PigStorage) - 1-27
> |
> |---New For Each(false,true)[bag] - 1-26
>     |   |
>     |   Project[bytearray][0] - 1-24
>     |   |
>     |   Project[bag][1] - 1-25
>     |
>     |---Package[tuple]{bytearray} - 1-21--------
> Global sort: false
> {code} 
>    
> While the output with alias annotation will be
> {code}
> MapReduce node 1-28
> Map Plan
> B: Local Rearrange[tuple]{bytearray}(false) - 1-22
> |   |
> |   Project[bytearray][0] - 1-23
> |
> |---A: Load(file:///test/input:org.apache.pig.builtin.PigStorage) - 1-19--------
> Reduce Plan
> C: Store(fakefile:org.apache.pig.builtin.PigStorage) - 1-27
> |
> |---C: New For Each(false,true)[bag] - 1-26
>     |   |
>     |   Project[bytearray][0] - 1-24
>     |   |
>     |   Project[bag][1] - 1-25
>     |
>     |---B: Package[tuple]{bytearray} - 1-21--------
> Global sort: false
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message