pig-dev mailing list archives

From "Olga Natkovich (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (PIG-2374) streaming regression with dotNext
Date Wed, 07 Dec 2011 22:16:40 GMT

    [ https://issues.apache.org/jira/browse/PIG-2374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164769#comment-13164769
] 

Olga Natkovich commented on PIG-2374:
-------------------------------------

I think Ashutosh is bringing up a really good point. We always seem to be fixing things in Pig
because, understandably, it is easier for us. However, if Hadoop is breaking the contract, they
should be fixing this, especially if we have to pay a performance penalty for it.
                
> streaming regression with dotNext
> ---------------------------------
>
>                 Key: PIG-2374
>                 URL: https://issues.apache.org/jira/browse/PIG-2374
>             Project: Pig
>          Issue Type: Bug
>         Environment: hadoop
> Apache Pig version 0.9.2.1111101150 (r1200499)
> compiled Nov 10 2011, 19:50:15
>  -bash-3.1$ hadoop version
> Hadoop 0.23.0.1111080202
> Subversion http://svn.apache.org/repos/asf/hadoop/common/branches/branch-0.23.0/hadoop-common-project/hadoop-common -r 1196973
> Compiled by hadoopqa on Tue Nov  8 02:12:04 PST 2011
> From source with checksum 4e42b2d96c899a98a8ab8c7cc23f27ae
>            Reporter: Araceli Henley
>            Assignee: Daniel Dai
>              Labels: hadoop2.0
>             Fix For: 0.9.2
>
>         Attachments: PIG-2374-1.patch
>
>
> Streaming seems to be broken in dotNext. Several tests are failing.
> In the script below, the output stored from C is clean.
> The output stored from D, which is streamed through CMD, contains control characters in
> some of the records.
> define CMD `perl GroupBy.pl '\t' 0` ship('/homes/monster/pigtest/pigtest_next/pigharness/dist/pig_harness/libexec/PigTest/GroupBy.pl');
> A = load '/user/user1/pig/tests/data/singlefile/studenttab10k';
> B = group A by $0;
> C = foreach B generate flatten(A);
> D = stream C through CMD;
> store C into '/user/user1/pig/out/user1.1321117428/ComputeSpec_7_C.out';
> store D into '/user/user1/pig/out/user1.1321117428/ComputeSpec_7_D.out';
> Other streaming tests that fail with control characters:
> TEST FAILED <ComputeSpec_7>
> TEST FAILED <ComputeSpec_8>
> TEST FAILED <ComputeSpec_10>
> TEST FAILED <ComputeSpec_11>
> TEST FAILED <ComputeSpec_12>
> TEST FAILED <JobManagement_2>
> TEST FAILED <JobManagement_3>
> TEST FAILED <StreamingIO_4>
> TEST FAILED <NonStreaming_1>
> TEST FAILED <MultiQuery_21>
> ...
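
One way to narrow down which records of the D output are affected is to scan a local copy of the stored part files for control characters. The following is only a minimal sketch in Python, not part of the Pig test harness: the function names are hypothetical, and it assumes the output has already been copied out of HDFS to a local text file and that tab is the only control character expected inside a record.

```python
import re

# Control characters other than tab, newline, and carriage return.
# Assumption: tab is the field delimiter, so it is expected in clean output.
_CONTROL = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\x7f]")


def find_control_chars(lines):
    """Return (line_number, repr(line)) for every line holding a control character."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if _CONTROL.search(line):
            hits.append((number, repr(line)))
    return hits


def check_file(path):
    """Scan a local copy of one output part file (hypothetical helper)."""
    # newline="" keeps line endings untouched so stray carriage returns stay visible in repr().
    with open(path, "r", encoding="utf-8", errors="replace", newline="") as handle:
        return find_control_chars(handle)
```

Running check_file on a part file from the C output and on one from the D output should show whether the stray characters appear only after the stream step, and on which records.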

