hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael Niv (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-1100) PIG hangs on second call to DUMP or STORE
Date Thu, 19 Nov 2009 20:13:39 GMT

     [ https://issues.apache.org/jira/browse/PIG-1100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Michael Niv updated PIG-1100:
-----------------------------

    Description: 
pig hangs on the last line on the script below when I run with -x local. It runs fine when
run on hadoop.
see attached tarball for repro details. 
I initially ran into a problem which involved cogrouping two things like id_docdate_s1 below,
but this is what I came up with while tightening down my bugreport.
Thanks in advance!  -MN

-- bugrep.pig

DEFINE get_doc_date `document-date.pl`;

id_text1 = LOAD 'v2.txt' AS (id,text);
id_docdate1 = STREAM id_text1 THROUGH  get_doc_date AS (id,docdate);
id_docdate_s1 = ORDER id_docdate1 BY docdate;
store id_docdate_s1 into 'f1.out';

id_text2 = LOAD 'v2.txt' AS (id,text);
id_docdate2 = STREAM id_text2 THROUGH  get_doc_date AS (id,docdate);
id_docdate_s2 = ORDER id_docdate2 BY docdate;
store id_docdate_s2 into 'f2.out';    -- second store call hangs pig


  was:
pig hangs on the last line on the script below when I run with -x local. It runs fine when
run on hadoop.
Happy to provide the files used in bugrep.pig below: v2.txt and document-date.pl (michaelniv@gmail.com)
I initially ran into a problem which involved cogrouping two things like id_docdate_s1 below,
but this is what I came up with while tightening down my bugreport.
Thanks in advance.

-- bugrep.pig

DEFINE get_doc_date `document-date.pl`;

id_text1 = LOAD 'v2.txt' AS (id,text);
id_docdate1 = STREAM id_text1 THROUGH  get_doc_date AS (id,docdate);
id_docdate_s1 = ORDER id_docdate1 BY docdate;
store id_docdate_s1 into 'f1.out';

id_text2 = LOAD 'v2.txt' AS (id,text);
id_docdate2 = STREAM id_text2 THROUGH  get_doc_date AS (id,docdate);
id_docdate_s2 = ORDER id_docdate2 BY docdate;
store id_docdate_s2 into 'f2.out';    -- second store call hangs pig



> PIG hangs on second call to DUMP or STORE
> -----------------------------------------
>
>                 Key: PIG-1100
>                 URL: https://issues.apache.org/jira/browse/PIG-1100
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.5.0
>         Environment: Linux mniv-laptop 2.6.24-25-generic #1 SMP Tue Oct 20 07:31:10 UTC
2009 i686 GNU/Linux
> java version "1.6.0_16"
> Java(TM) SE Runtime Environment (build 1.6.0_16-b01)
> Java HotSpot(TM) Server VM (build 14.2-b01, mixed mode)
> Apache Pig version 0.5.0 (r829623) 
> Hadoop 0.20.1
> Subversion http://svn.apache.org/repos/asf/hadoop/common/tags/release-0.20.1-rc1 -r 810220
>            Reporter: Michael Niv
>         Attachments: bugrep.tar
>
>
> pig hangs on the last line on the script below when I run with -x local. It runs fine
when run on hadoop.
> see attached tarball for repro details. 
> I initially ran into a problem which involved cogrouping two things like id_docdate_s1
below, but this is what I came up with while tightening down my bugreport.
> Thanks in advance!  -MN
> -- bugrep.pig
> DEFINE get_doc_date `document-date.pl`;
> id_text1 = LOAD 'v2.txt' AS (id,text);
> id_docdate1 = STREAM id_text1 THROUGH  get_doc_date AS (id,docdate);
> id_docdate_s1 = ORDER id_docdate1 BY docdate;
> store id_docdate_s1 into 'f1.out';
> id_text2 = LOAD 'v2.txt' AS (id,text);
> id_docdate2 = STREAM id_text2 THROUGH  get_doc_date AS (id,docdate);
> id_docdate_s2 = ORDER id_docdate2 BY docdate;
> store id_docdate_s2 into 'f2.out';    -- second store call hangs pig

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message