hadoop-pig-dev mailing list archives

From "Ankur (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1273) Skewed join throws error
Date Wed, 03 Mar 2010 05:45:27 GMT

    [ https://issues.apache.org/jira/browse/PIG-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840482#action_12840482 ]

Ankur commented on PIG-1273:
----------------------------

Here is a simple script to reproduce it:

a = load 'test.dat' using PigStorage() as (nums:chararray);
b = load 'join.dat' using PigStorage('\u0001') as (number:chararray,text:chararray);
c = filter a by nums == '7';
d = join c by nums LEFT OUTER, b by number USING "skewed";
dump d;

==== test.dat ====
1
2
3
4
5

===== join.dat =====
1^Aone
2^Atwo
3^Athree

where ^A denotes the Control-A character (\u0001), used as the field separator.
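
Since a literal Control-A is awkward to type in an editor, the two input files can also be generated programmatically. The following is a minimal sketch in Python, not part of the original report: the helper name write_inputs and its directory argument are hypothetical, and the file contents are copied from the listings above.

```python
from pathlib import Path

# Control-A (0x01): the same character as '\u0001' passed to PigStorage
# in the load statement for join.dat.
SEP = "\x01"


def write_inputs(directory="."):
    """Write test.dat and join.dat as listed in the report.

    write_inputs is a hypothetical helper introduced for illustration.
    Returns the paths of the two files it wrote.
    """
    out = Path(directory)
    out.mkdir(parents=True, exist_ok=True)

    # test.dat: the values 1..5, one per line. None of them is '7',
    # so the filter in the script yields an empty relation.
    test_path = out / "test.dat"
    test_path.write_text("".join(f"{n}\n" for n in range(1, 6)))

    # join.dat: number and text fields separated by Control-A.
    rows = [("1", "one"), ("2", "two"), ("3", "three")]
    join_path = out / "join.dat"
    join_path.write_text("".join(f"{num}{SEP}{text}\n" for num, text in rows))

    return test_path, join_path


if __name__ == "__main__":
    write_inputs()
```

Running it in the directory the script is launched from (assuming Pig resolves 'test.dat' and 'join.dat' relative to that location, for example in local mode) produces the two files used by the load statements.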

> Skewed join throws error 
> -------------------------
>
>                 Key: PIG-1273
>                 URL: https://issues.apache.org/jira/browse/PIG-1273
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.6.0
>            Reporter: Ankur
>
> When the sampled relation is too small or empty then skewed join fails.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

