hadoop-pig-dev mailing list archives

From "Ying He (JIRA)" <j...@apache.org>
Subject [jira] Updated: (PIG-954) Skewed join fails when pig.skewedjoin.reduce.memusage is not configured
Date Fri, 11 Sep 2009 21:57:57 GMT

     [ https://issues.apache.org/jira/browse/PIG-954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ying He updated PIG-954:
------------------------

    Description: Query fails if pig.skewedjoin.reduce.memusage is not configured. (was:
Fragmented replicated join has a few limitations:
 - One of the tables needs to be loaded into memory
 - Join is limited to two tables

Skewed join partitions the table and joins the records in the reduce phase. It computes a
histogram of the key space to account for skewing in the input records. Further, it adjusts
the number of reducers depending on the key distribution.

We need to implement the skewed join in Pig.)
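
For readers unfamiliar with the idea, the histogram-based approach described
above can be sketched roughly as follows. This is only an illustration in
Python, not Pig's actual implementation: the function names, the fallback
value passed to read_memusage, and the sizing arithmetic are all assumptions
made for the example. It also shows one way a missing
pig.skewedjoin.reduce.memusage setting could fall back to a default instead
of failing.

```python
from collections import Counter
import math

PROP = "pig.skewedjoin.reduce.memusage"

def read_memusage(props, default):
    """Return the configured heap fraction, or `default` when the property
    is absent (hypothetical helper; the real fix may differ)."""
    raw = props.get(PROP)
    return float(raw) if raw is not None else default

def max_tuples(heap_bytes, memusage, avg_tuple_bytes):
    """Rough count of tuples that fit in the reducer memory budget."""
    return max(1, int(heap_bytes * memusage) // avg_tuple_bytes)

def plan_reducers(keys, total_reducers, max_tuples_per_reducer):
    """Build a histogram of the join keys and return {key: reducer_count}
    for every key too large for a single reducer."""
    histogram = Counter(keys)
    plan = {}
    for key, count in histogram.items():
        needed = math.ceil(count / max_tuples_per_reducer)
        if needed > 1:
            # Never assign more reducers than the job has.
            plan[key] = min(needed, total_reducers)
    return plan

if __name__ == "__main__":
    fraction = read_memusage({}, default=0.5)  # property not configured
    budget = max_tuples(heap_bytes=1000, memusage=fraction, avg_tuple_bytes=100)
    sample = ["a"] * 10 + ["b"] * 3 + ["c"]
    print(plan_reducers(sample, total_reducers=4, max_tuples_per_reducer=budget))
```

Keys that fit within one reducer's budget are left out of the plan and would
be partitioned normally; only the skewed keys are spread across several
reducers.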

> Skewed join fails when pig.skewedjoin.reduce.memusage is not configured
> -----------------------------------------------------------------------
>
>                 Key: PIG-954
>                 URL: https://issues.apache.org/jira/browse/PIG-954
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Ying He
>         Attachments: PIG-954.patch, PIG-954.patch2
>
>
> Query fails if pig.skewedjoin.reduce.memusage is not configured.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

