hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "John DeTreville (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-211) Replicating small tables for joins
Date Thu, 17 Apr 2008 23:47:21 GMT
Replicating small tables for joins
----------------------------------

                 Key: PIG-211
                 URL: https://issues.apache.org/jira/browse/PIG-211
             Project: Pig
          Issue Type: New Feature
          Components: data
            Reporter: John DeTreville
            Priority: Minor


Joining a table A with a small table B can be disproportionately expensive if A must be sorted
before the join, and the result must be sorted again. This effort can often be reduced or
eliminated if table B is replicated in whole to all nodes.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message