hadoop-pig-dev mailing list archives

From "John DeTreville (JIRA)" <j...@apache.org>
Subject [jira] Created: (PIG-209) Indexes for accelerating joins
Date Thu, 17 Apr 2008 23:33:21 GMT
Indexes for accelerating joins

                 Key: PIG-209
                 URL: https://issues.apache.org/jira/browse/PIG-209
             Project: Pig
          Issue Type: New Feature
          Components: data
            Reporter: John DeTreville

Computing the inner join of a very large table (i.e., a bag or mapping) with a smaller table
can take time proportional to the size of the very large table. If the very large table is
indexed, the time required can be greatly reduced: the join can instead take time proportional
to the size of the smaller table, since each row of the smaller table can be looked up in the
index rather than matched by scanning the large table. It should be possible for clients to
index tables ahead of time for use by future joins.
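
As a rough illustration of the idea (not Pig code, and not a proposed design for this issue), the
sketch below builds an in-memory hash index over a large table once and then joins a smaller table
against it by probing the index. The function names build_index and indexed_join, the use of
dicts as rows, and the sample data are all assumptions made for the example.

```python
from collections import defaultdict


def build_index(table, key):
    """Build a hash index mapping each key value to the rows that carry it.

    This is the one-time cost, proportional to the size of the indexed table.
    """
    index = defaultdict(list)
    for row in table:
        index[row[key]].append(row)
    return index


def indexed_join(small, index, key):
    """Inner-join `small` against a prebuilt index by probing once per small row.

    The work done here depends on the size of `small` and the number of
    matches, not on the total size of the indexed table.
    """
    joined = []
    for row in small:
        # .get() avoids inserting empty entries into the defaultdict.
        for match in index.get(row[key], []):
            joined.append({**match, **row})
    return joined


# Hypothetical sample data: `large` stands in for the very large table.
large = [
    {"user": "a", "clicks": 3},
    {"user": "b", "clicks": 5},
    {"user": "a", "clicks": 7},
]
small = [
    {"user": "a", "country": "US"},
    {"user": "c", "country": "FR"},
]

index = build_index(large, "user")           # built once, reusable by later joins
result = indexed_join(small, index, "user")  # scans only `small`
```

The index is built once and can be reused by any later join on the same key, which is the
property the issue asks for. A persistent, on-disk index for tables that do not fit in memory
would need a different structure; that is outside what this sketch shows.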

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.
