hadoop-pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alan Gates (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (PIG-209) Indexes for accelerating joins
Date Fri, 15 Jan 2010 06:02:54 GMT

     [ https://issues.apache.org/jira/browse/PIG-209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Alan Gates resolved PIG-209.

    Resolution: Won't Fix

At this point Pig is relying on storage formats such as Zebra to do indexing.  We have no
near term plans to provide indexing inside Pig itself.

> Indexes for accelerating joins
> ------------------------------
>                 Key: PIG-209
>                 URL: https://issues.apache.org/jira/browse/PIG-209
>             Project: Pig
>          Issue Type: New Feature
>          Components: data
>            Reporter: John DeTreville
> Computing the inner join of a very large table (i.e., bag or mapping) with a smaller
table can take time proportional to the size of the very large table. This time required can
be greatly reduced if the very large table is indexed, taking time proportional to the size
of the smaller table. It should be possible for clients to index tables for use by future

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message