Return-Path: Delivered-To: apmail-hadoop-pig-dev-archive@www.apache.org Received: (qmail 44939 invoked from network); 28 Oct 2009 02:12:23 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 28 Oct 2009 02:12:23 -0000 Received: (qmail 92970 invoked by uid 500); 28 Oct 2009 02:12:23 -0000 Delivered-To: apmail-hadoop-pig-dev-archive@hadoop.apache.org Received: (qmail 92949 invoked by uid 500); 28 Oct 2009 02:12:23 -0000 Mailing-List: contact pig-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: pig-dev@hadoop.apache.org Delivered-To: mailing list pig-dev@hadoop.apache.org Received: (qmail 92939 invoked by uid 99); 28 Oct 2009 02:12:23 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 28 Oct 2009 02:12:23 +0000 X-ASF-Spam-Status: No, hits=-10.5 required=5.0 tests=AWL,BAYES_00,RCVD_IN_DNSWL_HI X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 28 Oct 2009 02:12:19 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 963FD234C48C for ; Tue, 27 Oct 2009 19:11:59 -0700 (PDT) Message-ID: <1065484545.1256695919614.JavaMail.jira@brutus> Date: Wed, 28 Oct 2009 02:11:59 +0000 (UTC) From: "Pradeep Kamath (JIRA)" To: pig-dev@hadoop.apache.org Subject: [jira] Updated: (PIG-953) Enable merge join in pig to work with loaders and store functions which can internally index sorted data In-Reply-To: <122390380.1252608537482.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/PIG-953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pradeep Kamath updated PIG-953: ------------------------------- Attachment: PIG-953-8.patch > Enable merge join in pig to work with loaders and store functions which can internally index sorted data > --------------------------------------------------------------------------------------------------------- > > Key: PIG-953 > URL: https://issues.apache.org/jira/browse/PIG-953 > Project: Pig > Issue Type: Improvement > Affects Versions: 0.3.0 > Reporter: Pradeep Kamath > Assignee: Pradeep Kamath > Attachments: PIG-953-2.patch, PIG-953-3.patch, PIG-953-4.patch, PIG-953-5.patch, PIG-953-6.patch, PIG-953-7.patch, PIG-953-8.patch, PIG-953.patch > > > Currently merge join implementation in pig includes construction of an index on sorted data and use of that index to seek into the "right input" to efficiently perform the join operation. Some loaders (notably the zebra loader) internally implement an index on sorted data and can perform this seek efficiently using their index. So the use of the index needs to be abstracted in such a way that when the loader supports indexing, pig uses it (indirectly through the loader) and does not construct an index. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.