Mailing-List: contact dev-help@pig.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@pig.apache.org
Date: Thu, 17 Feb 2011 21:00:15 +0000 (UTC)
From: "Richard Ding (JIRA)"
To: pig-dev@hadoop.apache.org
Message-ID: <1397252899.50.1297976415277.JavaMail.tomcat@hel.zones.apache.org>
In-Reply-To: <4588706.260931293001321751.JavaMail.jira@thor>
Subject: [jira] Updated: (PIG-1779) Wrong stats shown when there are multiple loads but same file names
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit

[
https://issues.apache.org/jira/browse/PIG-1779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Richard Ding updated PIG-1779:
------------------------------

    Status: Patch Available (was: Open)

> Wrong stats shown when there are multiple loads but same file names
> -------------------------------------------------------------------
>
>                 Key: PIG-1779
>                 URL: https://issues.apache.org/jira/browse/PIG-1779
>             Project: Pig
>          Issue Type: Bug
>          Components: tools
>    Affects Versions: 0.8.0
>            Reporter: Vivek Padmanabhan
>            Assignee: Richard Ding
>             Fix For: 0.9.0
>
>         Attachments: PIG-1779.patch
>
>
> In Pig 0.8, the stats show wrong information whenever I have multiple loads and the file names are similar.
>
> a) Problem 1
> Sample script:
> A = LOAD 'myfolder/tryme' AS (f1);
> B = LOAD 'myfolder/anotherfolder/tryme' AS (f2);
> C = JOIN A BY f1, B BY f2;
> DUMP C;
> Here I have 10 records for A and 3 records for B, but Pig says:
> Successfully read 6 records from: "/myfolder/anotherfolder/tryme"
> Successfully read 6 records from: "myfolder/tryme"
>
> b) Problem 2
> A = LOAD 'myfolder/tryme' AS (f1);
> B = LOAD 'myfolder/an1111otherfolder/tryme' AS (f2);
> C = JOIN A BY f1, B BY f2;
> DUMP C;
> Here there is no folder named an1111otherfolder, while "myfolder/tryme" exists. But Pig says:
> Failed to read data from "/myfolder/an1111otherfolder/tryme"
> Failed to read data from "/myfolder/tryme"

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira