hadoop-pig-dev mailing list archives

From "Gaurav Jain (JIRA)" <j...@apache.org>
Subject [jira] Commented: (PIG-1258) [zebra] Number of sorted input splits is unusually high
Date Fri, 19 Mar 2010 20:30:27 GMT

    [ https://issues.apache.org/jira/browse/PIG-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12847560#action_12847560 ]

Gaurav Jain commented on PIG-1258:
----------------------------------


+1

> [zebra] Number of sorted input splits is unusually high
> -------------------------------------------------------
>
>                 Key: PIG-1258
>                 URL: https://issues.apache.org/jira/browse/PIG-1258
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.6.0
>            Reporter: Yan Zhou
>         Attachments: PIG-1258.patch
>
>
> The number of sorted input splits is unusually high if the projections are on multiple column
groups, or a union of tables, or column group(s) that hold many small tfiles. In one test,
the number was about 100 times larger than the number of unsorted input splits on the same input tables.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

