hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ning Zhang (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HIVE-1412) CombineHiveInputFormat bug on tablesample
Date Tue, 22 Jun 2010 00:17:58 GMT

     [ https://issues.apache.org/jira/browse/HIVE-1412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ning Zhang updated HIVE-1412:
-----------------------------

    Attachment: HIVE-1412.2.patch

Added a unit test

> CombineHiveInputFormat bug on tablesample
> -----------------------------------------
>
>                 Key: HIVE-1412
>                 URL: https://issues.apache.org/jira/browse/HIVE-1412
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.6.0, 0.7.0
>            Reporter: Ning Zhang
>            Assignee: Ning Zhang
>             Fix For: 0.6.0, 0.7.0
>
>         Attachments: HIVE-1412.2.patch, HIVE-1412.patch
>
>
> CombineHiveInputFormat should combine all files inside one partition to form a split
but should not takes files cross partition boundary. This works for regular table and partitions
since all input paths are directory. However this breaks when the input is files (in which
case tablesample could be the use case). CombineHiveInputFormat should adjust to the case
when input could also be non-directories. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message