hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ashutosh Chauhan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-5023) Hive get wrong result when partition has the same path but different schema or authority
Date Tue, 13 Aug 2013 05:58:48 GMT

    [ https://issues.apache.org/jira/browse/HIVE-5023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13737870#comment-13737870
] 

Ashutosh Chauhan commented on HIVE-5023:
----------------------------------------

+1
                
> Hive get wrong result when partition has the same path but different schema or authority
> ----------------------------------------------------------------------------------------
>
>                 Key: HIVE-5023
>                 URL: https://issues.apache.org/jira/browse/HIVE-5023
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Shuaishuai Nie
>            Assignee: Shuaishuai Nie
>         Attachments: HIVE-5023.1.patch, HIVE-5023.2.patch
>
>
> Hive does not differentiate scheme and authority in file uris which cause wrong result
when partition has the same path but different schema or authority. Here is a simple repro
> partition file path:
> asv://container1@secondary1.blob.core.windows.net/2013-08-05/00/text1.txt
> with content "2013-08-05 00:00:00"
> asv://container2@secondary1.blob.core.windows.net/2013-08-05/00/text2.txt
> with content "2013-08-05 00:00:20"
> {noformat}
> CREATE EXTERNAL TABLE IF NOT EXISTS T1 (t STRING) PARTITIONED BY (ProcessDate STRING,
Hour STRING, ClusterName STRING) ROW FORMAT DELIMITED FIELDS TERMINATED by '\t' STORED AS
TEXTFILE;
> ALTER TABLE T1 DROP IF EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName
='CLusterA');
> ALTER TABLE T1 ADD IF NOT EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName
='ClusterA') LOCATION 'asv://container1@secondary1.blob.core.windows.net/2013-08-05/00';
> ALTER TABLE T1 DROP IF EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName
='ClusterB');
> ALTER TABLE T1 ADD IF NOT EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName
='ClusterB') LOCATION 'asv://container2@secondary1.blob.core.windows.net/2013-08-05/00';
> {noformat}
> the expect output of the hive query
> {noformat}
> SELECT ClusterName, t FROM T1 WHERE ProcessDate=’2013-08-05’ AND Hour=’00’;
> {noformat}
> should be
> {noformat}
> ClusterA        2013-08-05 00:00:00
> ClusterB        2013-08-05 00:00:20
> {noformat}
> However it is
> {noformat}
> ClusterA        2013-08-05 00:00:00
> ClusterA        2013-08-05 00:00:20
> {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message