hive-dev mailing list archives

From Børge Svingen (JIRA) <j...@apache.org>
Subject [jira] [Resolved] (HIVE-5237) Incorrect group-by aggregation in 0.11.0
Date Fri, 13 Sep 2013 09:25:59 GMT

     [ https://issues.apache.org/jira/browse/HIVE-5237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Børge Svingen resolved HIVE-5237.
---------------------------------

    Resolution: Duplicate
    
> Incorrect group-by aggregation in 0.11.0
> ----------------------------------------
>
>                 Key: HIVE-5237
>                 URL: https://issues.apache.org/jira/browse/HIVE-5237
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.11.0
>            Reporter: Børge Svingen
>            Priority: Critical
>
> A group by over a subquery that itself contains a group by does not aggregate its results correctly in Hive 0.11.0.
> To reproduce:
> Put the file
> {code}
> 1,b
> 2,c
> 2,b
> 3,a
> 3,c
> 4,a
> {code}
> in HDFS, and run
> {code}
> create external table abc (x int, y string) row format delimited fields terminated by ',' location '/data/';
> {code}
> The query
> {code}
> select
>         x,
>         count(*)
> from
> (select
>         x,
>         y
> from
>         abc
> group by
>         x,
>         y
> ) a
> group by
>         x;
> {code}
> will then give the result
> {code}
> 2	1
> 3	1
> 2	1
> 4	1
> 3	1
> 1	1
> {code}
> instead of the correct
> {code}
> 1	1
> 2	2
> 3	2
> 4	1
> {code}
> The same query returns the correct result in 0.9.0 and 0.10.0.
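
As a cross-check of the expected result quoted above, here is a minimal sketch that runs the same query against the six sample rows in SQLite, using Python's standard sqlite3 module. SQLite is only a stand-in for a conforming SQL engine and says nothing about Hive's behaviour. The function name expected_counts and the added "order by x" (for deterministic output order) are assumptions made for illustration and are not part of the original report.

```python
import sqlite3

# Rows from the sample file in the report, as (x, y) pairs.
ROWS = [(1, "b"), (2, "c"), (2, "b"), (3, "a"), (3, "c"), (4, "a")]

# The query from the report. The "order by x" is an addition (assumption)
# so that the result order is deterministic.
QUERY = """
select x, count(*)
from (select x, y from abc group by x, y) a
group by x
order by x
"""


def expected_counts(rows):
    """Run the report's query in an in-memory SQLite database (not Hive)."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("create table abc (x integer, y text)")
        conn.executemany("insert into abc values (?, ?)", rows)
        return conn.execute(QUERY).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    # Print in the same tab-separated layout the report uses.
    for x, n in expected_counts(ROWS):
        print(f"{x}\t{n}")
```

The inner group by collapses the input to distinct (x, y) pairs, and the outer group by counts those pairs per x, which is why x = 2 and x = 3 should each yield 2 rather than two separate rows of 1.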

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
