hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aihua Xu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-12574) windowing function returns incorrect result when the window size is larger than the partition size
Date Thu, 03 Dec 2015 16:03:11 GMT

    [ https://issues.apache.org/jira/browse/HIVE-12574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15037992#comment-15037992
] 

Aihua Xu commented on HIVE-12574:
---------------------------------

Patch#2: include the fix for count() and collect_set(). We need to check if rowToProcess is
negative before processing the row.

> windowing function returns incorrect result when the window size is larger than the partition
size
> --------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-12574
>                 URL: https://issues.apache.org/jira/browse/HIVE-12574
>             Project: Hive
>          Issue Type: Sub-task
>          Components: PTF-Windowing
>    Affects Versions: 2.0.0
>            Reporter: Aihua Xu
>            Assignee: Aihua Xu
>         Attachments: HIVE-12574.2.patch, HIVE-12574.patch
>
>
> In PTF windowing, when the partition is small and the window size is larger than the
partition size, we are seeing incorrect result. It happens for max, min, first_value, last_value
and sum functions. 
> {noformat}
> CREATE TABLE sdy1(
> ord int,
> type string);
> {noformat}
> The data is:
> {noformat}
> 2 a
> 3 a
> 1 a 
> {noformat}
> The result is as follows for the query {{select ord, min(ord) over (partition by type
order by ord rows between 1 preceding and 7 following)}}
> {noformat}
> 1 1
> 2 1
> 3 1 
> {noformat}
> The expected result is:
> {noformat}
> 1 1
> 2 1
> 3 2
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message