hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aihua Xu (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-15520) Improve the sum performance for Range based window
Date Thu, 12 Jan 2017 14:35:51 GMT

     [ https://issues.apache.org/jira/browse/HIVE-15520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Aihua Xu updated HIVE-15520:
----------------------------
       Resolution: Fixed
    Fix Version/s: 2.2.0
     Release Note: 
This is to improve the performance of sum function over range based windowing. 

One issue related to sum(lag(x)) over (partition by c1 order by c2 range between ...)  and
sum(lead(x)) over (partition by c1 order by c2 range between ...) has been fixed which would
produce different result. Without the patch, lag(x)/lead(x) would only consider the previous/next
row in the windowing, not within the partition, which doesn't match other databases, also
doesn't match rows based windowing. 
           Status: Resolved  (was: Patch Available)

Committed to master. Thanks Yongzhi for reviewing.

> Improve the sum performance for Range based window
> --------------------------------------------------
>
>                 Key: HIVE-15520
>                 URL: https://issues.apache.org/jira/browse/HIVE-15520
>             Project: Hive
>          Issue Type: Sub-task
>          Components: PTF-Windowing
>            Reporter: Aihua Xu
>            Assignee: Aihua Xu
>             Fix For: 2.2.0
>
>         Attachments: HIVE-15520.1.patch, HIVE-15520.2.patch, HIVE-15520.3.patch, HIVE-15520.4.patch
>
>
> Currently streaming process is not supported for range based windowing. Thus sum( x )
over (partition by y order by z) is O(n^2) running time. 
> Investigate the possibility of streaming support.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message