hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aihua Xu (JIRA)" <>
Subject [jira] [Updated] (HIVE-15520) Improve the sum performance for Range based window
Date Mon, 09 Jan 2017 19:06:58 GMT


Aihua Xu updated HIVE-15520:
    Attachment: HIVE-15520.3.patch

patch-3: continue to work on affected unit tests. One change is: before, Lead/Lag is operating
on defined window, while it's reasonable to be on partition so that it matches the databases,
like Postgres and function call like sum(lag(f)) will get more meaningful result.

> Improve the sum performance for Range based window
> --------------------------------------------------
>                 Key: HIVE-15520
>                 URL:
>             Project: Hive
>          Issue Type: Sub-task
>          Components: PTF-Windowing
>            Reporter: Aihua Xu
>            Assignee: Aihua Xu
>         Attachments: HIVE-15520.1.patch, HIVE-15520.2.patch, HIVE-15520.3.patch
> Currently streaming process is not supported for range based windowing. Thus sum( x )
over (partition by y order by z) is O(n^2) running time. 
> Investigate the possibility of streaming support.

This message was sent by Atlassian JIRA

View raw message