hive-issues mailing list archives

From "Matt McCline (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-20174) Vectorization: Fix NULL / Wrong Results issues in GROUP BY Aggregation Functions
Date Sat, 14 Jul 2018 21:58:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-20174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt McCline updated HIVE-20174:
--------------------------------
    Description: 
Write new unit tests that use random data and intentional isRepeating batches to check for
NULL and Wrong Results for vectorized aggregation functions.

 

Bugs found:

1) AVG/VARIANCE (family) in PARTIAL1 mode was returning NULL instead of count = 0, sum = 0
(all data types).  For AVG DECIMAL, only return NULL if there was an overflow.

2) AVG/MIN/MAX were not detecting repeated NULL correctly for the TIMESTAMP, INTERVAL_DAY_TIME,
and String Family.  Eliminated redundant code.

3) Fix incorrect calculation for VARIANCE (family) in PARTIAL2 and FINAL modes (HIVE-18758).

4) Fix row-mode AVG DECIMAL to enforce output type precision and scale in COMPLETE and FINAL
modes.
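
Items 1 and 2 can be illustrated with a small, self-contained sketch. It is not taken from the attached patch: LongBatch, AvgPartial, accumulate and repeated are hypothetical names, and LongBatch only imitates the isRepeating / noNulls / isNull convention of a vectorized column. The sketch assumes that a repeating batch stores its single value and NULL flag at index 0, and it shows a partial AVG that starts at count = 0, sum = 0 and stays there when every input row is NULL.

```java
public class AvgPartial1Sketch {

    // Hypothetical stand-in for a vectorized column of longs. The field names
    // follow the isRepeating / noNulls / isNull convention, but this is not
    // Hive's own class.
    static final class LongBatch {
        long[] vector;
        boolean[] isNull;
        boolean noNulls;
        boolean isRepeating;
        int size;
    }

    // Partial AVG state: starts at count = 0, sum = 0 and is never null.
    static final class AvgPartial {
        long count = 0;
        double sum = 0.0;
    }

    static void accumulate(LongBatch batch, AvgPartial acc) {
        if (batch.isRepeating) {
            // Entry 0 stands for every row, so its NULL flag must be checked
            // before the value is used.
            if (batch.noNulls || !batch.isNull[0]) {
                acc.count += batch.size;
                acc.sum += (double) batch.vector[0] * batch.size;
            }
            return;
        }
        for (int i = 0; i < batch.size; i++) {
            if (batch.noNulls || !batch.isNull[i]) {
                acc.count++;
                acc.sum += batch.vector[i];
            }
        }
    }

    // Builds a repeating batch: one stored entry that represents `size` rows.
    static LongBatch repeated(long value, boolean isNull, int size) {
        LongBatch b = new LongBatch();
        b.vector = new long[] { value };
        b.isNull = new boolean[] { isNull };
        b.noNulls = !isNull;
        b.isRepeating = true;
        b.size = size;
        return b;
    }

    public static void main(String[] args) {
        AvgPartial acc = new AvgPartial();
        accumulate(repeated(0L, true, 1024), acc);   // every row is NULL
        System.out.println("count=" + acc.count + " sum=" + acc.sum);
        accumulate(repeated(7L, false, 4), acc);     // four rows holding 7
        System.out.println("count=" + acc.count + " sum=" + acc.sum);
    }
}
```

After the all-NULL batch the partial result is still a (count, sum) pair of zeros rather than NULL, which is the behavior item 1 asks for in PARTIAL1 mode.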

 

  was:
Write new unit tests that use random data and intentional isRepeating batches to check for
NULL and Wrong Results for vectorized aggregation functions.

 

Bugs found:

1) AVG/VARIANCE (family) in PARTIAL1 mode was returning NULL instead of count = 0, sum = 0
(all data types).  For AVG DECIMAL, only return NULL if there was an overflow.

2) AVG/MIN/MAX were not detecting repeated NULL correctly for the TIMESTAMP, INTERVAL_DAY_TIME,
and String Family.  Eliminated redundant code.

3) Fix incorrect calculation for VARIANCE (family) in PARTIAL2 and FINAL modes.

4) Fix row-mode AVG DECIMAL to enforce output type precision and scale in COMPLETE and FINAL
modes.

 


> Vectorization: Fix NULL / Wrong Results issues in GROUP BY Aggregation Functions
> --------------------------------------------------------------------------------
>
>                 Key: HIVE-20174
>                 URL: https://issues.apache.org/jira/browse/HIVE-20174
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>            Priority: Critical
>         Attachments: HIVE-20174.01.patch
>
>
> Write new unit tests that use random data and intentional isRepeating batches to check
> for NULL and Wrong Results for vectorized aggregation functions.
>  
> Bugs found:
> 1) AVG/VARIANCE (family) in PARTIAL1 mode was returning NULL instead of count = 0, sum
> = 0 (all data types).  For AVG DECIMAL, only return NULL if there was an overflow.
> 2) AVG/MIN/MAX were not detecting repeated NULL correctly for the TIMESTAMP, INTERVAL_DAY_TIME,
> and String Family.  Eliminated redundant code.
> 3) Fix incorrect calculation for VARIANCE (family) in PARTIAL2 and FINAL modes (HIVE-18758).
> 4) Fix row-mode AVG DECIMAL to enforce output type precision and scale in COMPLETE and
> FINAL modes.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
