hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth Jayachandran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-9168) Vectorized Coalesce for strings is broken
Date Fri, 19 Dec 2014 00:12:14 GMT

    [ https://issues.apache.org/jira/browse/HIVE-9168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14252605#comment-14252605
] 

Prasanth Jayachandran commented on HIVE-9168:
---------------------------------------------

LGTM, +1. Pending tests.

> Vectorized Coalesce for strings is broken
> -----------------------------------------
>
>                 Key: HIVE-9168
>                 URL: https://issues.apache.org/jira/browse/HIVE-9168
>             Project: Hive
>          Issue Type: Bug
>          Components: Vectorization
>    Affects Versions: 0.13.0, 0.14.0
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>            Priority: Critical
>             Fix For: 0.15.0, 0.14.1
>
>         Attachments: HIVE-9168.01.patch
>
>
> Vectorized Coalesce uses BytesColumnVector.setElement which does not set the output string
length correctly.
> {noformat}
> create table str_str_orc (str1 string, str2 string) stored as orc;
> insert into table str_str_orc values (null, "X"), ("0", "X"), ("1", "X"), (null, "y");
> EXPLAIN
> SELECT
>    str2, ROUND(sum(cast(COALESCE(str1, 0) as int))/60, 2) as result
> from str_str_orc
> GROUP BY str2;
> SELECT
>    str2, ROUND(sum(cast(COALESCE(str1, 0) as int))/60, 2) as result
> from str_str_orc
> GROUP BY str2;
> EXPLAIN
> SELECT COALESCE(str1, 0) as result
> from str_str_orc;
> SELECT COALESCE(str1, 0) as result
> from str_str_orc;
> {noformat}
> Produces different results when vectorized and not vectorized.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message