hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matt McCline (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-9168) Vectorized Coalesce for strings is broken
Date Thu, 18 Dec 2014 23:58:14 GMT
Matt McCline created HIVE-9168:
----------------------------------

             Summary: Vectorized Coalesce for strings is broken
                 Key: HIVE-9168
                 URL: https://issues.apache.org/jira/browse/HIVE-9168
             Project: Hive
          Issue Type: Bug
          Components: Vectorization
    Affects Versions: 0.14.0, 0.13.0
            Reporter: Matt McCline
            Assignee: Matt McCline
            Priority: Critical
             Fix For: 0.15.0, 0.14.1


Vectorized Coalesce uses BytesColumnVector.setElement which does not set the output string
length correctly.

{noformat}
create table str_str_orc (str1 string, str2 string) stored as orc;

insert into table str_str_orc values (null, "X"), ("0", "X"), ("1", "X"), (null, "y");

EXPLAIN
SELECT
   str2, ROUND(sum(cast(COALESCE(str1, 0) as int))/60, 2) as result
from str_str_orc
GROUP BY str2;

SELECT
   str2, ROUND(sum(cast(COALESCE(str1, 0) as int))/60, 2) as result
from str_str_orc
GROUP BY str2;

EXPLAIN
SELECT COALESCE(str1, 0) as result
from str_str_orc;

SELECT COALESCE(str1, 0) as result
from str_str_orc;
{noformat}

Produces different results when vectorized and not vectorized.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message