hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jesus Camacho Rodriguez (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-17073) Incorrect result with vectorization and SharedWorkOptimizer
Date Tue, 11 Jul 2017 12:58:00 GMT
Jesus Camacho Rodriguez created HIVE-17073:
----------------------------------------------

             Summary: Incorrect result with vectorization and SharedWorkOptimizer
                 Key: HIVE-17073
                 URL: https://issues.apache.org/jira/browse/HIVE-17073
             Project: Hive
          Issue Type: Bug
          Components: Vectorization
    Affects Versions: 3.0.0
            Reporter: Jesus Camacho Rodriguez
            Assignee: Jesus Camacho Rodriguez


We get incorrect result with vectorization and multi-output Select operator created by SharedWorkOptimizer.
It can be reproduced in the following way.

{code:title=Correct}
select count(*) as h8_30_to_9
  from src
  join src1 on src.key = src1.key
  where src1.value = "val_278";
OK
2
{code}

{code:title=Correct}
select count(*) as h9_to_9_30
  from src
  join src1 on src.key = src1.key
  where src1.value = "val_255";
OK
2
{code}

{code:title=Incorrect}
select * from (
  select count(*) as h8_30_to_9
  from src
  join src1 on src.key = src1.key
  where src1.value = "val_278") s1
join (
  select count(*) as h9_to_9_30
  from src
  join src1 on src.key = src1.key
  where src1.value = "val_255") s2;
OK
2	0
{code}

Problem seems to be that some ds in the batch row need to be re-initialized after they have
been forwarded to each output.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message