hive-issues mailing list archives

From "Zoltan Haindrich (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-17767) Rewrite correlated EXISTS/IN subqueries into LEFT SEMI JOIN
Date Wed, 08 Nov 2017 18:47:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-17767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16244529#comment-16244529 ]

Zoltan Haindrich commented on HIVE-17767:
-----------------------------------------

[~vgarg] I think you developed this on a separate branch that was forked before the "cross
product" warning stabilization, so I've added an addendum to change the order back for the
following files:
{code}
ql/src/test/results/clientpositive/perf/tez/query23.q.out
ql/src/test/results/clientpositive/perf/tez/query14.q.out
{code}


> Rewrite correlated EXISTS/IN subqueries into LEFT SEMI JOIN
> -----------------------------------------------------------
>
>                 Key: HIVE-17767
>                 URL: https://issues.apache.org/jira/browse/HIVE-17767
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Planning
>            Reporter: Vineet Garg
>            Assignee: Vineet Garg
>         Attachments: HIVE-17767.1.patch, HIVE-17767.2.patch, HIVE-17767.3.patch, HIVE-17767.4.patch, HIVE-17767.5.patch, HIVE-17767.6.patch, HIVE-17767.7.patch
>
>
> Currently such queries are rewritten into group by + inner join with a value generator, which
> is inefficient. The value generator consists of a join with the outer query to fetch all correlated
> values. It could be eliminated completely if such queries were instead rewritten
> into a LEFT SEMI JOIN.
> Note that to do this, Hive first needs to support LEFT SEMI JOIN with non-equi conditions
> (HIVE-17766).
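
For readers unfamiliar with the semantics, here is a minimal Python sketch of why a correlated EXISTS can be answered by a semi join. It is not Hive's planner code: the tables, the column layout, and the helper names (exists_filter, left_semi_join, correlated) are all hypothetical, and the nested loops only model the result set, not the execution plan. The predicate deliberately includes a non-equi comparison, which is the case the description says depends on HIVE-17766.

```python
# Hypothetical data; names and values are illustrative only.
orders = [  # outer query rows: (order_id, customer_id, amount)
    (1, "a", 10),
    (2, "b", 50),
    (3, "a", 30),
]
payments = [  # subquery rows: (customer_id, paid)
    ("a", 20),
    ("a", 60),
    ("b", 5),
]


def correlated(order, payment):
    # Correlation predicate: equality on customer plus a non-equi condition.
    return order[1] == payment[0] and payment[1] > order[2]


def exists_filter(outer, inner, pred):
    # Models: SELECT * FROM outer o WHERE EXISTS (SELECT 1 FROM inner i WHERE pred)
    return [o for o in outer if any(pred(o, i) for i in inner)]


def left_semi_join(outer, inner, pred):
    # Models a semi join: emit each outer row at most once,
    # stopping at the first matching inner row.
    result = []
    for o in outer:
        for i in inner:
            if pred(o, i):
                result.append(o)
                break
    return result


def inner_join(outer, inner, pred):
    # A plain inner join emits one row per match, so duplicates appear
    # and would need a separate de-duplication (group by) step.
    return [o for o in outer for i in inner if pred(o, i)]


print(exists_filter(orders, payments, correlated))
print(left_semi_join(orders, payments, correlated))
print(len(inner_join(orders, payments, correlated)))
```

The first two calls return the same rows, while the plain inner join produces one row per match and so needs extra work to remove duplicates. That extra work is only a loose analogy for the group by + inner join plan mentioned in the description.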



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
