hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Szehon Ho (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-6395) multi-table insert from select transform fails if optimize.ppd enabled
Date Sat, 08 Feb 2014 05:36:19 GMT
Szehon Ho created HIVE-6395:
-------------------------------

             Summary: multi-table insert from select transform fails if optimize.ppd enabled
                 Key: HIVE-6395
                 URL: https://issues.apache.org/jira/browse/HIVE-6395
             Project: Hive
          Issue Type: Bug
          Components: Query Processor
            Reporter: Szehon Ho
         Attachments: test.py

{code}
set hive.optimize.ppd=true;
add file ./test.py;

from (select transform(test.*) using 'python ./test.py'
as id,name,state from test) t0
insert overwrite table test2 select * where state=1
insert overwrite table test3 select * where state=2;
{code}

In the above example, the select transform returns an extra column, and that column is used
in where clause of the multi-insert selects.  However, if optimize is on, the query plan is
wrong:

filter (state=1 and state=2) //impossible
--> select, insert into test1
--> select, insert into test2

The correct query plan for hive.optimize.ppd=false is:
filter (state=1)
--> select, insert into test1
filter (state=2)
--> select, insert into test2



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message