hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-2566) reduce the number map-reduce jobs for union all
Date Fri, 09 Dec 2011 06:45:40 GMT

    [ https://issues.apache.org/jira/browse/HIVE-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165885#comment-13165885
] 

Hudson commented on HIVE-2566:
------------------------------

Integrated in Hive-trunk-h0.21 #1133 (See [https://builds.apache.org/job/Hive-trunk-h0.21/1133/])
    HIVE-2634 [jira] revert HIVE-2566
(Namit Jain via Yongqiang He)

Summary:
HIVE-2634

This is leading to some problems.   

I will upload the offending testcase in a new jira.

Test Plan: EMPTY

Reviewers: JIRA, heyongqiang

Reviewed By: heyongqiang

CC: heyongqiang

Differential Revision: 651

heyongqiang : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1212182
Files : 
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Operator.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMRProcContext.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMRUnion1.java
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GenMapRedUtils.java
* /hive/trunk/ql/src/test/queries/clientpositive/union25.q
* /hive/trunk/ql/src/test/results/clientpositive/auto_join27.q.out
* /hive/trunk/ql/src/test/results/clientpositive/input25.q.out
* /hive/trunk/ql/src/test/results/clientpositive/input26.q.out
* /hive/trunk/ql/src/test/results/clientpositive/join35.q.out
* /hive/trunk/ql/src/test/results/clientpositive/lineage1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/load_dyn_part14.q.out
* /hive/trunk/ql/src/test/results/clientpositive/merge4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/ppd_union_view.q.out
* /hive/trunk/ql/src/test/results/clientpositive/ql_rewrite_gbtoidx.q.out
* /hive/trunk/ql/src/test/results/clientpositive/stats1.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union10.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union11.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union12.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union14.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union15.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union17.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union18.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union19.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union20.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union24.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union25.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union3.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union4.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union5.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union6.q.out
* /hive/trunk/ql/src/test/results/clientpositive/union7.q.out

                
> reduce the number map-reduce jobs for union all
> -----------------------------------------------
>
>                 Key: HIVE-2566
>                 URL: https://issues.apache.org/jira/browse/HIVE-2566
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.8.0, 0.9.0
>
>         Attachments: HIVE-2566.D405.1.patch, HIVE-2566.D405.2.patch, HIVE-2566.D405.3.patch
>
>
> A query like:
> select s.key, s.value from (
>   select key, value from src2  where key < 10
>   union all 
>   select key, value from src3  where key < 10
>   union all 
>   select key, value from src4  where key < 10
>   union all 
>   select key, count(1) as value from src5 group by key
> )s;
> should run the last sub-query 
> 'select key, count(1) as value from src5 group by key'
> as a map-reduce job.
> And then the union should be a map-only job reading from the first 3 map-only subqueries
> and the output of the last map-reduce job.
> The current plan is very inefficient.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message