hive-dev mailing list archives

From "Ashutosh Chauhan (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-3403) user should not specify mapjoin to perform sort-merge bucketed join
Date Mon, 18 Feb 2013 18:37:12 GMT

    [ https://issues.apache.org/jira/browse/HIVE-3403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13580753#comment-13580753 ]

Ashutosh Chauhan commented on HIVE-3403:
----------------------------------------

I see. We were unnecessarily dealing with SMBJOp in the mapper that was streaming through
records for the third table of the join. Ideally, each mapper (and reducer) should have a plan
only for itself and not the global plan, i.e. the mapper for the third table shouldn't see SMBJOp at all.
But that's quite a fundamental change, given that at the moment we generate a uniform plan for
the whole MR job.
This patch has already been outstanding for more than 6 months. Let's get this in.
+1. Namit, can you commit this?
Also, would you like to take up the follow-on HIVE-3980?
                
> user should not specify mapjoin to perform sort-merge bucketed join
> -------------------------------------------------------------------
>
>                 Key: HIVE-3403
>                 URL: https://issues.apache.org/jira/browse/HIVE-3403
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: auto_sortmerge_join_1_modified.q, hive.3403.10.patch, hive.3403.11.patch,
hive.3403.12.patch, hive.3403.13.patch, hive.3403.14.patch, hive.3403.15.patch, hive.3403.16.patch,
hive.3403.17.patch, hive.3403.18.patch, hive.3403.19.patch, hive.3403.1.patch, hive.3403.21.patch,
hive.3403.22.patch, hive.3403.23.patch, hive.3403.24.patch, hive.3403.25.patch, hive.3403.26.patch,
hive.3403.27.patch, hive.3403.28.patch, hive.3403.29.patch, hive.3403.2.patch, hive.3403.3.patch,
hive.3403.4.patch, hive.3403.5.patch, hive.3403.6.patch, hive.3403.7.patch, hive.3403.8.patch,
hive.3403.9.patch
>
>
> Currently, in order to perform a sort merge bucketed join, the user needs
> to set hive.optimize.bucketmapjoin.sortedmerge to true, and also specify the 
> mapjoin hint.
> The user should not specify any hints.
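
For illustration, the pre-patch usage described in the issue might look like the HiveQL sketch below. The tables `a` and `b` and their columns are hypothetical; both are assumed to be bucketed and sorted on `key`. Only `hive.optimize.bucketmapjoin.sortedmerge` and the mapjoin hint come from the description; the other two settings are ones commonly set alongside it and are an assumption here, not something stated in this message.

```sql
-- Named in the issue description: enable sort-merge bucketed join.
SET hive.optimize.bucketmapjoin.sortedmerge = true;

-- Assumption: settings commonly enabled together with the one above.
SET hive.optimize.bucketmapjoin = true;
SET hive.input.format = org.apache.hadoop.hive.ql.io.BucketizedHiveInputFormat;

-- The mapjoin hint the user currently has to write by hand.
-- a and b are hypothetical tables, bucketed and sorted on key.
SELECT /*+ MAPJOIN(b) */ a.key, a.value, b.value
FROM a JOIN b ON a.key = b.key;
```

The aim of the issue is that the same query, written without the `/*+ MAPJOIN(b) */` hint, can still be planned as a sort-merge bucketed join.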

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
