hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sergey Shelukhin (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-17783) Hybrid Grace Hash Join has performance degradation for N-way join using Hive on Tez
Date Fri, 13 Oct 2017 01:53:01 GMT

    [ https://issues.apache.org/jira/browse/HIVE-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16202933#comment-16202933
] 

Sergey Shelukhin commented on HIVE-17783:
-----------------------------------------

I think we can disable it by default  
The main motivation was actually avoiding OOMs as far as I understand. I don't thin anyone
is working on perf improvements right now.
cc [~gopalv]

> Hybrid Grace Hash Join has performance degradation for N-way join using Hive on Tez
> -----------------------------------------------------------------------------------
>
>                 Key: HIVE-17783
>                 URL: https://issues.apache.org/jira/browse/HIVE-17783
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 2.2.0
>         Environment: 8*Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz
> 1 master + 7 workers
> TPC-DS at 3TB data scales
> Hive version : 2.2.0
>            Reporter: Ferdinand Xu
>         Attachments: Hybrid_Grace_Hash_Join.xlsx, screenshot-1.png
>
>
> Most configurations are using default value. And the benchmark is to test enabling against
disabling hybrid grace hash join using TPC-DS queries at 3TB data scales. Many queries related
to N-way join has performance degradation over three times test. Detailed result  is attached.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message