hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vikram Dixit K (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-8597) SMB join small table side should use the same set of serialized payloads across tasks
Date Fri, 24 Oct 2014 23:59:33 GMT

    [ https://issues.apache.org/jira/browse/HIVE-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14183755#comment-14183755
] 

Vikram Dixit K commented on HIVE-8597:
--------------------------------------

LGTM +1. +1 for 0.14 as well.

> SMB join small table side should use the same set of serialized payloads across tasks
> -------------------------------------------------------------------------------------
>
>                 Key: HIVE-8597
>                 URL: https://issues.apache.org/jira/browse/HIVE-8597
>             Project: Hive
>          Issue Type: Improvement
>          Components: Tez
>    Affects Versions: 0.14.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>             Fix For: 0.14.0
>
>         Attachments: HIVE-8597.1.patch
>
>
> Each task sees all splits belonging to the bucket being processed by the task. At the
moment, we end up using different instances of the same serialized split which adds unnecessary
memory pressure.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message