hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Gopal V (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-20113) Shuffle avoidance: Disable 1-1 edges for sorted shuffle
Date Tue, 31 Jul 2018 20:55:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-20113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Gopal V updated HIVE-20113:
---------------------------
    Attachment: HIVE-20113.4.patch

> Shuffle avoidance: Disable 1-1 edges for sorted shuffle 
> --------------------------------------------------------
>
>                 Key: HIVE-20113
>                 URL: https://issues.apache.org/jira/browse/HIVE-20113
>             Project: Hive
>          Issue Type: Bug
>          Components: Tez
>            Reporter: Gopal V
>            Assignee: Gopal V
>            Priority: Major
>              Labels: Branch3Candidate
>         Attachments: HIVE-20113.1.patch, HIVE-20113.2.patch, HIVE-20113.3.patch, HIVE-20113.4.patch,
HIVE-20113.4.patch
>
>
> The sorted shuffle avoidance can have some issues when the shuffle data gets broken up
into multiple chunks on disk.
> The 1-1 edge cannot skip the tez final merge - there's no reason for 1-1 to have a final
merge at all, it should open a single compressed file and write a single index entry.
> Until the shuffle issue is resolved & a lot more testing, it is prudent to disable
the optimization for sorted shuffle edges and stop rewriting the RS(sorted) = = = RS(sorted)
into RS(sorted) = = = RS(FORWARD).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message