hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matt McCline (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-15573) Vectorization: ACID shuffle ReduceSink is not specialized
Date Sun, 05 Feb 2017 07:47:41 GMT

     [ https://issues.apache.org/jira/browse/HIVE-15573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Matt McCline updated HIVE-15573:
--------------------------------
    Attachment: HIVE-15573.04.patch

> Vectorization: ACID shuffle ReduceSink is not specialized 
> ----------------------------------------------------------
>
>                 Key: HIVE-15573
>                 URL: https://issues.apache.org/jira/browse/HIVE-15573
>             Project: Hive
>          Issue Type: Improvement
>          Components: Transactions, Vectorization
>    Affects Versions: 2.2.0
>            Reporter: Gopal V
>            Assignee: Matt McCline
>             Fix For: 2.2.0
>
>         Attachments: acid-test.svg, HIVE-15573.01.patch, HIVE-15573.02.patch, HIVE-15573.03.patch,
HIVE-15573.04.patch, screenshot-1.png
>
>
> The ACID shuffle disabled murmur hash for the shuffle, due to the bucketing requirements
demanding the writable hashcode for the shuffles.
> {code}
>     boolean useUniformHash = desc.getReducerTraits().contains(UNIFORM);
>     if (!useUniformHash) {
>       return false;
>     }
> {code}
> This check protects the fast ReduceSink ops from being used in ACID inserts.
> A specialized case for the following pattern will make ACID insert much faster.
> {code}
>                     Reduce Output Operator
>                       sort order: 
>                       Map-reduce partition columns: _col0 (type: bigint)
>                       value expressions:  ....
> {code}
> !screenshot-1.png!



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message