hive-issues mailing list archives

From "Charles Pritchard (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-7148) Use murmur hash to create bucketed tables
Date Fri, 05 Feb 2016 01:34:39 GMT

    [ https://issues.apache.org/jira/browse/HIVE-7148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15133477#comment-15133477 ]

Charles Pritchard commented on HIVE-7148:
-----------------------------------------

I could really use custom bucketing functions, as I want to use buckets instead of partitions
based on a derived value.

> Use murmur hash to create bucketed tables
> -----------------------------------------
>
>                 Key: HIVE-7148
>                 URL: https://issues.apache.org/jira/browse/HIVE-7148
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Gunther Hagleitner
>
> HIVE-7121 introduced murmur hashing for queries that don't insert into bucketed tables.
> This was done to achieve better distribution of the data. The same should be done for bucketed
> tables as well, but this involves making sure we don't break backwards compat. This probably
> means that we have to store the partitioning function used in the metadata and use that to
> determine if SMB and bucketed map-join optimizations apply.
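
To illustrate the compatibility concern in the description, here is a minimal sketch of recording the bucketing function in table metadata and checking it before a bucketed join. It is not Hive's implementation: the names `bucket_for`, `bucketed_join_allowed`, and the `bucket_function` metadata key are hypothetical, the "legacy" hash is a simplified Java-style string hash, and CRC32 is used only as a stand-in for a second hash function (it is not murmur).

```python
import zlib


def java_style_hash(key: str) -> int:
    """Simplified Java-style string hash (31*h + c), kept to 32 bits."""
    h = 0
    for ch in key:
        h = (31 * h + ord(ch)) & 0xFFFFFFFF
    return h


def crc32_hash(key: str) -> int:
    """Stand-in for a newer hash function; CRC32, not murmur."""
    return zlib.crc32(key.encode("utf-8")) & 0xFFFFFFFF


# Hypothetical registry: the name stored in table metadata selects the function.
HASH_FUNCTIONS = {
    "legacy": java_style_hash,
    "crc32_standin": crc32_hash,
}


def bucket_for(key: str, num_buckets: int, function_name: str) -> int:
    """Map a key to a bucket using the function recorded for the table."""
    h = HASH_FUNCTIONS[function_name](key)
    # Mask to a non-negative 31-bit value before taking the modulus.
    return (h & 0x7FFFFFFF) % num_buckets


def bucketed_join_allowed(left_meta: dict, right_meta: dict) -> bool:
    """Assumed rule: a bucketed join is only safe when both tables used the
    same bucketing function and one bucket count is a multiple of the other."""
    same_function = left_meta["bucket_function"] == right_meta["bucket_function"]
    big = max(left_meta["num_buckets"], right_meta["num_buckets"])
    small = min(left_meta["num_buckets"], right_meta["num_buckets"])
    return same_function and big % small == 0


old_table = {"bucket_function": "legacy", "num_buckets": 4}
new_table = {"bucket_function": "crc32_standin", "num_buckets": 4}
print(bucketed_join_allowed(old_table, old_table))
print(bucketed_join_allowed(old_table, new_table))
```

The point of the sketch is that the same key can land in different buckets under different functions, so tables written with the old function cannot be assumed to line up with tables written with a new one unless the metadata says which function was used.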



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
