Mailing-List: contact issues-help@hive.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@hive.apache.org
Date: Fri, 5 Feb 2016 01:34:39 +0000 (UTC)
From: "Charles Pritchard (JIRA)" <jira@apache.org>
To: issues@hive.apache.org
Message-ID: <JIRA.12717467.1401425221000.307031.1454636079792@Atlassian.JIRA>
In-Reply-To: <JIRA.12717467.1401425221000@Atlassian.JIRA>
References: <JIRA.12717467.1401425221000@Atlassian.JIRA>
 <JIRA.12717467.1401425221920@arcas>
Subject: [jira] [Commented] (HIVE-7148) Use murmur hash to create bucketed
 tables
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit


    [ https://issues.apache.org/jira/browse/HIVE-7148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15133477#comment-15133477 ] 

Charles Pritchard commented on HIVE-7148:
-----------------------------------------

I could really use custom bucketing functions, as I want to use buckets instead of partitions based on a derived value.

> Use murmur hash to create bucketed tables
> -----------------------------------------
>
>                 Key: HIVE-7148
>                 URL: https://issues.apache.org/jira/browse/HIVE-7148
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Gunther Hagleitner
>
> HIVE-7121 introduced murmur hashing for queries that don't insert into bucketed tables. This was done to achieve better distribution of the data. The same should be done for bucketed tables as well, but this involves making sure we don't break backwards compat. This probably means that we have to store the partitioning function used in the metadata and use that to determine if SMB and bucketed map-join optimizations apply.


--
This message was sent by Atlassian JIRA
(v6.3.4#6332)