Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 901D9200C45 for ; Tue, 14 Mar 2017 01:25:46 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id 8EBB5160B5D; Tue, 14 Mar 2017 00:25:46 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id D3F7E160B85 for ; Tue, 14 Mar 2017 01:25:45 +0100 (CET) Received: (qmail 17515 invoked by uid 500); 14 Mar 2017 00:25:45 -0000 Mailing-List: contact issues-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list issues@hive.apache.org Received: (qmail 17506 invoked by uid 99); 14 Mar 2017 00:25:45 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 14 Mar 2017 00:25:45 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id A756DC0BF8 for ; Tue, 14 Mar 2017 00:25:44 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.451 X-Spam-Level: * X-Spam-Status: No, score=1.451 tagged_above=-999 required=6.31 tests=[KAM_ASCII_DIVIDERS=0.8, RP_MATCHES_RCVD=-0.001, SPF_NEUTRAL=0.652] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id 2YSfk7GzIhUq for ; Tue, 14 Mar 2017 00:25:44 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTP id C296A5F649 for ; Tue, 14 Mar 2017 00:25:43 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 0E96EE043B for ; Tue, 14 Mar 2017 00:25:43 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id B62A1243C0 for ; Tue, 14 Mar 2017 00:25:41 +0000 (UTC) Date: Tue, 14 Mar 2017 00:25:41 +0000 (UTC) From: "Sergey Shelukhin (JIRA)" To: issues@hive.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-16151) BytesBytesHashTable allocates large arrays MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Tue, 14 Mar 2017 00:25:46 -0000 [ https://issues.apache.org/jira/browse/HIVE-16151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15923302#comment-15923302 ] Sergey Shelukhin commented on HIVE-16151: ----------------------------------------- [~gopalv] can you review this at some point? thanks [~mmccline] mentioned that you may not like this approach ;) > BytesBytesHashTable allocates large arrays > ------------------------------------------ > > Key: HIVE-16151 > URL: https://issues.apache.org/jira/browse/HIVE-16151 > Project: Hive > Issue Type: Bug > Reporter: Prasanth Jayachandran > Assignee: Sergey Shelukhin > Attachments: HIVE-16151.patch > > > These arrays cause GC pressure and also impose key count limitations on the table. Wrt the latter, we won't be able to get rid of it without a 64-bit hash function, but for now we can get rid of the former. If we need the latter we'd add murmur64 and probably account for it differently for resize (we don't want to blow up the hashtable by 4 bytes/key in the common case where #of keys is less than ~1.5B :)) -- This message was sent by Atlassian JIRA (v6.3.15#6346)