Date: Mon, 17 Mar 2014 18:39:44 +0000 (UTC)
From: "Jitendra Nath Pandey (JIRA)"
To: hive-dev@hadoop.apache.org
Reply-To: dev@hive.apache.org
Mailing-List: contact dev-help@hive.apache.org; run by ezmlm
Subject: [jira] [Updated] (HIVE-6518) Add a GC canary to the VectorGroupByOperator to flush whenever a GC is triggered

     [ https://issues.apache.org/jira/browse/HIVE-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jitendra Nath Pandey updated HIVE-6518:
---------------------------------------
    Fix Version/s:     (was: 0.14.0)
                   0.13.0

> Add a GC canary to the VectorGroupByOperator to flush whenever a GC is triggered
> --------------------------------------------------------------------------------
>
>                 Key: HIVE-6518
>                 URL: https://issues.apache.org/jira/browse/HIVE-6518
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.13.0
>            Reporter: Gopal V
>            Assignee: Gopal V
>            Priority: Minor
>             Fix For: 0.13.0
>
>         Attachments: HIVE-6518.1-tez.patch, HIVE-6518.2-tez.patch, HIVE-6518.2.patch, HIVE-6518.3.patch
>
>
> The current VectorGroupByOperator implementation flushes the in-memory hashes when the maximum entries or fraction of memory is hit.
> This works for most cases, but there are some corner cases where we hit GC overhead limits or heap size limits before either of those conditions is reached due to the rest of the pipeline.
> This patch adds a SoftReference as a GC canary. If the soft reference is dead, then a full GC pass happened sometime in the near past & the aggregation hashtables should be flushed immediately before another full GC is triggered.

--
This message was sent by Atlassian JIRA
(v6.2#6252)
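
For readers unfamiliar with the technique, the GC-canary idea from the issue description can be sketched roughly as follows. This is not the code from the attached patches: the class GcCanaryAggregator, its fields, and the simulateMemoryPressure() hook are hypothetical names made up for illustration, and a plain HashMap stands in for the operator's aggregation hashtables. Only java.lang.ref.SoftReference and its get()/clear() methods are real JDK API. Note also that the JVM clears soft references at its own discretion under memory pressure, so treating a dead canary as evidence of a recent full GC is a heuristic.

```java
import java.lang.ref.SoftReference;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for an in-memory aggregation hash; not Hive's VectorGroupByOperator.
class GcCanaryAggregator {
    private final Map<String, Long> sums = new HashMap<>();
    private final int maxEntries;
    private int flushCount = 0;

    // The canary: a softly reachable object that the collector may clear
    // when memory is tight. Nothing else holds a reference to the referent.
    private SoftReference<Object> canary = new SoftReference<>(new Object());

    GcCanaryAggregator(int maxEntries) {
        this.maxEntries = maxEntries;
    }

    void add(String key, long value) {
        sums.merge(key, value, Long::sum);
        // Flush on the usual size threshold, or as soon as the canary is found dead.
        if (sums.size() >= maxEntries || canaryIsDead()) {
            flush();
        }
    }

    boolean canaryIsDead() {
        return canary.get() == null;
    }

    // Illustration-only hook: clears the reference the way the collector would.
    void simulateMemoryPressure() {
        canary.clear();
    }

    void flush() {
        // A real operator would forward the partial aggregates downstream here.
        sums.clear();
        flushCount++;
        // Re-arm the canary so the next memory-pressure event is detected too.
        canary = new SoftReference<>(new Object());
    }

    int size() {
        return sums.size();
    }

    int flushCount() {
        return flushCount;
    }

    public static void main(String[] args) {
        GcCanaryAggregator agg = new GcCanaryAggregator(1000);
        agg.add("a", 1);
        agg.simulateMemoryPressure();
        agg.add("b", 2); // the dead canary triggers a flush on this call
        System.out.println("flushes=" + agg.flushCount() + " entries=" + agg.size());
    }
}
```

The design point is that the check costs one get() call per row and needs no GC notification listeners; re-arming the canary after each flush keeps the check meaningful for later memory-pressure events.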