Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7BD4A10C93 for ; Thu, 12 Dec 2013 09:50:11 +0000 (UTC) Received: (qmail 9869 invoked by uid 500); 12 Dec 2013 09:50:10 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 9816 invoked by uid 500); 12 Dec 2013 09:50:09 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 9598 invoked by uid 500); 12 Dec 2013 09:50:07 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 9591 invoked by uid 99); 12 Dec 2013 09:50:07 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 12 Dec 2013 09:50:07 +0000 Date: Thu, 12 Dec 2013 09:50:07 +0000 (UTC) From: "Vikram Dixit K (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-5973) SMB joins produce incorrect results with multiple partitions and buckets MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-5973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13846211#comment-13846211 ] Vikram Dixit K commented on HIVE-5973: -------------------------------------- https://reviews.apache.org/r/16213/ > SMB joins produce incorrect results with multiple partitions and buckets > ------------------------------------------------------------------------ > > Key: HIVE-5973 > URL: https://issues.apache.org/jira/browse/HIVE-5973 > Project: Hive > Issue Type: Bug > Components: Query Processor > Affects Versions: 0.13.0 > Reporter: Vikram Dixit K > Assignee: Vikram Dixit K > Fix For: 0.13.0 > > Attachments: HIVE-5973.1.patch > > > It looks like there is an issue with re-using the output object array in the select operator. When we read rows of the non-big tables, we hold on to the output object in the priority queue. This causes hive to produce incorrect results because all the elements in the priority queue refer to the same object and the join happens on only one of the buckets. > {noformat} > output[i] = eval[i].evaluate(row); > {noformat} -- This message was sent by Atlassian JIRA (v6.1.4#6159)