Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 85FFF180A7 for ; Tue, 1 Mar 2016 18:28:14 +0000 (UTC) Received: (qmail 80857 invoked by uid 500); 1 Mar 2016 18:28:12 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 80722 invoked by uid 500); 1 Mar 2016 18:28:12 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 80703 invoked by uid 99); 1 Mar 2016 18:28:12 -0000 Received: from mail-relay.apache.org (HELO mail-relay.apache.org) (140.211.11.15) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 01 Mar 2016 18:28:12 +0000 Received: from [192.168.1.108] (c-24-5-80-16.hsd1.ca.comcast.net [24.5.80.16]) by mail-relay.apache.org (ASF Mail Server at mail-relay.apache.org) with ESMTPSA id 0FA981A010E; Tue, 1 Mar 2016 18:28:10 +0000 (UTC) User-Agent: Microsoft-MacOutlook/14.5.9.151119 Date: Tue, 01 Mar 2016 10:28:07 -0800 Subject: Re: Wrong column is picked in HIVE 2.0.0 + TEZ 0.8.2 left join From: Gopal Vijayaraghavan Sender: Gopal Vijayaraghavan To: "user@hive.apache.org" CC: GAO Chi Message-ID: Thread-Topic: Wrong column is picked in HIVE 2.0.0 + TEZ 0.8.2 left join References: <061b01d17372$310a9c60$931fd520$@microfun.com> <063e01d1737b$8ff477b0$afdd6710$@microfun.com> <065701d173db$36caafa0$a4600ee0$@microfun.com> In-Reply-To: <065701d173db$36caafa0$a4600ee0$@microfun.com> Mime-version: 1.0 Content-type: text/plain; charset="ISO-8859-1" Content-transfer-encoding: quoted-printable (Bcc: Tez, Cross-post to hive) > I added =B3set hive.execution.engine=3Dmr;=B2 at top of the script, seems the >result is correct=8A Pretty sure it's due to the same table aliases for both dummy tables (they're both called _dummy_table) auto join conversion. hive> set hive.auto.convert.join=3Dfalse; Should go back to using slower tagged joins even in Tez, which will add a table-tag i.e first table will be (, 0) amd 2nd table will be (, 1). I suspect the difference between the MR and Tez runs are lookup between the table-name + expr (both equal for _dummy_table.11). > per Jeff Zhang's thinking if you were to set the exec engine to 'mr' >would it still fail? if so, then its not Tez . :) Hive has a a whole set of join algorithms which can only work on Tez, so it's not always that easy. Considering this is on hive-2.0.0, I recommend filing a JIRA on 2.0.0 and marking it with 2.0.1 as a target version. Cheers, Gopal =20 =20