Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5DD241174F for ; Fri, 2 May 2014 19:10:40 +0000 (UTC) Received: (qmail 76791 invoked by uid 500); 2 May 2014 18:38:34 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 76710 invoked by uid 500); 2 May 2014 18:38:25 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 76622 invoked by uid 500); 2 May 2014 18:38:07 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 76586 invoked by uid 99); 2 May 2014 18:38:02 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 May 2014 18:38:02 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of thejas@hortonworks.com designates 209.85.214.170 as permitted sender) Received: from [209.85.214.170] (HELO mail-ob0-f170.google.com) (209.85.214.170) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 02 May 2014 18:37:57 +0000 Received: by mail-ob0-f170.google.com with SMTP id vb8so5648072obc.15 for ; Fri, 02 May 2014 11:37:34 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc:content-type; bh=/0JhE7N3z10VRk+z1DGKA7qE4s+dijVwM4qNxVyFLR0=; b=ak+lf/KL5OpJNokRQahnw06OoLUe2KT7DrzyEgyfL/WwgUuMLIeelnG0zujUfd2j/M S9pAEVo3LXQNS9K7bKnqoEYJEITN/wNt1VZkr872n8EJG+XiPjmB1xOwD+sVO/RPYuTI Us/i4Nsff4jqPxarnKo2/+gNWp5/zHrPE+qm5OAZ/p6e8Cq8sj+0B5T6GI1y6LLhG1Mn Zf0qUiDi3hi2kvYkvfBg5piS3vsfnM2Ebu/T6C6bGSWFzmhaCSv3PlSIFahjnm2KATdw GUisGZRoaKLBZ/DOAzg7gQA812j2YwkDAKf1L+BwuiGgHNNRBXd9L2JexvnOZxt3uNJI +O3g== X-Gm-Message-State: ALoCoQnsgbQke6jZ2us6WEov7f0JkbCOhuO+hfKReMhZN4wGXiChYJHGDNqCYm651yb2xx+3Z8GLSYvgu4fxv8jEfUXeT/DLE9H7aT+Bn2+8jk13UM256pw= MIME-Version: 1.0 X-Received: by 10.182.213.168 with SMTP id nt8mr16673454obc.7.1399055854602; Fri, 02 May 2014 11:37:34 -0700 (PDT) Received: by 10.76.156.99 with HTTP; Fri, 2 May 2014 11:37:34 -0700 (PDT) In-Reply-To: References: Date: Fri, 2 May 2014 11:37:34 -0700 Message-ID: Subject: Re: SMB join bug From: Thejas Nair To: "dev@hive.apache.org" Cc: "" Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org It is possible that you hit this issue - https://issues.apache.org/jira/browse/HIVE-5973 It is fixed in apache hive 0.13 release. On Thu, May 1, 2014 at 7:10 PM, Sukhendu Chakraborty wrote: > I am seeing very different number of rows in this query output depending on > whether I enable SMB join: > > select count(*) > from dss.hist_hshld_profl_mc a > join > dss.hshld_summary_mc b > on a.hh_key = b.hh_key > where ('2012-02-27' between a.hshld_profl_eff_dt and a.hshld_profl_exp_dt) > and a.hshld_exp_dt='9999-12-31' > and trim(a.cntry_id) = 'USA' > > The SMB join returns 60 rows (wrong value) while the regular join returns > 30million plus rows (correct value). > > Is there a known issue/jira for this? We are using CDH5.0/hive-0.12. > > -Sukhendu -- CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.