Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3DCB190EF for ; Thu, 19 Jan 2012 15:54:03 +0000 (UTC) Received: (qmail 36929 invoked by uid 500); 19 Jan 2012 15:54:02 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 36690 invoked by uid 500); 19 Jan 2012 15:54:01 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 36682 invoked by uid 99); 19 Jan 2012 15:54:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Jan 2012 15:54:01 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [128.105.6.20] (HELO sabe.cs.wisc.edu) (128.105.6.20) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 19 Jan 2012 15:53:52 +0000 Received: from [192.168.61.110] (adsl-99-97-234-17.dsl.mdsnwi.sbcglobal.net [99.97.234.17]) (authenticated bits=0) by sabe.cs.wisc.edu (8.14.1/8.14.1) with ESMTP id q0JFrTYd032216 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NO) for ; Thu, 19 Jan 2012 09:53:30 -0600 From: Avrilia Floratou Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Subject: Question on bucketed map join Date: Thu, 19 Jan 2012 09:53:28 -0600 Message-Id: <67B1A444-AF6A-45B9-A978-58293180CB69@cs.wisc.edu> To: user@hive.apache.org Mime-Version: 1.0 (Apple Message framework v1251.1) X-Mailer: Apple Mail (2.1251.1) X-Virus-Checked: Checked by ClamAV on apache.org Hi, I have two tables with 8 buckets each on the same key and want to join = them. I ran "explain extended" and get the plan produced by HIVE which shows = that a map-side join is a possible plan. I then set in my script the hive.optimize.bucketmapjoin option to true = and reran the "explain extended" query. I get the exact same plans as = output. I ran the query with and without the bucketmapjoin optimization and saw = no difference in the running time. I have the following questions: 1. Should I see sth different in the explain extended output if I set = and unset the hive.optimize.bucketmapjoin option? 2. Should I see something different in the output of hive while running = the query if again I set and unset the hive.optimize.bucketmapjoin? 3. Is it possible that even though I set bucketmapjoin to true, Hive = will still perform a normal map-side join for some reason? How can I = check if this has actually happened? Thanks, Avrilia=20=