Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 36D3DD29D for ; Fri, 7 Dec 2012 20:33:05 +0000 (UTC) Received: (qmail 23994 invoked by uid 500); 7 Dec 2012 20:33:03 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 23955 invoked by uid 500); 7 Dec 2012 20:33:03 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 23947 invoked by uid 99); 7 Dec 2012 20:33:03 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Dec 2012 20:33:03 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of souvikbanerjee@gmail.com designates 209.85.212.48 as permitted sender) Received: from [209.85.212.48] (HELO mail-vb0-f48.google.com) (209.85.212.48) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 07 Dec 2012 20:32:55 +0000 Received: by mail-vb0-f48.google.com with SMTP id fc21so857681vbb.35 for ; Fri, 07 Dec 2012 12:32:34 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=0dc0eB1eo0wSQfoysJHKqP8kfmrAeK+UhloPqjmTNc8=; b=LFBG3KiQ//yDRAHDR9WESwf4Jb7A859iDrWBZDTMntCMsvAUoNivIDj9Hi7h1afUJT xaIOgPuDOFNAypiUZ3bL96+ffa2y/DabYGpZfw5z/48goSVOAytOjvPoD2cRPDwO2py3 j9AXvlkjCr0B2wNZxbeC53Sxq2Xktluc+BXcjyjYF7rPMtuHjpYFEg53LGGTMgT5C8Q6 JS6/EUKC+9A8cOHxG4jf5GBFvtaf4Y6Bv7RJtucKnqai28FbnepHKzbYCNa8MPMPEkr7 un4ghLt+NcPsCGJxAm+9PopgPTxxaAXOKg3mCcNEFhn3yY3Ae8TL2tMKvW5kg9p3MTQW gcEA== MIME-Version: 1.0 Received: by 10.52.35.129 with SMTP id h1mr3983199vdj.74.1354912354831; Fri, 07 Dec 2012 12:32:34 -0800 (PST) Received: by 10.220.142.140 with HTTP; Fri, 7 Dec 2012 12:32:34 -0800 (PST) In-Reply-To: <531705801-1354911009-cardhu_decombobulator_blackberry.rim.net-752895296-@b3.c16.bise7.blackberry> References: <531705801-1354911009-cardhu_decombobulator_blackberry.rim.net-752895296-@b3.c16.bise7.blackberry> Date: Fri, 7 Dec 2012 14:32:34 -0600 Message-ID: Subject: Re: Map side join From: Souvik Banerjee To: user@hive.apache.org, bejoy_ks@yahoo.com Content-Type: multipart/alternative; boundary=20cf3079b9049c8cac04d0491d05 X-Virus-Checked: Checked by ClamAV on apache.org --20cf3079b9049c8cac04d0491d05 Content-Type: text/plain; charset=ISO-8859-1 Hi Bejoy, That's wonderful. Thanks for your reply. What I was wondering if HIVE can do map side join with more than one condition on JOIN clause. I'll simply try it out and post the result. Thanks once again. Regards, Souvik. On Fri, Dec 7, 2012 at 2:10 PM, wrote: > ** > Hi Souvik > > In earlier versions of hive you had to give the map join hint. But in > later versions just set hive.auto.convert.join = true; > Hive automatically selects the smaller table. It is better to give the > smaller table as the first one in join. > > You can use a map join if you are joining a small table with a large one, > in terms of data size. By small, better to have the smaller table size in > range of MBs. > Regards > Bejoy KS > > Sent from remote device, Please excuse typos > ------------------------------ > *From: *Souvik Banerjee > *Date: *Fri, 7 Dec 2012 13:58:25 -0600 > *To: * > *ReplyTo: *user@hive.apache.org > *Subject: *Map side join > > Hello everybody, > > I have got a question. I didn't came across any post which says somethign > about this. > I have got two tables. Lets say A and B. > I want to join A & B in HIVE. I am currently using HIVE 0.9 version. > The join would be on few columns. like on (A.id1 = B.id1) AND (A.id2 = > B.id2) AND (A.id3 = B.id3) > > Can I ask HIVE to use map side join in this scenario? Should I give a hint > to HIVE by saying /*+mapjoin(B)*/ > > Get back to me if you want any more information in this regard. > > Thanks and regards, > Souvik. > --20cf3079b9049c8cac04d0491d05 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hi Bejoy,
=A0
That's wonderful. Thanks for you= r reply.
What I was wondering if HIVE can do = map side join with more than one condition on JOIN clause.
I'll simply try it out and post = the result.
=A0
Thanks once again.
=A0
Regards,
Souvik.

On Fri, Dec 7, 2012 at 2:10 PM, <bejoy_ks@yaho= o.com> wrote:
Hi Souvik

In earlier versions of hive you had to give the map j= oin hint. But in later versions just set hive.auto.convert.join =3D true;Hive automatically selects the smaller table. It is better to give the sm= aller table as the first one in join.

You can use a map join if you are joining a small table with a large on= e, in terms of data size. By small, better to have the smaller table size i= n range of MBs.
Regards
Bejoy KS

Sent from remote device, Please excuse typ= os

From: Souvik Banerjee <souvikbanerjee@gmail.com>
Date: Fri, 7 Dec 2012 13:58:25 -0600
Subject: Map side join

Hello everybody,
=A0
I have got a question. I didn't = came across any post which says somethign about this.
I have got two tables. Lets say A an= d B.
I want to join A & B in HIVE. I = am currently using HIVE 0.9 version.
The join would be on few columns. li= ke on (A.id1 =3D B.id1) AND (A.id2 =3D B.id2) AND (A.id3 =3D B.id3)<= /div>
=A0
Can I ask HIVE to use map side join = in this scenario?=A0Should I give a hint to HIVE by saying /*+mapjoin(B)*/ =
=A0
Get back to me if you want any more = information in this regard.
=A0
Thanks and regards,
Souvik.

--20cf3079b9049c8cac04d0491d05--