hadoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jay Vyas <jayunit...@gmail.com>
Subject Re: CompositeInputFormat
Date Thu, 11 Jul 2013 21:10:15 GMT
Map Side joins will use the CompositeInputFormat.  They will only really be
worth doing if one data set is small, and the other is large.

This is a good example :
http://www.congiu.com/joins-in-hadoop-using-compositeinputformat/

the trick is to google for CompositeInputFormat.compose() .... :)


On Thu, Jul 11, 2013 at 5:02 PM, Botelho, Andrew <Andrew.Botelho@emc.com>wrote:

> Hi,****
>
> ** **
>
> I want to perform a JOIN on two sets of data with Hadoop.  I read that the
> class CompositeInputFormat can be used to perform joins on data, but I
> can’t find any examples of how to do it.****
>
> Could someone help me out? It would be much appreciated. J****
>
> ** **
>
> Thanks in advance,****
>
> ** **
>
> Andrew****
>



-- 
Jay Vyas
http://jayunit100.blogspot.com

Mime
View raw message