hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Praveen Sripati <praveensrip...@gmail.com>
Subject Re: Modifying Hadoop For join Operation
Date Thu, 24 Jan 2013 16:52:56 GMT
Vikas,

Check the below paper on the different ways on performing joins in MR

http://lintool.github.com/MapReduceAlgorithms/index.html

Also, `Hadoop - The Definitive Guide` has a section on the different
approaches and when to use them.


Thanks,
Praveen

Cloudera Certified Developer for Apache Hadoop CDH4 (95%)
http://www.thecloudavenue.com/
http://stackoverflow.com/users/614157/praveen-sripati

If you aren’t taking advantage of big data, then you don’t have big data,
you have just a pile of data.


On Thu, Jan 24, 2013 at 8:39 PM, Harsh J <harsh@cloudera.com> wrote:

> Hi,
>
> Can you also define 'efficient way' and the idea you have in mind to
> implement that isn't already doable today?
>
> On Thu, Jan 24, 2013 at 6:51 PM, Vikas Jadhav <vikascjadhav87@gmail.com>
> wrote:
> > Anyone has idea about how should i modify Hadoop Code for
> > Performing Join operation in efficient Way.
> > Thanks.
> >
> > --
> >
> >
> > Thanx and Regards
> >  Vikas Jadhav
>
>
>
> --
> Harsh J
>

Mime
View raw message