hadoop-common-user mailing list archives

From Pankil Doshi <forpan...@gmail.com>
Subject Question regarding Map side Join
Date Mon, 13 Jul 2009 23:18:40 GMT
I have a question regarding map-side join.
I finally got a copy of your book and tried implementing it, and I have a few
questions about it.

File 1:
31    Rafferty
33    Jones
33    Steinberg
34    Robinson
34    Smith
<null>    Jasper

File 2:
31    sales
33    Engg
34    Clerical
35    Marketing

Results I got using map-side join:

File1 inner join with File2
31    Rafferty
31    sales
33    Jones
33    Engg
33    Steinberg
33    Engg


File2 inner join with File1

31    sales
31    Rafferty
33    Engg
33    Jones
33    Engg
33    Steinberg
34    Clerical
34    Robinson
34    Clerical
34    Smith


But I am looking for a result like the one below:

31    sales    Rafferty
33    Engg    Jones
33    Engg    Steinberg
34    Clerical    Robinson
34    Clerical    Smith


Is this possible using only a map-side join?

I am looking for a simple join that keeps only the key values present in both files.
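
The desired output can be sketched in plain Java as a merge join over two key-sorted inputs. This only illustrates the flattened result, one line per matching pair. It is not Hadoop's map-side join API, and the names MergeJoinSketch, Row, and innerJoin are hypothetical. The row with the <null> key is left out of the sample input on the assumption that a row without a key cannot match anything.

```java
import java.util.ArrayList;
import java.util.List;

public class MergeJoinSketch {

    /** One record: an integer join key and a single value (hypothetical type). */
    public static final class Row {
        final int key;
        final String value;

        public Row(int key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    /**
     * Inner join of two lists that are already sorted by key.
     * Emits "key<TAB>leftValue<TAB>rightValue" for every matching pair.
     */
    public static List<String> innerJoin(List<Row> left, List<Row> right) {
        List<String> out = new ArrayList<>();
        int i = 0;
        int j = 0;
        while (i < left.size() && j < right.size()) {
            int lk = left.get(i).key;
            int rk = right.get(j).key;
            if (lk < rk) {
                i++;                       // key only on the left side: skip
            } else if (lk > rk) {
                j++;                       // key only on the right side: skip
            } else {
                // Find the run of rows sharing this key on each side.
                int iEnd = i;
                while (iEnd < left.size() && left.get(iEnd).key == lk) {
                    iEnd++;
                }
                int jEnd = j;
                while (jEnd < right.size() && right.get(jEnd).key == lk) {
                    jEnd++;
                }
                // Emit one flattened line per (left, right) pair.
                for (int a = i; a < iEnd; a++) {
                    for (int b = j; b < jEnd; b++) {
                        out.add(lk + "\t" + left.get(a).value + "\t" + right.get(b).value);
                    }
                }
                i = iEnd;
                j = jEnd;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // File 2 (departments) and File 1 (employees) from this message.
        List<Row> depts = List.of(
                new Row(31, "sales"), new Row(33, "Engg"),
                new Row(34, "Clerical"), new Row(35, "Marketing"));
        List<Row> emps = List.of(
                new Row(31, "Rafferty"), new Row(33, "Jones"), new Row(33, "Steinberg"),
                new Row(34, "Robinson"), new Row(34, "Smith"));
        innerJoin(depts, emps).forEach(System.out::println);
    }
}
```

Key 35 appears only in File 2, so it produces no output line.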

Pankil