hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Harsh J <ha...@cloudera.com>
Subject Re: Right way to implement MR ?
Date Wed, 23 May 2012 19:54:04 GMT
Samir,

You can use MultipleInputs for multiple forms of inputs per mapper
(with their own input K/V types, but common output K/V types) with a
common reduce-side join/compare.

See http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/mapreduce/lib/input/MultipleInputs.html.

On Thu, May 24, 2012 at 1:17 AM, samir das mohapatra
<samir.helpdoc@gmail.com> wrote:
> Hi All,
>     How to compare to input file In M/R Job.
>     let A Log file around 30GB
>    and B Log file size is around 60 GB
>
>  I wanted to know how  i will  define <K,V> inside the mapper.
>
>  Thanks
>  samir.



-- 
Harsh J

Mime
View raw message