hadoop-common-user mailing list archives

From "Joman Chu" <jom...@andrew.cmu.edu>
Subject Re: multiple Output Collectors ?
Date Mon, 14 Jul 2008 21:50:22 GMT
One cheap hack that comes to mind: extend the GenericWritable and
ArrayWritable classes so that your first job can emit both kinds of
value in a single output, then write a second and a third MapReduce
job that each parse the first job's output and select only the
Key-Value pairs they want.
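
A rough sketch of that idea, in plain Java with no Hadoop classes, just to
show the data flow. Everything here is made up for illustration: the Tagged
class stands in for a GenericWritable subclass, the "other" output is a
placeholder (it stores the content length) since the second mapping was not
described, and the href regex is a toy, not a real HTML parser.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class TaggedOutputSketch {

    // Hypothetical stand-in for a GenericWritable subclass: one record of the
    // first job's output, with a tag naming the logical output it belongs to.
    static final class Tagged {
        final String tag;
        final String key;
        final String value;

        Tagged(String tag, String key, String value) {
            this.tag = tag;
            this.key = key;
            this.value = value;
        }
    }

    static final String OUTLINKS = "outlinks";
    static final String OTHER = "other"; // placeholder for the second mapping

    // Toy pattern for double-quoted href attributes only.
    private static final Pattern HREF = Pattern.compile("href=\"([^\"]*)\"");

    // "First job": for each <url, html-content> emit two tagged records
    // into one combined output.
    static List<Tagged> firstJob(Map<String, String> pages) {
        List<Tagged> out = new ArrayList<>();
        for (Map.Entry<String, String> page : pages.entrySet()) {
            List<String> links = new ArrayList<>();
            Matcher m = HREF.matcher(page.getValue());
            while (m.find()) {
                links.add(m.group(1));
            }
            out.add(new Tagged(OUTLINKS, page.getKey(), String.join(",", links)));
            out.add(new Tagged(OTHER, page.getKey(),
                    String.valueOf(page.getValue().length())));
        }
        return out;
    }

    // "Second" and "third" jobs: each reads the combined output and keeps
    // only the records carrying the tag it cares about.
    static Map<String, String> select(List<Tagged> firstJobOutput, String tag) {
        Map<String, String> selected = new LinkedHashMap<>();
        for (Tagged t : firstJobOutput) {
            if (t.tag.equals(tag)) {
                selected.put(t.key, t.value);
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        Map<String, String> pages = new LinkedHashMap<>();
        pages.put("http://a.example/", "<a href=\"http://b.example/\">b</a>");

        List<Tagged> combined = firstJob(pages);
        System.out.println(select(combined, OUTLINKS));
        System.out.println(select(combined, OTHER));
    }
}
```

In a real job the tag would live inside the value type emitted by the first
mapper, and the two follow-up jobs would each be a mapper that drops records
with the wrong tag. The cost is two extra passes over the first job's output.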

Joman Chu
IRC: irc.liquid-silver.net

On Mon, Jul 14, 2008 at 2:19 PM, Khanh Nguyen <knguyen@cs.umb.edu> wrote:
> Hello,
> Is it possible to have more than one output collector for one map?
> My input is records of html pages. I am mapping each url to its
> html-content and want to have two output collectors. One that maps
> each <url, html-content> --> <url, outlinks> and another one that maps
> <url, html-content> to something else (difficult to explain).
> Please help. Thanks
> -k
