hadoop-mapreduce-user mailing list archives

From ilyal levin <nipponil...@gmail.com>
Subject How to Create an effective chained MapReduce program.
Date Mon, 05 Sep 2011 15:49:07 GMT

I'm trying to write a chained MapReduce program. I'm doing so with a simple
loop: in each iteration I create a job and execute it, and the current job's
output becomes the next job's input.

How can I configure the OutputFormat of the current job and the InputFormat
of the next job so that I don't have to use TextInputFormat (and
TextOutputFormat)? If I do use them, I need to parse the input file in the
map function.

That is, if possible, I want the next job to treat its input file as
<key,value> pairs and not as plain text.
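
To make the setup concrete, here is a minimal sketch of the loop described
above. It is only an illustration under several assumptions: it uses
SequenceFileOutputFormat and SequenceFileInputFormat as one possible way to
hand typed pairs from one job to the next, it assumes Hadoop 2 or later (for
Job.getInstance), and it assumes the initial input directory already holds a
SequenceFile of Text keys and IntWritable values. The names ChainedDriver,
StepMapper, and stepDir, the key/value types, and the directory layout are
all hypothetical.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.SequenceFileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.SequenceFileOutputFormat;

// Hypothetical driver: runs the same job N times, feeding each output to the next run.
class ChainedDriver {

    // Hypothetical mapper. With SequenceFileInputFormat the key and value arrive
    // already typed, so no string parsing is needed here.
    static class StepMapper extends Mapper<Text, IntWritable, Text, IntWritable> {
        @Override
        protected void map(Text key, IntWritable value, Context context)
                throws IOException, InterruptedException {
            context.write(key, value); // placeholder logic: pass the pair through
        }
    }

    // Assumed layout: <baseDir>/step-0 is the initial input, step-1 the first output, etc.
    static String stepDir(String baseDir, int step) {
        return baseDir + "/step-" + step;
    }

    public static void main(String[] args) throws Exception {
        String baseDir = args[0];
        int iterations = Integer.parseInt(args[1]);
        Configuration conf = new Configuration();

        for (int i = 0; i < iterations; i++) {
            Job job = Job.getInstance(conf, "chain-step-" + i);
            job.setJarByClass(ChainedDriver.class);
            job.setMapperClass(StepMapper.class);
            // No reducer class is set, so Hadoop's default identity Reducer runs.

            // Read and write binary <key,value> records instead of text lines.
            job.setInputFormatClass(SequenceFileInputFormat.class);
            job.setOutputFormatClass(SequenceFileOutputFormat.class);
            job.setOutputKeyClass(Text.class);
            job.setOutputValueClass(IntWritable.class);

            // The output directory of iteration i is the input of iteration i + 1.
            FileInputFormat.addInputPath(job, new Path(stepDir(baseDir, i)));
            FileOutputFormat.setOutputPath(job, new Path(stepDir(baseDir, i + 1)));

            if (!job.waitForCompletion(true)) {
                System.exit(1); // stop the chain if any step fails
            }
        }
    }
}
```

In this sketch the output key and value classes of one step have to match
the input types declared by the next step's mapper, since the SequenceFile
stores the records with those types.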

Thanks a lot.
