hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jason Venner <ja...@attributor.com>
Subject Chaining jobs hadoop 0.15 best practices question.
Date Mon, 26 Nov 2007 19:06:42 GMT
We are new to hadoop - 1 week and counting :)

We have a number of tasks that we want to accomplish with hadoop, and 
would like to each each of the hadoop steps very simple.

To our current limited understanding this means that we need to set up N 
hadoop jobs, and run them manually one after the other, using the output 
of one as as the input of the next.

Is there a best practices way of accomplishing this? We are hoping to 
avoid gigantic map tasks.

Thank you all and happy computing.

View raw message