hadoop-mapreduce-user mailing list archives

From fan wei fang <eagleeye8...@gmail.com>
Subject Location reduce task running.
Date Mon, 24 Aug 2009 03:47:08 GMT
Hello guys,

I am new to Hadoop and am running an experiment with it.
My situation is:
 +My job is expected to run continuously or at frequent intervals.
 +My reduce task requires a large amount of configuration data. This config
data is specific to the map output key.
-->That is why I want to avoid moving this config data around.
As far as I have read, the nodes that reduce tasks are assigned to are picked
without regard to data locality.

My question is: Is there any way to force the reduce tasks for a specific
key to run on the same node?

