hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From James Teng <tenglinx...@outlook.com>
Subject How to recover reducer task data on a different data node?
Date Thu, 03 Jul 2014 06:28:57 GMT
First i would like to declare that although i am not new to hadoop, but not expert on it as
well.i would like to consult one issue on mapreduce framework. below is the description of
the scenarios.
When one reduce task is failed on one datanode, then the job tracker will try to schedule
another node to set up this reduce job and  continue running, my question is how to get the
assigned data back on the new node? when the map phase is done, the output data will be copied
to the respective partitioned reducer, now if the reduce is created on the a new node, what
kind of actions does the new node take to get all the map-allocated data back.

thanks in advance.
View raw message