hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sean Bigdatafun <sean.bigdata...@gmail.com>
Subject Re: Who actually does the split computation?
Date Wed, 09 Feb 2011 21:49:05 GMT
Where does this computation happen (in the context of the original picture
in the posted link )?

JobClient? or JobTracker? (Either way I think they need to contact HDFS
Namenode to do such a work, which did not seem to get described in that
link) --- I can't post on mapreduce-user mailing list, so I have to ask it
here.

On Wed, Feb 9, 2011 at 1:13 PM, David Rosenstrauch <darose@darose.net>wrote:

> On 02/09/2011 04:09 PM, Sean Bigdatafun wrote:
>
>> 1. My first question: who is responsible to compute the input splits?
>>
>
> The InputFormat computes InputSplits.  See:
> http://hadoop.apache.org/common/docs/r0.20.1/api/org/apache/hadoop/mapreduce/InputFormat.html
>
> DR
>



-- 
--Sean

Mime
View raw message