hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From shashwat shriparv <dwivedishash...@gmail.com>
Subject Re: Mappers vs. Map tasks
Date Tue, 25 Feb 2014 06:08:59 GMT
You are really confused :) Please read this :

http://developer.yahoo.com/hadoop/tutorial/module4.html#closer
http://wiki.apache.org/hadoop/HowManyMapsAndReduces



* Warm Regards_**∞_*
* Shashwat Shriparv*
 [image: http://www.linkedin.com/pub/shashwat-shriparv/19/214/2a9]<http://www.linkedin.com/pub/shashwat-shriparv/19/214/2a9>[image:
https://twitter.com/shriparv] <https://twitter.com/shriparv>[image:
https://www.facebook.com/shriparv] <https://www.facebook.com/shriparv>[image:
http://google.com/+ShashwatShriparv]
<http://google.com/+ShashwatShriparv>[image:
http://www.youtube.com/user/sShriparv/videos]<http://www.youtube.com/user/sShriparv/videos>[image:
http://profile.yahoo.com/SWXSTW3DVSDTF2HHSRM47AV6DI/] <shriparv@yahoo.com>



On Tue, Feb 25, 2014 at 11:27 AM, Sugandha Naolekar
<sugandha.n87@gmail.com>wrote:

> Hello,
>
> As per the various articles I went through till date, the File(s) are
> split in chunks/blocks. On the same note, would like to ask few things:
>
>
>    1. No. of mappers are decided as: Total_File_Size/Max. Block Size.
>    Thus, if the file is smaller than the block size, only one mapper will be
>    invoked. Right?
>    2. If yes, it means, the map() will be called only once. Right? In
>    this case, if there are two datanodes with a replication factor as 1: only
>    one datanode(mapper machine) will perform the task. Right?
>    3. The map() function is called by all the datanodes/slaves right? If
>    the no. of mappers are more than the no. of slaves, what happens?
>
> --
> Thanks & Regards,
> Sugandha Naolekar
>
>
>
>

Mime
View raw message