hadoop-user mailing list archives

From Answer Agrawal <yrsna.tse...@gmail.com>
Subject Re: Can we control data distribution and load balancing in Hadoop Cluster?
Date Mon, 04 May 2015 07:35:19 GMT
Thanks, Mr. Chandrashekhar.

HDFS splits an input data set into blocks (128 MB by default) and
replicates each block (replication factor 3 by default). Hadoop also
balances load by moving the tasks of failed or busy nodes to free, active
nodes. Can we control for ourselves how much data and how much load is
assigned to each node?
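
To make the defaults concrete, here is a rough sketch of the arithmetic only. It does not call any Hadoop API; the function name `block_layout` and the 1000 MB file size are made up for illustration, and the 128 MB block size and replication factor 3 are the defaults mentioned above.

```python
import math

BLOCK_SIZE_MB = 128  # default HDFS block size, as stated above
REPLICATION = 3      # default HDFS replication factor, as stated above


def block_layout(file_size_mb, block_size_mb=BLOCK_SIZE_MB, replication=REPLICATION):
    """Return (blocks, block replicas, raw storage in MB) for one file.

    Hypothetical helper for illustration; not part of Hadoop.
    """
    if file_size_mb <= 0:
        return 0, 0, 0
    # The last block may be partial, so round up.
    blocks = math.ceil(file_size_mb / block_size_mb)
    # Every block is stored `replication` times across the cluster.
    replicas = blocks * replication
    # A partial last block only occupies its actual size on disk.
    raw_mb = file_size_mb * replication
    return blocks, replicas, raw_mb


print(block_layout(1000))  # (8, 24, 3000)
```

So a 1000 MB file becomes 8 blocks and 24 block replicas under these defaults. Which nodes hold those replicas is what I would like to control.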

On Mon, May 4, 2015 at 12:03 AM, Chandrashekhar Kotekar <
shekhar.kotekar@gmail.com> wrote:

> Your question is very vague. Can you give us more details about the
> problem you are trying to solve?
>
>
> Regards,
> Chandrashekhar Kotekar
> Mobile - +91 8600011455
>
> On Sun, May 3, 2015 at 11:59 PM, Answer Agrawal <yrsna.tset01@gmail.com>
> wrote:
>
>> Hi
>>
>> From what I have studied, data distribution, load balancing, and fault
>> tolerance are implicit in Hadoop. But I need to customize them. Can we do that?
>>
>> Thanks
>>
>>
>
