hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From ranjith raghunath <ranjith.raghuna...@gmail.com>
Subject Re: Replication
Date Wed, 31 Oct 2012 04:19:44 GMT
The namenode does decide the replica for either case. It just so happens
that when running from a datanode the first replica is housed on the same
node. Hope this makes sense.
On Oct 30, 2012 8:13 PM, "Mohit Anchlia" <mohitanchlia@gmail.com> wrote:

> Thanks and if it is not the datanode then I am guessing namenode decides
> the nodes in replication pipeline?
>
> On Tue, Oct 30, 2012 at 5:36 PM, ranjith raghunath <
> ranjith.raghunath1@gmail.com> wrote:
>
>> If your client node is a datanode with your cluster then the first copy
>> does get written to that data node.
>>
>> Experts please feel free to correct me here.
>>  On Oct 30, 2012 7:11 PM, "Mohit Anchlia" <mohitanchlia@gmail.com> wrote:
>>
>>> With respect to replication if I run pig job from one of the nodes
>>> within the Hadoop cluster then do I always end up with writing 1 replica
>>> copy to that client node always and remaining 2 replica copies to other
>>> nodes?
>>>
>>>
>>
>

Mime
View raw message