hadoop-common-user mailing list archives

From Harsh J <ha...@cloudera.com>
Subject Re: Datanodes going down frequently
Date Fri, 16 Sep 2011 04:33:00 GMT
I bet it's swapping. You may just be oversubscribing those machines
with your MR slots and heap per slot, or otherwise. It could also be low
DataNode heap given the number of blocks it has to report (which, given
your cluster size, would possibly equate to a small-files issue, but
that's a different discussion).
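
To make the oversubscription point concrete, here is a rough back-of-the-envelope sketch. Only the 2GB of RAM per node comes from the original post; the slot counts, the per-slot heap, and the daemon heap are hypothetical values chosen for illustration, so substitute whatever your mapred-site.xml and hadoop-env.sh actually say.

```python
def committed_heap_mb(map_slots, reduce_slots, child_heap_mb,
                      daemon_heap_mb, daemons=2):
    """Worst-case JVM heap in MB: every task slot busy, every daemon heap full.

    daemons=2 assumes one DataNode and one TaskTracker per worker node.
    """
    return (map_slots + reduce_slots) * child_heap_mb + daemons * daemon_heap_mb


ram_mb = 2048  # from the original post: 2GB RAM per node

# Hypothetical settings for illustration only, not taken from the thread.
total = committed_heap_mb(map_slots=2, reduce_slots=2,
                          child_heap_mb=512, daemon_heap_mb=1000)

oversubscribed = total > ram_mb
print(f"{total} MB of heap committed vs {ram_mb} MB physical RAM; "
      f"oversubscribed: {oversubscribed}")
```

This counts heap only; JVM overhead and the OS itself need memory too, so the real headroom is smaller than the arithmetic suggests. If the committed total exceeds physical RAM, the node can start swapping under load.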

On Fri, Sep 16, 2011 at 3:36 AM, john smith <js1987.smith@gmail.com> wrote:
> Hi all,
> I am running a 10 node cluster (1NN + 9DN, Ubuntu Server 10.04, 2GB RAM
> each). I am facing a strange problem. My datanodes go down randomly and
> nothing shows up in the logs. They lose their network connectivity suddenly
> and the NN declares them dead. Has anyone faced this problem? Is it because
> of Hadoop, or is it some problem with my infrastructure?
> The worst part of the problem is that I need to manually go to the remote
> machine and restart networking. Can someone help me with this? Has anyone
> faced a similar problem?
> Btw, my Hadoop version: 0.20.2
> Thanks,
> jS

Harsh J
