hadoop-common-user mailing list archives

From Mathijs Homminga <mathijs.hommi...@knowlogy.nl>
Subject Re: non responding tasks (solved)
Date Tue, 24 Apr 2007 08:29:07 GMT
Hi Eelco,

We use Ubuntu Server 6.06 on all our nodes. As I mentioned, two of the 
nodes could not ping themselves, although they could ping other nodes.

Adding the loopback interface to /etc/network/interfaces solved our 
problem. After a reboot, the nodes can ping themselves. The file now 
reads:

# The loopback network interface
auto lo
iface lo inet loopback

auto eth0
iface eth0 inet dhcp
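
For anyone who wants to check a node without waiting for a task to time 
out, here is a rough sketch of a self-connect test in Python. It is not 
part of Hadoop; the function name can_connect_to_self and the 2 second 
timeout are my own choices for illustration. It opens a listening socket 
on the given address and tries to connect to it from the same machine, 
which is roughly what failed on our two nodes.

```python
import socket


def can_connect_to_self(address="127.0.0.1", timeout=2.0):
    """Return True if this machine can open a TCP connection to itself.

    Hypothetical helper for illustration only: binds a listener on
    `address` and connects to it from the same host.
    """
    try:
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as server:
            server.bind((address, 0))  # port 0: let the OS pick a free port
            server.listen(1)
            port = server.getsockname()[1]
            # The kernel completes the handshake for a listening socket,
            # so no explicit accept() is needed for this check.
            with socket.create_connection((address, port), timeout=timeout):
                return True
    except OSError:
        # Covers bind failures, refused connections, timeouts and
        # hostname resolution errors.
        return False


if __name__ == "__main__":
    # Check both the loopback address and the node's own hostname.
    for target in ("127.0.0.1", socket.gethostname()):
        status = "ok" if can_connect_to_self(target) else "FAILED"
        print(f"{target}: {status}")
```

Running it on each node should show whether the loopback address and 
the node's own hostname are reachable locally. A failure on 127.0.0.1 
would point at a missing lo interface like the one we had.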


Eelco Lempsink wrote:
> On 23-apr-2007, at 12:14, Mathijs Homminga wrote:
>> I have had some troubles with 2 nodes on one of our clusters.
>> While most nodes finished their map tasks successfully in about 2 
>> secs, two were not responding well. On their Task Trackers the task 
>> status remained UNASSIGNED for a couple of minutes (and the Job 
>> Tracker receives no heartbeats) and then changed to RUNNING but in 
>> the end the task got killed after 600 secs because no status update 
>> had been received.
>> I found out that this was caused by the fact that we had not 
>> installed the loopback interface correctly on these two nodes. So, 
>> although all machines could connect to each other, two of them could 
>> not connect to themselves.
> Could you explain how you installed your loopback device now?  I ran 
> into a similar (maybe the same) problem, where I could only reach the 
> _local_ tasktracker by poking a hole in my firewall.
>> Btw, could I have seen this in any of the logs?
> I don't think so, it just times out.
> --
> Regards,
> Eelco Lempsink

Helperpark 290 C
9723 ZA Groningen

+31 (0)6 15312977
