hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Neil Blue" <Neil.B...@biowisdom.com>
Subject Slave data node failing to connect?
Date Tue, 03 Nov 2009 11:23:25 GMT
Hello


I am trying to start up my first twin node hadoop cluster. I have followed
this guide:

http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_%28Multi-Nod
e_Cluster%29
,and got two machines running as single node instances and then moved on to
connect them into a multi-node cluster.

I have two ubuntu instances running in virtual box with a bridged network
adapter.

I have configured the xml files slaves and master to point to the correct
machines, along with the ssh key.

When I start up the services I get all these starting on the master:

JobTracker
DataNode
SecondaryNameNode
TaskTracker
NameNode

The web interface shows the system is up and running with one node. 

On the slave these are running:
TaskTracker
DataNode

The output logs on the slave show:

hadoop-hadoop-datanode-slave.log
2009-11-03 11:15:52,055 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:4310. Already tried 9 time(s).
2009-11-03 11:15:52,057 INFO org.apache.hadoop.ipc.RPC: Server at
master/172.18.11.95:4310 not available yet, Zzzzz...
2009-11-03 11:15:54,063 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:4310. Already tried 0 time(s).
2009-11-03 11:15:55,064 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:4310. Already tried 1 time(s).
2009-11-03 11:15:56,068 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:4310. Already tried 2 time(s).
2009-11-03 11:15:57,073 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:4310. Already tried 3 time(s).

hadoop-hadoop-tasktracker-slave.log
2009-11-03 11:18:01,002 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:9001. Already tried 9 time(s).
2009-11-03 11:18:01,004 INFO org.apache.hadoop.ipc.RPC: Server at
master/172.18.11.95:9001 not available yet, Zzzzz...
2009-11-03 11:18:03,007 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:9001. Already tried 0 time(s).
2009-11-03 11:18:04,009 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:9001. Already tried 1 time(s).
2009-11-03 11:18:05,011 INFO org.apache.hadoop.ipc.Client: Retrying connect
to server: master/172.18.11.95:9001. Already tried 2 time(s).

Tcpdump shows that the packets are being sent between the machines, and ssh
works, so there does not seem to be any network problems. Also on the slave,
the remote http://master:50070/dfshealth.jsp page is visible.

I have also tried changing the port numbers used by the master, but no luck.

Any suggestions please.

Thanks
Neil

********************************************* 

The information contained in this message is likely to be confidential.  It is intended only
for the person named above.  Any dissemination, distribution, copying, disclosure or use of
this message or its contents unless authorised by BioWisdom Ltd is strictly prohibited. Any
views or opinions expressed within this e-mail are those of the author and do not necessarily
represent those of BioWisdom Ltd. If you have received this message in error, please immediately
notify us and delete it.  Thank you.  Registered Office: BioWisdom Ltd, Harston Mill, Harston,
Cambridge, CB22 7GG.  Registered in England: (GB) 3861669.  VAT registered: (GB) 750899881.
 Tel: +44 (0)1223 874800, Fax: +44 (0) 1223 874801, Internet:www.biowisdom.com 

*********************************************


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message