hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Kuhu Shukla <kshu...@yahoo-inc.com>
Subject Re: Help troubleshooting multi-cluster setup
Date Wed, 23 Sep 2015 20:49:05 GMT
Hi Daniel,
The RM will list only NodeManagers and not the datanodes. You can view the datanodes on the
NameNode page (eg.
The one node you see on the RM page 'Nodes' list is from this:
hadoop@hadoop-master:~$ jps24641 SecondaryNameNode24435 DataNode24261 NameNode24791 ResourceManager25483
Jps24940 NodeManager   <<<<<<<<<<
You might want NodeManagers and dataNodes to run on the same physical hosts in most of the
cases, AFAIK. 
Hope this helps.Regards,Kuhu 

     On Wednesday, September 23, 2015 3:31 PM, Daniel Watrous <dwmaillist@gmail.com>

 I was able to get the jobs submitting to the cluster by adding the following property to

I also had to add the following properties to yarn-site.xml

I'm still not sure why the datanodes don't show up in the nodes view. Is the idea that a data
node is only used for HDFS and yarn doesn't schedule jobs there? If so, how can I add additional
compute hosts? What are those called?
On Wed, Sep 23, 2015 at 3:08 PM, Daniel Watrous <dwmaillist@gmail.com> wrote:

I'm not sure if this is related, but I'm seeing some errors in hadoop-hadoop-namenode-hadoop-master.log
2015-09-23 19:56:27,798 WARN org.apache.hadoop.hdfs.server.blockmanagement.DatanodeManager:
Unresolved datanode registration: hostname cannot be resolved (ip=, hostname=
2015-09-23 19:56:27,800 INFO org.apache.hadoop.ipc.Server: IPC Server handler 6 on 54310,
call org.apache.hadoop.hdfs.server.protocol.DatanodeProtocol.registerDatanode from
Call#373 Retry#0
org.apache.hadoop.hdfs.server.protocol.DisallowedDatanodeException: Datanode denied communication
with namenode because hostname cannot be resolved (ip=, hostname=
DatanodeRegistration(, datanodeUuid=8a5d90c8-b909-46d3-80ec-2a3a8f1fe904, infoPort=50075,
infoSecurePort=0, ipcPort=50020, storageInfo=lv=-56;cid=CID-bc60d031-11b0-4eb5-8f9b-da0f8a069ea6;nsid=1223814533;c=0)
	at org.apache.hadoop.hdfs.server.blockmanagement.DatanodeManager.registerDatanode(DatanodeManager.java:863)
	at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.registerDatanode(FSNamesystem.java:4529)
	at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.registerDatanode(NameNodeRpcServer.java:1279)
	at org.apache.hadoop.hdfs.protocolPB.DatanodeProtocolServerSideTranslatorPB.registerDatanode(DatanodeProtocolServerSideTranslatorPB.java:95)
	at org.apache.hadoop.hdfs.protocol.proto.DatanodeProtocolProtos$DatanodeProtocolService$2.callBlockingMethod(DatanodeProtocolProtos.java:28539)
	at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616)
	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969)
	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2049)
	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2045)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:422)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2043)

I don't have a server with the IP and I don't think I'm referencing that anywhere.
Is there some reason that it's trying to add that host as a namenode?
On Wed, Sep 23, 2015 at 1:58 PM, Daniel Watrous <dwmaillist@gmail.com> wrote:

I have deployed a multi-node cluster with one master and two data nodes. Here's what jps shows:
hadoop@hadoop-master:~$ jps24641 SecondaryNameNode24435 DataNode24261 NameNode24791 ResourceManager25483
Jps24940 NodeManager
hadoop@hadoop-data1:~$ jps15556 DataNode16198 NodeManager16399 Jps
hadoop@hadoop-data2:~$ jps16418 Jps15575 DataNode16216 NodeManager
When I open the web console, I only see one node running: http://screencast.com/t/E6yehRvUbt
Where are the other two nodes? Why don't they show up?
Next I run one of the example scripts
hadoop@hadoop-master:~$ hadoop jar /usr/local/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.1.jar
pi 10 30Number of Maps  = 10Samples per Map = 30Wrote input for Map #0
Wrote input for Map #1...Job Finished in 2.956 secondsEstimated value of Pi is 3.14146666666666666667
I can't see this anywhere in the web interface. I thought it might show in the Applications
sub-menu. Should I be able to see this? It appears to run successfully.

View raw message