hama-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Luis Eduardo Pineda Morales <luis.pineda...@gmail.com>
Subject Re: Hama help (how the distributed mode is working)
Date Thu, 15 Sep 2011 15:42:49 GMT
Hi all,

I am attempting to run the distributed mode. I have HDFS running in a single machine (pseudo-distributed

pineda@server00:~/hadoop$ jps
472 SecondaryNameNode
1429 Jps
32733 NameNode
364 DataNode

pineda@net-server00:~/hadoop$ lsof -i
java      364 pineda   46u  IPv6 2532945       TCP *:41462 (LISTEN)
java      364 pineda   52u  IPv6 2533275       TCP server00:42445->server00:54310 (ESTABLISHED)
java      364 pineda   60u  IPv6 2533307       TCP *:50010 (LISTEN)
java      364 pineda   61u  IPv6 2533511       TCP *:50075 (LISTEN)
java      364 pineda   66u  IPv6 2533518       TCP *:50020 (LISTEN)
java      472 pineda   46u  IPv6 2533286       TCP *:43098 (LISTEN)
java      472 pineda   59u  IPv6 2533536       TCP *:50090 (LISTEN)
java    32733 pineda   46u  IPv6 2532751       TCP *:54763 (LISTEN)
java    32733 pineda   56u  IPv6 2533062       TCP server00:54310 (LISTEN)
java    32733 pineda   67u  IPv6 2533081       TCP *:50070 (LISTEN)
java    32733 pineda   76u  IPv6 2533276       TCP server00:54310->server00:42445 (ESTABLISHED)

i.e.    fs.defaul.name  =  hdfs://server00:54310/

then I run hama in server04 (groom in server03, zookeeper in server05):

pineda@server04:~/hama$ bin/start-bspd.sh 
server05: starting zookeeper, logging to /logs/hama-pineda-zookeeper-server05.out
starting bspmaster, logging to /logs/hama-pineda-bspmaster-server04.out
2011-09-15 17:08:43.349:INFO::Logging to STDERR via org.mortbay.log.StdErrLog
2011-09-15 17:08:43.409:INFO::jetty-0.3.0-incubating
server03: starting groom, logging to /logs/hama-pineda-groom-server03.out

this is my hama-site.xml file:



In theory I can connect to the HDFS, because I don't get any ConnectException, but Hama doesn't
run, and I get this Exception trace in my bspmaster.log after the Jetty is bound:

2011-09-15 17:08:43,409 INFO org.apache.hama.http.HttpServer: Jetty bound to port 40013
2011-09-15 17:08:44,070 INFO org.apache.hama.bsp.BSPMaster: problem cleaning system directory:
java.io.IOException: Call to server00/ failed on local exception: java.io.EOFException
        at org.apache.hadoop.ipc.Client.wrapException(Client.java:775)
        at org.apache.hadoop.ipc.Client.call(Client.java:743)
        at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
        at $Proxy4.getProtocolVersion(Unknown Source)
        at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359)
        at org.apache.hadoop.hdfs.DFSClient.createRPCNamenode(DFSClient.java:106)
        at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:207)
        at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:170)
        at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:82)
        at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:1378)
        at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:66)
        at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:1390)
        at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:196)
        at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:95)
        at org.apache.hama.bsp.BSPMaster.<init>(BSPMaster.java:263)
        at org.apache.hama.bsp.BSPMaster.startMaster(BSPMaster.java:421)
        at org.apache.hama.bsp.BSPMaster.startMaster(BSPMaster.java:415)
        at org.apache.hama.BSPMasterRunner.run(BSPMasterRunner.java:46)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
        at org.apache.hama.BSPMasterRunner.main(BSPMasterRunner.java:56)
Caused by: java.io.EOFException
        at java.io.DataInputStream.readInt(DataInputStream.java:375)
        at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:501)
        at org.apache.hadoop.ipc.Client$Connection.run(Client.java:446)

Do you know how to fix this? Do you know what is the directory that it is trying to clean?

Any idea is welcomed!

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message