giraph-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sardeshmukh, Vivek" <vivek-sardeshm...@uiowa.edu>
Subject RE: giraph 1.1.0 Execution Error
Date Fri, 08 Aug 2014 05:34:11 GMT
Hi Xenia, 

I think there is some problem with Zookeeper. Can you make sure that Zookeeper server is running.
If it is running then is it on port 22181? (because your Giraph job is trying to connect on
this port). If Zookeeper is running on some different port then try running your Giraph job
with -Dgiraph.zkList=<zookeper server ip>:<zookeeper port> 

I'm not sure whether you have to start an instance of zookeeper separately or Giraph will
start one for you, I have a separate instance running on my cluster and I specify the server
and port via -Dgiraph.zkList option. 

I hope that works. 

Vivek

________________________________________
From: xeniad20 <xeniad20@gmail.com>
Sent: Thursday, August 7, 2014 3:46 PM
To: user@giraph.apache.org
Subject: giraph 1.1.0 Execution Error

Hi experts,

I try to execute Giraph 1.1.0 on a small cluster but I have the
following Errors:

2014-08-07 23:35:46,141 INFO org.apache.zookeeper.ClientCnxn: Opening
socket connection to server DataNode2/10.190.12.33:22181. Will not
attempt to authenticate using SASL (unknown error)
2014-08-07 23:35:46,142 WARN org.apache.zookeeper.ClientCnxn: Session
0x147b22ebf420001 for server null, unexpected error, closing socket
connection and attempting reconnect
java.net.ConnectException: Connection refused
     at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
     at
sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:708)
     at
org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
     at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
2014-08-07 23:35:46,243 WARN org.apache.giraph.zk.ZooKeeperExt:
deleteExt: Connection loss on attempt 2, waiting 5000 msecs before retrying.
org.apache.zookeeper.KeeperException$ConnectionLossException:
KeeperErrorCode = ConnectionLoss for
/_hadoopBsp/job_201408072332_0003/_applicationAttemptsDir/0/_superstepDir/1/_workerHealthyDir/datanode1_1
     at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
     at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
     at org.apache.zookeeper.ZooKeeper.delete(ZooKeeper.java:873)
     at org.apache.giraph.zk.ZooKeeperExt.deleteExt(ZooKeeperExt.java:302)
     at
org.apache.giraph.worker.BspServiceWorker.unregisterHealth(BspServiceWorker.java:768)
     at
org.apache.giraph.worker.BspServiceWorker.failureCleanup(BspServiceWorker.java:782)
     at
org.apache.giraph.graph.GraphTaskManager.workerFailureCleanup(GraphTaskManager.java:900)
     at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:100)
     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:364)
     at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
     at java.security.AccessController.doPrivileged(Native Method)
     at javax.security.auth.Subject.doAs(Subject.java:415)
     at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1190)
     at org.apache.hadoop.mapred.Child.main(Child.java:249)
2014-08-07 23:35:48,126 INFO org.apache.zookeeper.ClientCnxn: Opening
socket connection to server DataNode2/10.190.12.33:22181. Will not
attempt to authenticate using SASL (unknown error)
2014-08-07 23:35:48,127 WARN org.apache.zookeeper.ClientCnxn: Session
0x147b22ebf420001 for server null, unexpected error, closing socket
connection and attempting reconnect
java.net.ConnectException: Connection refused
     at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
     at
sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:708)
     at
org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
     at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
2014-08-07 23:35:49,368 FATAL org.apache.giraph.graph.GraphMapper:
uncaughtException: OverrideExceptionHandler on thread Thread-12, msg =
createExt: Failed to create
/_hadoopBsp/job_201408072332_0003/_workerProgresses/1 after 3 tries!,
exiting...
java.lang.IllegalStateException: createExt: Failed to create
/_hadoopBsp/job_201408072332_0003/_workerProgresses/1 after 3 tries!
     at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:182)
     at
org.apache.giraph.zk.ZooKeeperExt.createOrSetExt(ZooKeeperExt.java:247)
     at
org.apache.giraph.worker.WorkerProgress.writeToZnode(WorkerProgress.java:110)
     at
org.apache.giraph.worker.WorkerProgressWriter$1.run(WorkerProgressWriter.java:59)
     at java.lang.Thread.run(Thread.java:724)

However Giraph 1.0.0 version run without any problems.
What might be the solution for the above errors?

Any help is appreciated.

Thanks
Xenia

Mime
View raw message