giraph-user mailing list archives

From Xenia Demetriou <xenia...@gmail.com>
Subject Re: giraph 1.1.0 Execution Error
Date Fri, 08 Aug 2014 18:51:44 GMT
Hi Vivek,

How can I find out which port ZooKeeper is running on in my cluster?
Giraph starts ZooKeeper for me.
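
For reference, one way to check a candidate host and port is to probe it directly. The sketch below is only an illustration, not something Giraph provides: the function name probe_zookeeper is made up here, and it assumes the server accepts ZooKeeper's four-letter "ruok" command (newer ZooKeeper releases may require it to be whitelisted). A healthy server should answer "imok"; a refused connection, as in the quoted log for 10.190.12.33:22181, gives None.

```python
import socket


def probe_zookeeper(host, port, timeout=3.0):
    """Send ZooKeeper's 'ruok' command to host:port.

    Returns the reply as a string ('imok' from a healthy server),
    or None if the TCP connection cannot be established.
    The function name is hypothetical, chosen for this sketch.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.sendall(b"ruok")
            chunks = []
            while True:
                try:
                    data = sock.recv(1024)
                except socket.timeout:
                    # Server kept the connection open; use what we have.
                    break
                if not data:
                    # Server closed the connection after replying.
                    break
                chunks.append(data)
            return b"".join(chunks).decode("ascii", errors="replace")
    except OSError:
        # Covers "Connection refused", unreachable hosts and connect timeouts.
        return None


# Example (host and port taken from the quoted log; adjust for your cluster):
#   print(probe_zookeeper("10.190.12.33", 22181))
```

If the probe answers on some host and port, that pair is what would go into -Dgiraph.zkList=<zookeeper server ip>:<zookeeper port>.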

Thanks


2014-08-08 8:34 GMT+03:00 Sardeshmukh, Vivek <vivek-sardeshmukh@uiowa.edu>:

> Hi Xenia,
>
> I think there is a problem with ZooKeeper. Can you make sure that the
> ZooKeeper server is running? If it is, is it listening on port 22181?
> (Your Giraph job is trying to connect on this port.) If ZooKeeper is
> running on a different port, try running your Giraph job with
> -Dgiraph.zkList=<zookeeper server ip>:<zookeeper port>
>
> I'm not sure whether you have to start a ZooKeeper instance separately
> or whether Giraph will start one for you. I have a separate instance
> running on my cluster, and I specify the server and port via the
> -Dgiraph.zkList option.
>
> I hope that works.
>
> Vivek
>
> ________________________________________
> From: xeniad20 <xeniad20@gmail.com>
> Sent: Thursday, August 7, 2014 3:46 PM
> To: user@giraph.apache.org
> Subject: giraph 1.1.0 Execution Error
>
> Hi experts,
>
> I am trying to run Giraph 1.1.0 on a small cluster, but I get the
> following errors:
>
> 2014-08-07 23:35:46,141 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server DataNode2/10.190.12.33:22181. Will not attempt to authenticate using SASL (unknown error)
> 2014-08-07 23:35:46,142 WARN org.apache.zookeeper.ClientCnxn: Session 0x147b22ebf420001 for server null, unexpected error, closing socket connection and attempting reconnect
> java.net.ConnectException: Connection refused
>      at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
>      at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:708)
>      at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
>      at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
> 2014-08-07 23:35:46,243 WARN org.apache.giraph.zk.ZooKeeperExt: deleteExt: Connection loss on attempt 2, waiting 5000 msecs before retrying.
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /_hadoopBsp/job_201408072332_0003/_applicationAttemptsDir/0/_superstepDir/1/_workerHealthyDir/datanode1_1
>      at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>      at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>      at org.apache.zookeeper.ZooKeeper.delete(ZooKeeper.java:873)
>      at org.apache.giraph.zk.ZooKeeperExt.deleteExt(ZooKeeperExt.java:302)
>      at org.apache.giraph.worker.BspServiceWorker.unregisterHealth(BspServiceWorker.java:768)
>      at org.apache.giraph.worker.BspServiceWorker.failureCleanup(BspServiceWorker.java:782)
>      at org.apache.giraph.graph.GraphTaskManager.workerFailureCleanup(GraphTaskManager.java:900)
>      at org.apache.giraph.graph.GraphMapper.run(GraphMapper.java:100)
>      at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
>      at org.apache.hadoop.mapred.MapTask.run(MapTask.java:364)
>      at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
>      at java.security.AccessController.doPrivileged(Native Method)
>      at javax.security.auth.Subject.doAs(Subject.java:415)
>      at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1190)
>      at org.apache.hadoop.mapred.Child.main(Child.java:249)
> 2014-08-07 23:35:48,126 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection to server DataNode2/10.190.12.33:22181. Will not attempt to authenticate using SASL (unknown error)
> 2014-08-07 23:35:48,127 WARN org.apache.zookeeper.ClientCnxn: Session 0x147b22ebf420001 for server null, unexpected error, closing socket connection and attempting reconnect
> java.net.ConnectException: Connection refused
>      at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
>      at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:708)
>      at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
>      at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
> 2014-08-07 23:35:49,368 FATAL org.apache.giraph.graph.GraphMapper: uncaughtException: OverrideExceptionHandler on thread Thread-12, msg = createExt: Failed to create /_hadoopBsp/job_201408072332_0003/_workerProgresses/1 after 3 tries!, exiting...
> java.lang.IllegalStateException: createExt: Failed to create /_hadoopBsp/job_201408072332_0003/_workerProgresses/1 after 3 tries!
>      at org.apache.giraph.zk.ZooKeeperExt.createExt(ZooKeeperExt.java:182)
>      at org.apache.giraph.zk.ZooKeeperExt.createOrSetExt(ZooKeeperExt.java:247)
>      at org.apache.giraph.worker.WorkerProgress.writeToZnode(WorkerProgress.java:110)
>      at org.apache.giraph.worker.WorkerProgressWriter$1.run(WorkerProgressWriter.java:59)
>      at java.lang.Thread.run(Thread.java:724)
>
> However, Giraph 1.0.0 runs without any problems.
> What might be the solution to the above errors?
>
> Any help is appreciated.
>
> Thanks
> Xenia
>
