giraph-dev mailing list archives

From "Avery Ching (JIRA)" <>
Subject [jira] [Created] (GIRAPH-381) Ensure we get the original exception from GraphMapper#run()
Date Thu, 18 Oct 2012 22:12:04 GMT
Avery Ching created GIRAPH-381:

             Summary: Ensure we get the original exception from GraphMapper#run()
                 Key: GIRAPH-381
             Project: Giraph
          Issue Type: Improvement
            Reporter: Avery Ching
            Assignee: Avery Ching

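We can lose the original exception if failureCleanup() itself fails. In the log below, the only
error reported is the IllegalStateException thrown by unregisterHealth() during failureCleanup();
the exception that triggered the cleanup does not appear.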


INFO    2012-10-18 14:23:25,417 [main] org.apache.giraph.graph.WorkerAggregatorHandler  -
marshalAggregatorValues: Finished assembling aggregator values
INFO    2012-10-18 14:23:25,451 [main-SendThread(] org.apache.zookeeper.ClientCnxn
 - Unable to read additional data from server sessionid 0x13a75baca440014, likely server has
closed socket, closing socket connection and attempting reconnect
ERROR   2012-10-18 14:23:25,552 [main] org.apache.giraph.graph.BspServiceWorker  - unregisterHealth:
Got failure, unregistering health on /_hadoopBsp/job_201209271814.8652_0001/_applicationAttemptsDir/0/_superstepDir/1/_workerHealthyDir/xxx.machine.xxx_9 on superstep 1
WARN    2012-10-18 14:23:25,554 [main-EventThread] org.apache.giraph.graph.BspService  - process:
Disconnected from ZooKeeper (will automatically try to recover) WatchedEvent state:Disconnected
type:None path:null
INFO    2012-10-18 14:23:26,916 [main-SendThread(] org.apache.zookeeper.ClientCnxn
 - Opening socket connection to server
INFO    2012-10-18 14:23:26,917 [main-SendThread(] org.apache.zookeeper.ClientCnxn
 - Socket connection established to, initiating session
WARN    2012-10-18 14:23:26,977 [main-SendThread(] org.apache.zookeeper.ClientCnxn
 - Session 0x13a75baca440014 for server, unexpected error,
closing socket connection and attempting reconnect Connection reset by peer
at Method)
at org.apache.zookeeper.ClientCnxn$SendThread.doIO(
at org.apache.zookeeper.ClientCnxn$
WARN    2012-10-18 14:23:27,082 [main] org.apache.hadoop.mapred.Child  - Error running child
java.lang.IllegalStateException: unregisterHealth: KeeperException - Couldn't delete /_hadoopBsp/job_201209271814.8652_0001/_applicationAttemptsDir/0/_superstepDir/1/_workerHealthyDir/xxx.machine.xxx_9
at org.apache.giraph.graph.BspServiceWorker.unregisterHealth(
at org.apache.giraph.graph.BspServiceWorker.failureCleanup(
at org.apache.hadoop.mapred.MapTask.runNewMapper(
at org.apache.hadoop.mapred.Child.main(
Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode =
ConnectionLoss for /_hadoopBsp/job_201209271814.8652_0001/_applicationAttemptsDir/0/_superstepDir/1/_workerHealthyDir/xxx.machine.xxx_9
at org.apache.zookeeper.KeeperException.create(
at org.apache.zookeeper.KeeperException.create(
at org.apache.zookeeper.ZooKeeper.delete(
at org.apache.giraph.graph.BspServiceWorker.unregisterHealth(
... 5 more
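
One possible way to keep the original exception, sketched here in plain Java rather than taken from the Giraph code: catch the failure from the main work, attempt the cleanup inside its own try block, and attach any cleanup failure to the original exception before rethrowing it. The class name PreserveOriginalException, the Step interface, and runWithCleanup() are hypothetical names used only for illustration. Throwable#addSuppressed requires Java 7 or later, and the lambdas in main() require Java 8; whether either is available to the project is an assumption.

```java
public class PreserveOriginalException {

    /** Hypothetical stand-in for a step that may throw, such as a cleanup routine. */
    interface Step {
        void run() throws Exception;
    }

    /**
     * Runs the work; if it fails, runs the cleanup and rethrows the ORIGINAL
     * exception. A failure inside the cleanup is attached as a suppressed
     * exception instead of replacing the original one.
     */
    static void runWithCleanup(Step work, Step cleanup) throws Exception {
        try {
            work.run();
        } catch (Exception original) {
            try {
                cleanup.run();
            } catch (Exception cleanupFailure) {
                // Keep the cleanup failure visible without hiding the root cause.
                original.addSuppressed(cleanupFailure);
            }
            throw original;
        }
    }

    public static void main(String[] args) {
        try {
            runWithCleanup(
                () -> { throw new java.io.IOException("original failure in run()"); },
                () -> { throw new IllegalStateException("cleanup failed"); });
        } catch (Exception e) {
            // The reported exception is the original one.
            System.out.println("Reported: " + e.getMessage());
            for (Throwable t : e.getSuppressed()) {
                System.out.println("Suppressed: " + t.getMessage());
            }
        }
    }
}
```

With this shape, the stack trace that reaches the caller starts with the root cause, and the cleanup failure is still printed under it as a suppressed exception. On a runtime without addSuppressed, logging the original exception before calling the cleanup would serve a similar purpose.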
