giraph-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mike Chu <mikechum...@gmail.com>
Subject Connection reset by peer
Date Thu, 10 Apr 2014 21:58:58 GMT
Hi

We have a Giraph program which works fine if the graph is small. However, it complains about
“Connection reset by peer” when the graph is big.
We added a log statement right before the sendMessageToAllEdges method to capture the message
size and the number of messages being send. We noticed the exception is raised whenever a
vertex attempt to send large messages to lots of neighbor. For example, the log below indicates
that the 64-edges-vertex is fine but the 3853-edges-vertex is causing exception. We cannot
see any other error from the log. Seems like it is related to Netty communication. May be
the buffer size? Any advise is highly appreciated.

2014-04-10 09:55:36,070 INFO [compute-0] com.neimanmarcus.api.matching.giraph.cc.CCVertex3:
!!!*** omx136986717936641d8a715150240372f9ef8da1a306 has 64 neighbors
2014-04-10 09:55:36,129 INFO [compute-0] com.neimanmarcus.api.matching.giraph.cc.CCVertex3:
!!!*** omx135795296165908348809827498ffc192bedd9f136 has 3853 neighbors
2014-04-10 09:55:58,161 INFO [netty-server-exec-0] org.apache.giraph.comm.netty.handler.RequestDecoder:
decode: Server window metrics MBytes/sec sent = 0.0004, MBytes/sec received = 12.7831, MBytesSent
= 
0.0134, MBytesReceived = 383.7369, ave sent req MBytes = 0, ave received req MBytes = 0.0481,
secs waited = 30.018
2014-04-10 09:56:08,806 WARN [netty-server-exec-5] org.apache.giraph.comm.netty.handler.RequestServerHandler:
exceptionCaught: Channel failed with remote address /10.241.17.33:43398
java.io.IOException: Connection reset by peer
	at sun.nio.ch.FileDispatcher.write0(Native Method)
	at sun.nio.ch.SocketDispatcher.write(SocketDispatcher.java:29)
	at sun.nio.ch.IOUtil.writeFromNativeBuffer(IOUtil.java:69)
	at sun.nio.ch.IOUtil.write(IOUtil.java:26)
	at sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:334)
	at org.jboss.netty.channel.socket.nio.SocketSendBufferPool$UnpooledSendBuffer.transferTo(SocketSendBufferPool.java:198)
	at org.jboss.netty.channel.socket.nio.AbstractNioWorker.write0(AbstractNioWorker.java:468)
	at org.jboss.netty.channel.socket.nio.AbstractNioWorker.writeFromTaskLoop(AbstractNioWorker.java:423)
	at org.jboss.netty.channel.socket.nio.AbstractNioChannel$WriteTask.run(AbstractNioChannel.java:364)
	at org.jboss.netty.channel.socket.nio.AbstractNioWorker.processWriteTaskQueue(AbstractNioWorker.java:341)
	at org.jboss.netty.channel.socket.nio.AbstractNioWorker.run(AbstractNioWorker.java:237)
	at org.jboss.netty.channel.socket.nio.NioWorker.run(NioWorker.java:38)
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
	at java.lang.Thread.run(Thread.java:662)
2014-04-10 09:56:09,040 WARN [netty-server-exec-6] org.apache.giraph.comm.netty.handler.RequestServerHandler:
exceptionCaught: Channel failed with remote address /10.241.17.33:43398
java.nio.channels.ClosedChannelException
	at org.jboss.netty.channel.socket.nio.AbstractNioWorker.cleanUpWriteBuffer(AbstractNioWorker.java:673)
	at org.jboss.netty.channel.socket.nio.AbstractNioWorker.writeFromUserCode(AbstractNioWorker.java:400)
	at org.jboss.netty.channel.socket.nio.NioServerSocketPipelineSink.handleAcceptedSocket(NioServerSocketPipelineSink.java:120)
text: Unable to write to output stream.
Mime
View raw message