hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Anfernee Xu <anfernee...@gmail.com>
Subject Writable RPC had a lot of leftover TCP connections in CLOSE_WAIT after RPC_TIMEOUT is enabled
Date Tue, 10 Jun 2014 17:42:45 GMT

I'm using hadoop-2.2.0 and take advantage of Hadoop WritableRpcEngine to
build my distributed application, and I have 'heartbeat' interface in my
application to check availability periodically, in order to detect any
potential failure, I enabled "rpc_timeout" when creating the proxy as below

 int rpcTimeout=1000;// 1 second as rpc timeout

          MyApplicationInterface.class, MyApplicationInterface.versionID,
          socAddr, conf, rpcTimeout, timeout);

Everything went fine initially, I can see failures can be detected by the
heartbeat, but after a period of time(2 days or so), I saw a lot of TCP
connections in CLOSE_WAIT state on server side, and client was not able to
connect to it again.

Any clue about this?



View raw message